turso

mirror of https://github.com/aljazceru/turso.git synced 2026-02-23 17:05:36 +01:00

Author	SHA1	Message	Date
Nikita Sivukhin	27fcb81f48	add more complex schema changes test for raw WAL API	2025-07-24 22:43:31 +04:00
Jussi Saurio	0b627ed331	Merge 'btree/balance: support case where immediate parent page of unbalanced child page also overflows' from Jussi Saurio Closes #2241 ## What When an index interior cell is deleted, it steals the leaf cell with the largest key in its left subtree, deletes the old interior cell and then replaces it with the stolen cell. This ensures the binary-search-tree aspect of the btree remains correct. However, this can cause a situation where both are true: 1. The leaf page is now UNDERFULL and must be rebalanced 2. The leaf's IMMEDIATE parent page is now OVERFULL and must be rebalanced ## Why is this a problem We simply didn't support the case where: - Leaf page P is unbalanced and rebalancing starts on it - Its immediate parent is ALSO unbalanced and _overflows_. We had an assertion against this happening (see #2241) ## The fix Allow exactly 1 overflow cell in the parent under very particular conditions: 1. The parent page must be an index interior page 2. The parent must be positioned exactly at the divider cell whose left child page underflows This is the _only_ case where the immediate parent of a page about to undergo rebalancing can have overflow cells. ## Implementation details The parent overflow cell is folded into `cell_array` fairly early on and `parent.overflow_cells` is cleared. However we need to be careful with `cell_idx` for dividers other than the overflow cell because they get shifted left on the page in `drop_cell()`. I've added a long comment about this. ## Testing Adds fuzz test that does inserts and deletes on an index btree and asserts that all the expected keys are found at the end in the right order. This test runs into this case quite frequently so I was able to verify it. Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2243	2025-07-24 18:48:36 +03:00
Pere Diaz Bou	46f5609fce	Merge 'Append WAL frames one by one' from Pere Diaz Bou Let's make sure we don't end up in a weird situation by appending frames one by one and we can later think of optimizations. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2034	2025-07-24 16:44:51 +02:00
Pekka Enberg	0f8e22a989	Merge 'pager: Clear stale page cache if database changed' from Jussi Saurio SQLite behavior is: if another connection has modified the DB when a read tx starts, the connection beginning the transaction must clear its page cache due to the potentiality of there being stale versions of pages in it. Evidence of this here: https://github.com/sqlite/sqlite/blob/ded1959/src/pager.c#L3258-L3260 and here: https://github.com/sqlite/sqlite/blob/master/src/wal.c#L3368-L3370 In the future, we may want to do either: 1. a more granular invalidation logic for per-conn cache, or 2. a shared versioned page cache But right now we must follow SQLite to make our current behavior not corrupt data Closes #2248 Closes #2257	2025-07-24 17:22:10 +03:00
Pere Diaz Bou	ce598b772e	clippy i hate you so much	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	b07e57d9d1	review fixes	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	75f9c23ed3	end txn on vdbe failures	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	674d88e140	do not clear dirty pages on cacheflush::start	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	d77c899fa6	clippy	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	5a1773edf1	clippy	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	14de7c55af	set connection state to None in vdbe rollback	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	c397588ad6	change connection state after finding error on I/O	2025-07-24 15:29:06 +02:00
Pere Diaz Bou	5f8e386b48	reset internal states on rollback	2025-07-24 15:29:06 +02:00
Jussi Saurio	37955e9a04	Pager/WAL: fix not clearing stale page cache SQLite behavior is: if another connection has modified the DB when a read tx starts, it must clear its page cache due to the potentiality of there being stale versions of pages in it. In the future, we may want to do either: 1. a more granular invalidation logic for per-conn cache, or 2. a shared versioned page cache But right now we must follow SQLite to make our current behavior not corrupt data	2025-07-24 16:23:12 +03:00
Pere Diaz Bou	066ffcc940	append frame one by one Let's make sure we don't end up in a weird situation by appending frames one by one and we can later think of optimizations.	2025-07-24 15:12:13 +02:00
Pekka Enberg	e7d7c21baf	Merge 'sim/aws: ignore child process exits with code 137' from Jussi Saurio we run the simulator on aws every night for 4 hours, but for the past few days it's been exiting with 137 and the parent sim-loop process hasn't handled this case and also exited. i suspect the child process is being killed due to memory consumption or something, but hard to be sure. either way, let's not end the sim loop when this happens Closes #2255	2025-07-24 15:16:23 +03:00
Jussi Saurio	60f64aed77	sim/aws: ignore child process exits with code 137 i suspect the child process is being killed due to memory consumption or something, but hard to be sure. either way, let's not stop the sim-loop parent process when this happens.	2025-07-24 14:57:52 +03:00
Pekka Enberg	648ad3e74d	Merge 'sqlite3: Improve SQLite error handling and fix C-string safety' from Ceferino Patino This PR addresses two related issues in the SQLite3 C API implementation: 1. Properly sets error codes on the database object when handling SQL preparation failures 2. Fixes C string handling in sqlite3_errstr_impl to ensure proper null- termination Error codes should be correctly set on the database object, making them accessible to API consumers. Also fixes and issue where C strings weren't properly null-terminated in error reporting functions, which causes the issue in #2197 where messages all get pushed onto a single line. Its possible that this also fixes a buffer overflow, but I'm not actually sure what the behavior was. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2212	2025-07-24 14:40:37 +03:00
Pekka Enberg	62f5a42008	Merge 'WAL insert API: force schema re-parse if necessary after WAL sync session end' from Nikita Sivukhin This PR partially fixes issue when schema changes were invisible after WAL sync calls. Now, `wal_insert_end` always read fresh schema cookie and re-parse schema from scratch if cookie changed. Generally, the problem of "silent" schema update can be more generic if(when?) `turso-db` will support multi-process setup. But for now only single-process can work with `turso-db`, so I decided to inject re-parse logic explicitly in WAL raw API in order to not introduce any unnecessary overhead in the ordinary execution path. This fix is not complete, as if we will have already prepared statements - they should be re-prepared too in case of schema changes. But this problem already tracked in the PR https://github.com/tursodatabase/turso/pull/2214 Reviewed-by: Pedro Muniz (@pedrocarlo) Closes #2246	2025-07-24 14:39:46 +03:00
Pekka Enberg	2e0161a1d5	serverless: 0.1.1	2025-07-24 14:29:12 +03:00
Pekka Enberg	ea12fb4011	Merge 'serverless: Add Statement.run() method' from Pekka Enberg Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2254	2025-07-24 14:27:28 +03:00
Pekka Enberg	099dba4ba9	serverless: Add Statement.run() method	2025-07-24 13:50:09 +03:00
Jussi Saurio	d1b1617231	btree: add index insert-delete fuzz test	2025-07-24 13:18:33 +03:00
Jussi Saurio	d773a7924d	fix/btree/balance: allow exactly 1 parent overflow cell for index balancing	2025-07-24 13:18:33 +03:00
Pekka Enberg	c58511b71c	antithesis: Update Docker image build to use Rust 1.88	2025-07-24 13:05:00 +03:00
Jussi Saurio	025ea8808a	Merge 'WAL insert: mark pages as dirty' from Nikita Sivukhin WAL insert API introduced in the #2231 works incorrectly as it never mark inserted pages as dirty. This PR fixes this issue and also add simple fuzz test which fails without fixes. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2245	2025-07-24 12:58:01 +03:00
Jussi Saurio	9a08c57b58	Merge 'make add dirty to change flag and also add page to the dirty list' from Nikita Sivukhin Make `add_dirty` helper to set flag and add page to the dirt list. This makes API safer as now its harder to do one thing and forget about another (which can lead to DB corruption). Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2244	2025-07-24 12:22:33 +03:00
Nikita Sivukhin	10836510df	remove tracing_subscriber	2025-07-24 11:52:07 +04:00
Nikita Sivukhin	6daa6d07f1	re-parse schema if necessary after WAL sync end	2025-07-24 11:52:07 +04:00
Nikita Sivukhin	edd6ef2d21	fix after rebase	2025-07-24 11:51:33 +04:00
Nikita Sivukhin	3d2a38eb88	add simple helper	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	fb83862013	fix clippy	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	4a80306705	fix wal insert frame raw API - we need to properly mark pages as dirty after insertion	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	435ca7fe7a	add fuzz tests for raw WAL API	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	d618463906	simplify add_dirty API	2025-07-24 11:29:01 +04:00
Jussi Saurio	2d3c9001ee	Merge 'emit SetCookie after DropTable' from Glauber Costa The SetCookie opcode is used, among other things, to notify the transaction of schema changes. We are not issuing it on DropTable. Without it, the transaction thinks the schema hasn't changed, and does not update the schema of the connection back to the database. SQLite will, of course, issue it: 35 DropTable 0 0 0 foo 0 36 SetCookie 0 1 2 0 Unfortunately I don't have a unit test that breaks with this, because the one that is supposed to break is having, let's put it this way, bigger problems. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2249	2025-07-24 10:12:16 +03:00
Jussi Saurio	92a10f94d8	Merge 'Bail early for read-only virtual tables' from Preston Thorpe This PR adds a const associated value on the VTabModule trait, `READONLY` defaulted to `true`, so we can bail early when a write operation is done on an invalid vtable. This prevents extensions from having to implement `insert`,`update`, `delete` just to return `Error::ReadOnly`, and prevents us from having to step through `VUpdate` just to error out. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2247	2025-07-24 10:12:07 +03:00
Jussi Saurio	9b5ef02cfb	Merge 'measure only the time it takes to open the actual connection' from Glauber Costa The current code includes creating the database object, which is slow. Unfortunately the same cannot be done on the standard SQLite. Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2242	2025-07-24 10:09:00 +03:00
Jussi Saurio	dd30729f5e	Merge 'Explicit rowid insert' from Nikita Sivukhin This PR adds support for `INSERT` queries with explicit value for `rowid` column (not thought rowid alias): ``` turso> create table t(x, y, z); turso> insert into t(rowid, x, y, z) values (10, 1, 2, 3); turso> select rowid, * from t; ┌───────┬───┬───┬───┐ │ rowid │ x │ y │ z │ ├───────┼───┼───┼───┤ │ 10 │ 1 │ 2 │ 3 │ └───────┴───┴───┴───┘ ``` Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2239	2025-07-24 10:08:42 +03:00
Jussi Saurio	49b2bf4fdb	Merge 'Deserialize keys only once when sorting immutable records' from Iaroslav Zeigerman Before this update, the entire immutable record was fully deserialized every time it was compared in the sorter. This PR extends the sorter with incremental deserialization of record keys, only when needed and only if they weren’t already deserialized in a previous iteration. I hate that we panic on failed deserialization in `cmp`, but unfortunately, I can’t return `Result` as part of this interface. Looking for feedback around a better way to handle this. Alternatively, I could store the deserialization error as part of `SortableImmutableRecord` and check it before returning the record in `next`, thereby deferring the error handling. The downside of this approach is that it complicates debugging, since the error will be completely decoupled from the place where it occurs. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2207	2025-07-24 10:08:16 +03:00
Jussi Saurio	52b4c22be9	Merge 'fix: SUM returns correct float for mixed numeric/non-numeric types & return value on empty set' from Axel Tobieson Rova # Fix SUM aggregate function for mixed types Fixes #2133 The SUM aggregate function was returning incorrect results when processing tables with mixed numeric and non-numeric values. According to SQLite documentation: > "If any input to sum() is neither an integer nor a NULL, then sum() returns a floating point value" [*](https://sqlite.org/lang_aggfunc.html) Now both SQLite and Turso yield the same output of 44.0. -- I modified `Sum` to increment only for numeric values, skipping non- numeric values. However, if we have mixed numeric values or non-numeric values, we return a float output. Added a flag to keep track of it. as pointed out by @FHaggs , If there are no non-NULL input rows then sum() returns NULL but total() returns 0.0. I decided to include it in this PR as well. Empty was such a natural test case. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2182	2025-07-24 10:08:01 +03:00
Glauber Costa	2a2468026c	emit SetCookie after DropTable The SetCookie opcode is used, among other things, to notify the transaction of schema changes. We are not issuing it on DropTable. Without it, the transaction thinks the schema hasn't changed, and does not update the schema of the connection back to the database. SQLite will, of course, issue it: 35 DropTable 0 0 0 foo 0 36 SetCookie 0 1 2 0 Unfortunately I don't have a unit test that breaks with this, because the one that is supposed to break is having, let's put it this way, bigger problems.	2025-07-23 19:34:41 -05:00
PThorpe92	3358e85889	Update py tests for new error msg	2025-07-23 17:05:46 -04:00
PThorpe92	d7c3256a5a	Update vtab derive macro to pass readonly const to module impl	2025-07-23 16:59:18 -04:00
PThorpe92	b68539fc45	Make the ReadOnly error more generic	2025-07-23 16:58:22 -04:00
PThorpe92	0871a8c7f3	Bail early when we detect a readonly virtual table	2025-07-23 16:57:30 -04:00
PThorpe92	e7ce3efb3f	Remove default trait impl from csv extension	2025-07-23 16:50:57 -04:00
PThorpe92	9c3f9426c3	Add readonly method for VirtualTable to bail early	2025-07-23 16:49:42 -04:00
PThorpe92	eff455fb03	Add READONLY const property to virtual table module trait	2025-07-23 16:44:04 -04:00
Iaroslav Zeigerman	1e51d23bd6	store the key deserialization error instead of panicking	2025-07-23 11:22:01 -07:00

1 2 3 4 5 ...

6529 Commits