turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-01 15:34:19 +01:00

Author	SHA1	Message	Date
Glauber Costa	b5927dcfd5	support doubly qualified identifiers	2025-07-25 14:52:45 -05:00
Pekka Enberg	c6c0db19e9	Merge 'Fix schema reparse logic' from Nikita Sivukhin `maybe_reparse_schema` function introduced in the #2246 was incorrect as it didn't update `schema_version` for internal schema representation and basically updated only schema for connection which called `maybe_reparse_schema`. This PR fixes this issue by reading schema and cookie value within a single transaction and updating both schema content and its version for internal representation. Reviewed-by: Pedro Muniz (@pedrocarlo) Closes #2259	2025-07-25 13:24:23 +03:00
Nikita Sivukhin	020d567e78	fix clippy	2025-07-25 13:55:37 +04:00
Pekka Enberg	669b231714	Merge 'parser: Distinguish quoted identifiers and unify Id into Name enum' from bit-aloo Closes: #1947 This PR replaces the `Name(pub String)` struct with a `Name` enum that explicitly models how the name appeared in the source either as an unquoted identifier (`Ident`) or a quoted string (`Quoted`). In the process, the separate `Id` wrapper type has been coalesced into the `Name` enum, simplifying the AST and reducing duplication in identifier handling logic. While this increases the size of some AST nodes (notably `yyStackEntry`). cc: @levydsa Reviewed-by: Levy A. (@levydsa) Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2251	2025-07-25 12:08:54 +03:00
Pekka Enberg	c6d4a5c3ed	Rename DatabaseIndexer to DatabaseCatalog Avoid confusion with actual database indexes.	2025-07-25 10:36:33 +03:00
Glauber Costa	988b16f962	Support ATTACH (read only) Support for attaching databases. The main difference from SQLite is that we support an arbitrary number of attached databases, and we are not bound to just 100ish. We for now only support read-only databases. We open them as read-only, but also, to keep things simple, we don't patch any of the insert machinery to resolve foreign tables. So if an insert is tried on an attached database, it will just fail with a "no such table" error - this is perfect for now. The code in core/translate/attach.rs is written by Claude, who also played a key part in the boilerplate for stuff like the .databases command and extending the pragma database_list, and also aided me in the test cases.	2025-07-24 19:19:48 -05:00
Nikita Sivukhin	e05660133b	update schema version for internal schema represenation in maybe_reparse_schema	2025-07-24 22:43:31 +04:00
Nikita Sivukhin	8b0e5b151e	simplify parse_schema_rows signature	2025-07-24 22:43:31 +04:00
Pekka Enberg	2141293017	Merge 'Fix page_count pragma' from meteorgan Closes: #1415 ### What this PR does 1. Removes database initialization from the `read_tx` function. 2. Adds checks for database initialization when executing `.schema`, `.indexes`, `.tables` and `.import` commands, as they rely on `sqlite_schema` table. ### About the second issue I think we have another solution for the second issue: create the `sqlite_schema` table in `Schema` only during page1 initialization, rather than during `Schema` initialization. #### Pros This approach has the advantage of unifying the logic for the `sqlite_schema` table with other user tables when running `select` statements #### Cons - we still need to check error codes for commands like `.schema`. - this approach may increase the complexity of the `pager` implementation. I'd like to hear your thoughts and feedback. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2099	2025-07-24 19:21:35 +03:00
Jussi Saurio	b33527c3c4	Merge 'btree: clear overflow pages when insert overwrites a cell (= UPDATE)' from Jussi Saurio Closes #2227 , enables fixing #2225 ## What Although we cleared overflow pages on DELETE, we never did it for INSERT/UPDATE, which means any overflow pages were left dangling and not added to freelist. ## Why is this a problem This means that we are not able to reuse these pages to solve #2225, causing massive bloat in the DB when UPDATEs are executed. ## Fix Clear overflow pages when `BTreeCursor::insert()` overwrites a cell. Needed a new state machine for `overwrite_cell` + new `WriteState` variants Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2230	2025-07-24 18:59:15 +03:00
Jussi Saurio	0b627ed331	Merge 'btree/balance: support case where immediate parent page of unbalanced child page also overflows' from Jussi Saurio Closes #2241 ## What When an index interior cell is deleted, it steals the leaf cell with the largest key in its left subtree, deletes the old interior cell and then replaces it with the stolen cell. This ensures the binary-search-tree aspect of the btree remains correct. However, this can cause a situation where both are true: 1. The leaf page is now UNDERFULL and must be rebalanced 2. The leaf's IMMEDIATE parent page is now OVERFULL and must be rebalanced ## Why is this a problem We simply didn't support the case where: - Leaf page P is unbalanced and rebalancing starts on it - Its immediate parent is ALSO unbalanced and _overflows_. We had an assertion against this happening (see #2241) ## The fix Allow exactly 1 overflow cell in the parent under very particular conditions: 1. The parent page must be an index interior page 2. The parent must be positioned exactly at the divider cell whose left child page underflows This is the _only_ case where the immediate parent of a page about to undergo rebalancing can have overflow cells. ## Implementation details The parent overflow cell is folded into `cell_array` fairly early on and `parent.overflow_cells` is cleared. However we need to be careful with `cell_idx` for dividers other than the overflow cell because they get shifted left on the page in `drop_cell()`. I've added a long comment about this. ## Testing Adds fuzz test that does inserts and deletes on an index btree and asserts that all the expected keys are found at the end in the right order. This test runs into this case quite frequently so I was able to verify it. Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2243	2025-07-24 18:48:36 +03:00
Jussi Saurio	7968be9d71	btree/insert: cell can also underflow after overwrite	2025-07-24 18:43:02 +03:00
Jussi Saurio	a4535684b3	btree: WriteState: remove CheckNeedsBalancing variant	2025-07-24 18:40:49 +03:00
Jussi Saurio	b0edd3b716	btree: WriteState: add comments	2025-07-24 18:36:07 +03:00
meteorgan	c48a5ef538	we don't need read_tx return IOResult anymore	2025-07-24 23:19:33 +08:00
meteorgan	ab2bc547cd	check if datbase is initialized before display schema, index and tables	2025-07-24 23:18:29 +08:00
meteorgan	08f1803a6b	end read tx in op_transaction when write transaction return io	2025-07-24 23:18:29 +08:00
meteorgan	7ef50e0690	fix page_count pragma	2025-07-24 23:18:26 +08:00
Pere Diaz Bou	46f5609fce	Merge 'Append WAL frames one by one' from Pere Diaz Bou Let's make sure we don't end up in a weird situation by appending frames one by one and we can later think of optimizations. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2034	2025-07-24 16:44:51 +02:00
Pere Diaz Bou	ce598b772e	clippy i hate you so much	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	b07e57d9d1	review fixes	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	75f9c23ed3	end txn on vdbe failures	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	674d88e140	do not clear dirty pages on cacheflush::start	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	d77c899fa6	clippy	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	5a1773edf1	clippy	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	14de7c55af	set connection state to None in vdbe rollback	2025-07-24 15:29:21 +02:00
Pere Diaz Bou	c397588ad6	change connection state after finding error on I/O	2025-07-24 15:29:06 +02:00
Pere Diaz Bou	5f8e386b48	reset internal states on rollback	2025-07-24 15:29:06 +02:00
Jussi Saurio	37955e9a04	Pager/WAL: fix not clearing stale page cache SQLite behavior is: if another connection has modified the DB when a read tx starts, it must clear its page cache due to the potentiality of there being stale versions of pages in it. In the future, we may want to do either: 1. a more granular invalidation logic for per-conn cache, or 2. a shared versioned page cache But right now we must follow SQLite to make our current behavior not corrupt data	2025-07-24 16:23:12 +03:00
Pere Diaz Bou	066ffcc940	append frame one by one Let's make sure we don't end up in a weird situation by appending frames one by one and we can later think of optimizations.	2025-07-24 15:12:13 +02:00
Pekka Enberg	62f5a42008	Merge 'WAL insert API: force schema re-parse if necessary after WAL sync session end' from Nikita Sivukhin This PR partially fixes issue when schema changes were invisible after WAL sync calls. Now, `wal_insert_end` always read fresh schema cookie and re-parse schema from scratch if cookie changed. Generally, the problem of "silent" schema update can be more generic if(when?) `turso-db` will support multi-process setup. But for now only single-process can work with `turso-db`, so I decided to inject re-parse logic explicitly in WAL raw API in order to not introduce any unnecessary overhead in the ordinary execution path. This fix is not complete, as if we will have already prepared statements - they should be re-prepared too in case of schema changes. But this problem already tracked in the PR https://github.com/tursodatabase/turso/pull/2214 Reviewed-by: Pedro Muniz (@pedrocarlo) Closes #2246	2025-07-24 14:39:46 +03:00
Jussi Saurio	2b045ccfd8	btree: clear overflow pages when insert overwrites a cell	2025-07-24 13:44:11 +03:00
Jussi Saurio	d1b1617231	btree: add index insert-delete fuzz test	2025-07-24 13:18:33 +03:00
Jussi Saurio	d773a7924d	fix/btree/balance: allow exactly 1 parent overflow cell for index balancing	2025-07-24 13:18:33 +03:00
Jussi Saurio	025ea8808a	Merge 'WAL insert: mark pages as dirty' from Nikita Sivukhin WAL insert API introduced in the #2231 works incorrectly as it never mark inserted pages as dirty. This PR fixes this issue and also add simple fuzz test which fails without fixes. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2245	2025-07-24 12:58:01 +03:00
Jussi Saurio	9a08c57b58	Merge 'make add dirty to change flag and also add page to the dirty list' from Nikita Sivukhin Make `add_dirty` helper to set flag and add page to the dirt list. This makes API safer as now its harder to do one thing and forget about another (which can lead to DB corruption). Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2244	2025-07-24 12:22:33 +03:00
bit-aloo	3cb2db933d	remove Id	2025-07-24 14:40:24 +05:30
bit-aloo	9a54ef214e	parser: Distinguish quoted identifiers and unify Id into Name enum This commit replaces the `Name(pub String)` struct with a `Name` enum that explicitly models how the name appeared in the source either as an unquoted identifier (`Ident`) or a quoted string (`Quoted`). In the process, the separate `Id` wrapper type has been coalesced into the `Name` enum, simplifying the AST and reducing duplication in identifier handling logic. While this increases the size of some AST nodes (notably `yyStackEntry`), it improves correctness and makes source structure more explicit for later phases.	2025-07-24 14:40:19 +05:30
Nikita Sivukhin	6daa6d07f1	re-parse schema if necessary after WAL sync end	2025-07-24 11:52:07 +04:00
Nikita Sivukhin	edd6ef2d21	fix after rebase	2025-07-24 11:51:33 +04:00
Nikita Sivukhin	3d2a38eb88	add simple helper	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	4a80306705	fix wal insert frame raw API - we need to properly mark pages as dirty after insertion	2025-07-24 11:49:39 +04:00
Nikita Sivukhin	d618463906	simplify add_dirty API	2025-07-24 11:29:01 +04:00
Jussi Saurio	2d3c9001ee	Merge 'emit SetCookie after DropTable' from Glauber Costa The SetCookie opcode is used, among other things, to notify the transaction of schema changes. We are not issuing it on DropTable. Without it, the transaction thinks the schema hasn't changed, and does not update the schema of the connection back to the database. SQLite will, of course, issue it: 35 DropTable 0 0 0 foo 0 36 SetCookie 0 1 2 0 Unfortunately I don't have a unit test that breaks with this, because the one that is supposed to break is having, let's put it this way, bigger problems. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2249	2025-07-24 10:12:16 +03:00
Jussi Saurio	92a10f94d8	Merge 'Bail early for read-only virtual tables' from Preston Thorpe This PR adds a const associated value on the VTabModule trait, `READONLY` defaulted to `true`, so we can bail early when a write operation is done on an invalid vtable. This prevents extensions from having to implement `insert`,`update`, `delete` just to return `Error::ReadOnly`, and prevents us from having to step through `VUpdate` just to error out. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2247	2025-07-24 10:12:07 +03:00
Jussi Saurio	dd30729f5e	Merge 'Explicit rowid insert' from Nikita Sivukhin This PR adds support for `INSERT` queries with explicit value for `rowid` column (not thought rowid alias): ``` turso> create table t(x, y, z); turso> insert into t(rowid, x, y, z) values (10, 1, 2, 3); turso> select rowid, * from t; ┌───────┬───┬───┬───┐ │ rowid │ x │ y │ z │ ├───────┼───┼───┼───┤ │ 10 │ 1 │ 2 │ 3 │ └───────┴───┴───┴───┘ ``` Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2239	2025-07-24 10:08:42 +03:00
Jussi Saurio	49b2bf4fdb	Merge 'Deserialize keys only once when sorting immutable records' from Iaroslav Zeigerman Before this update, the entire immutable record was fully deserialized every time it was compared in the sorter. This PR extends the sorter with incremental deserialization of record keys, only when needed and only if they weren’t already deserialized in a previous iteration. I hate that we panic on failed deserialization in `cmp`, but unfortunately, I can’t return `Result` as part of this interface. Looking for feedback around a better way to handle this. Alternatively, I could store the deserialization error as part of `SortableImmutableRecord` and check it before returning the record in `next`, thereby deferring the error handling. The downside of this approach is that it complicates debugging, since the error will be completely decoupled from the place where it occurs. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2207	2025-07-24 10:08:16 +03:00
Jussi Saurio	52b4c22be9	Merge 'fix: SUM returns correct float for mixed numeric/non-numeric types & return value on empty set' from Axel Tobieson Rova # Fix SUM aggregate function for mixed types Fixes #2133 The SUM aggregate function was returning incorrect results when processing tables with mixed numeric and non-numeric values. According to SQLite documentation: > "If any input to sum() is neither an integer nor a NULL, then sum() returns a floating point value" [*](https://sqlite.org/lang_aggfunc.html) Now both SQLite and Turso yield the same output of 44.0. -- I modified `Sum` to increment only for numeric values, skipping non- numeric values. However, if we have mixed numeric values or non-numeric values, we return a float output. Added a flag to keep track of it. as pointed out by @FHaggs , If there are no non-NULL input rows then sum() returns NULL but total() returns 0.0. I decided to include it in this PR as well. Empty was such a natural test case. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2182	2025-07-24 10:08:01 +03:00
Glauber Costa	2a2468026c	emit SetCookie after DropTable The SetCookie opcode is used, among other things, to notify the transaction of schema changes. We are not issuing it on DropTable. Without it, the transaction thinks the schema hasn't changed, and does not update the schema of the connection back to the database. SQLite will, of course, issue it: 35 DropTable 0 0 0 foo 0 36 SetCookie 0 1 2 0 Unfortunately I don't have a unit test that breaks with this, because the one that is supposed to break is having, let's put it this way, bigger problems.	2025-07-23 19:34:41 -05:00
PThorpe92	b68539fc45	Make the ReadOnly error more generic	2025-07-23 16:58:22 -04:00

1 2 3 4 5 ...

3592 Commits