turso

mirror of https://github.com/aljazceru/turso.git synced 2026-02-05 00:04:23 +01:00

Author	SHA1	Message	Date
Pere Diaz Bou	86119b0dba	Merge 'core/mvcc/cursor: implement prev and last ' from Pere Diaz Bou Backward scan of a table wasn't implemented yet in MVCC so this achieves that. I added simple test for mixed btree and mvcc backward scan but I should add more intense testing for this. <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Implements backward scanning and last() in MVCC lazy cursor and adds directional rowid iteration in the MVCC store, with new tests for mixed MVCC+B-Tree backward scans. > > - MVCC Cursor (`core/mvcc/cursor.rs`): > - Implement `prev()` and `last()` with mixed MVCC/B-Tree coordination using `IterationDirection`. > - Add `PrevState` and extend state machine to handle backward iteration. > - Update `get_new_position_from_mvcc_and_btree(...)` to choose rowids based on direction. > - Integrate B-Tree cursor calls (`last`, `prev`) and adjust `rewind`/rowid selection; tweak next-rowid when at `End`. > - MVCC Store (`core/mvcc/database/mod.rs`): > - Add `get_prev_row_id_for_table(...)` and generalized `get_row_id_for_table_in_direction(...)` supporting forward/backward scans. > - Add tracing and minor refactors around next/prev rowid retrieval. > - Tests (`core/mvcc/database/tests.rs`): > - Add test for backward scan combining B-Tree and MVCC and an ignored test covering delete during backward scan. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `430bd457e6`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> Closes #3980	2025-11-20 18:41:41 +01:00
Pekka Enberg	c205f41158	Merge 'core: Switch to parking_lot::Mutex' from Pekka Enberg It's faster and we eliminate bunch of unwrap() calls. Closes #3993	2025-11-20 11:17:12 +02:00
Pekka Enberg	d808db6af9	core: Switch to parking_lot::Mutex It's faster and we eliminate bunch of unwrap() calls.	2025-11-20 10:42:02 +02:00
Pere Diaz Bou	430bd457e6	core/mvcc: fix tests with delete	2025-11-19 17:18:44 +01:00
Pere Diaz Bou	bf1afb56cf	core/mvcc: test with delete after checkpoint	2025-11-19 16:56:32 +01:00
Pere Diaz Bou	b4c11705f3	core/mvcc: few suggestions from pr	2025-11-19 16:44:24 +01:00
Jussi Saurio	9a3cc2ac40	Merge 'Triggers: fix issues with ALTER TABLE' from Jussi Saurio Still in draft, because there's a ton of stupid AI slop ## Fixes Closes #3983 Closes #3984 - Disallow DROP COLUMN on columns referenced in triggers - Propagate RENAME COLUMN to trigger SQL definitions ## Drop Column details DROP COLUMN is not allowed when the column is mentioned in a trigger on the table the column is dropped from, eg: ``` turso> CREATE TABLE t(x,y); turso> CREATE TRIGGER foo BEFORE INSERT ON t BEGIN INSERT INTO t VALUES (NEW.x); END; turso> ALTER TABLE t DROP COLUMN x; × Parse error: cannot drop column "x": it is referenced in trigger foo ``` However, it is allowed if the trigger is on another table: ``` turso> CREATE TABLE t(x,y); turso> CREATE TABLE u(x,y); turso> CREATE TRIGGER bar BEFORE INSERT ON t BEGIN INSERT INTO u(y) VALUES (NEW.x); END; turso> ALTER TABLE u DROP COLUMN y; turso> INSERT INTO t VALUES (1,1); × Parse error: table u has no column named y ``` ## AI Disclosure Nearly all of the code here is vibecoded. I first asked Cursor Composer to create an initial implementation. Then, I asked it to try to discover edge cases using the `turso` and `sqlite3` CLIs, and write tests+fixes for the edge cases found. The code is a bit slop and there is a LOT of it because the AST traversal to rewrite column references is all mostly from scratch, but this isn't a particularly performance-critical use case and it should solve most of the issues with RENAME and DROP COLUMN. Closes #3986	2025-11-19 15:00:07 +02:00
Jussi Saurio	32063334f9	fix operator precedence bug	2025-11-19 14:29:33 +02:00
Jussi Saurio	fddcea788b	refactor	2025-11-19 14:29:33 +02:00
Jussi Saurio	5d9a0b15f8	Handle qualified column references in triggers wrt ALTER TABLE	2025-11-19 14:29:33 +02:00
Jussi Saurio	dbdf60a628	extract common functionality	2025-11-19 14:29:33 +02:00
Jussi Saurio	745cdc3aa2	Align trigger sql rewrite behavior with sqlite SQLite doesn't rewrite INSERT lists or WHEN clause, it instead lets the trigger go "stale" and will cause runtime errors. This may not be great behavior, but it's compatible...	2025-11-19 14:29:33 +02:00
Jussi Saurio	5b1c69a9d0	fix ai slop with more ai slop	2025-11-19 14:29:33 +02:00
Jussi Saurio	a0a1bd6637	Triggers: fix issues with ALTER TABLE - Disallow DROP COLUMN on columns referenced in triggers - Propagate RENAME COLUMN to trigger SQL definitions DROP COLUMN is not allowed when the column is mentioned in a trigger on the table the column is dropped from, eg: ``` turso> CREATE TABLE t(x,y); turso> CREATE TRIGGER foo BEFORE INSERT ON t BEGIN INSERT INTO t VALUES (NEW.x); END; turso> ALTER TABLE t DROP COLUMN x; × Parse error: cannot drop column "x": it is referenced in trigger foo ``` However, it is allowed if the trigger is on another table: ``` turso> CREATE TABLE t(x,y); turso> CREATE TABLE u(x,y); turso> CREATE TRIGGER bar BEFORE INSERT ON t BEGIN INSERT INTO u(y) VALUES (NEW.x); END; turso> ALTER TABLE u DROP COLUMN y; turso> INSERT INTO t VALUES (1,1); × Parse error: table u has no column named y ``` Nearly all of the code here is vibecoded. I first asked Cursor Composer to create an initial implementation. Then, I asked it to try to discover edge cases using the `turso` and `sqlite3` CLIs, and write tests+fixes for the edge cases found. The code is a bit slop, but this isn't a particularly performance-critical use case and it should solve most of the issues with RENAME and DROP COLUMN.	2025-11-19 14:29:33 +02:00
Jussi Saurio	fb31fd56ba	Merge 'Simulator: refactor and simplify `InteractionPlan`' from Pedro Muniz Depends on #3775 - to remove noise from this PR. ## Motivation In my continued efforts in making the simulator more accessible and simpler to work with, I have over time simplified and optimized some parts of the codebase like query generation and decision making so that more people from the community can contribute and enhance the simulator. This PR is one more step in that direction. Before this PR, our `InteractionPlan` stored `Vec<Interactions>`. `Interactions` are a higher level collection that will generate a list of `Interaction` (yes I know the naming can be slightly confusing sometimes. Maybe we can change it later as well. Especially because `Interactions` are mainly just `Property`). However, this architecture imposed a problem when MVCC enters the picture. MVCC requires us to make sure that DDL statements are executed serially. To avoid adding even more complexity to plan generation, I opted on previous PRs to check before emitting an `Interaction` for execution, if the interaction is a DDL statement, and if it is, I emit a `Commit` for each connection still in a transaction. This worked slightly fine, but as we do not store the actual execution of interactions in the interaction plan, only the higher level `Interactions`, this meant that I had to do some workarounds to modify the `Interactions` inside the plan to persist the `Commit` I generated on demand. ## Problem However, I was stupid and overlooked the fact that for certain properties that allow queries to be generated in the middle (referenced as extensional queries in the code), we cannot specify the connection that should execute that query, meaning if a DDL statement occurred there, the simulator could emit the query but could not save it properly in the plan to reproduce in shrinking. So to correct and make interaction generation/emission less brittle, I refactored the `InteractionPlan` so that it stores `Vec<Interaction>` instead. ## Implications - `Interaction` is not currently serializable using `Serde` due to the fact that it stores a function in `Assertion`. This means that we cannot serialize the plan into a `plan.json`. Which to me is honestly fine, as the only things that used `plan.json` was `--load` and `--watch` options. Which are options almost nobody really used. - For load, instead of generating the whole plan it just read the plan from disk. The workaround for that right now is just load the `cli_opts` that were last run for that particular seed and use those exact options to run the simulation. - For watch, currently there is not workaround but, @alpaylan told me has some plans to make assertions serializable by embedding a custom language into the `plan.sql` file, meaning we will probably not need a json file at all to store the interaction plan. And this embedded language will make it much easier to bring back a more proper watch mode. - The current shrinking algorithms all have some notion of properties and removal of properties, but `Interaction` do not have this concept. So I added some metadata to interactions and a origin ID to each `Interaction` so that we can search through the list of interactions using binary search to get all of the interactions that are part of the same `Property`. To support this, I added an `InteractionBuilder` and some utilities to iterate and remove properties in the `InteractionPlan` ## Conclusion Overall, this code simplifies emission of interactions and ensures the `InteractionPlan` always stores the actual interactions that get executed. This also decouples more query generation logic from query emission logic. Closes #3774	2025-11-19 11:10:51 +02:00
Pekka Enberg	687d9faf37	Turso 0.4.0-pre.2	2025-11-19 09:40:08 +02:00
Jussi Saurio	92f47dffb0	Merge 'Trigger support' from Jussi Saurio ## Trigger Support This PR adds support for triggers: - `CREATE TRIGGER` - `DROP TRIGGER` Supported - `BEFORE/AFTER INSERT` - `BEFORE/AFTER DELETE` - `BEFORE/AFTER UPDATE [OF <col1,col2,col3>]` Not supported: - `INSTEAD OF` - `TEMPORARY` ### Implementation details - Triggers are executed within a new `Insn::Program` instruction. The spec of the insn differs a bit from SQlite: we store a `Statement` inside that instruction that we can `reset()` for every invocation. - Like Sqlite, trigger programs take `NEW` and `OLD` rows as program parameters. Whenever there are triggers that would fire as the result of a DML statement: - `DELETE` writes the rows being deleted into a `RowSet` first. - `UPDATE` and `INSERT` write the rows being updated into an ephemeral table first. ### Other shit Also added `EXPLAIN` support - the bytecode plans for trigger subprograms are appended after the main program. ### AI disclosure Used Cursor quite a bit for generating boilerplate code for this - you can blame all the bad code on the AI of course 🤡 ### Follow-ups: 1. ALTER TABLE ops need to rewrite the sql in the CREATE TRIGGER statement e.g. if a column is renamed. Columns cannot be dropped if referenced in triggers. 2. Fix weird rowid -1 fallback: https://github.com/tursodatabase/turso/pull/3979#issuecomment-3547999449 Closes #3979	2025-11-19 08:42:41 +02:00
Pere Diaz Bou	ca30756dfd	core/mvcc/cursor: implement `prev` and `last`	2025-11-18 19:51:27 +01:00
Pere Diaz Bou	b38e69b515	core/mvcc: add get_row_id_for_table_in_direction(forward/backwards)	2025-11-18 19:51:27 +01:00
Pere Diaz Bou	b19762a812	core/mvcc/cursor: get_new_position_from_mvcc_and_btree backwards and last fix	2025-11-18 19:51:27 +01:00
Pere Diaz Bou	73d9f0016c	core/mvcc: test order by desc with mvcc	2025-11-18 19:51:27 +01:00
Pere Diaz Bou	72bf195f4b	Merge 'core/mvcc/cursor: rowid don't seek first rowid' from Pere Diaz Bou rowid should only try to use the current's position. So if we are not pointing to a `Loaded` row, then it should return None <!-- CURSOR_SUMMARY --> --- > [!NOTE] > Change `rowid()` to return `None` unless cursor is on a `Loaded` row, removing the implicit seek from `BeforeFirst`. > > - Core MVCC Cursor (`core/mvcc/cursor.rs`): > - Adjust `rowid()` behavior: remove implicit first-row seek when `BeforeFirst`; return `None` unless position is `Loaded`. > > <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit `8848775a71`. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup> <!-- /CURSOR_SUMMARY --> Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #3977	2025-11-18 19:51:19 +01:00
Jussi Saurio	ad753281b6	Remove unneeded too_many_arguments annotation	2025-11-18 18:41:45 +02:00
Jussi Saurio	129ee8c82b	Remove more AI-generated unnecessary code	2025-11-18 17:24:10 +02:00
Jussi Saurio	2cbc83a01c	triggers: add ParamMap abstraction to reduce code noise a bit	2025-11-18 17:08:22 +02:00
Jussi Saurio	11528cff12	Remove weird AI-innovated negative index hack	2025-11-18 16:56:27 +02:00
Jussi Saurio	2674145937	Avoid allocation when no triggers exist	2025-11-18 15:40:06 +02:00
Jussi Saurio	d33c294380	remove unhelpful comment	2025-11-18 15:39:53 +02:00
Jussi Saurio	5c1ebbd011	Use VecDeque for trigger storage for similar reasons as indexes do	2025-11-18 15:19:01 +02:00
Jussi Saurio	e1dee4a072	triggers: add a lot of different kinds of tests	2025-11-18 15:19:01 +02:00
Jussi Saurio	9aa09d5ccf	Add EXPLAIN support for trigger subprograms They get printed after the parent program.	2025-11-18 15:19:01 +02:00
Jussi Saurio	423a1444d1	Don't crash if table cursor is already opened	2025-11-18 15:19:01 +02:00
Jussi Saurio	7f536506c3	Clear deferred_seeks for cursor when it is closed Sometimes the deferred seek never happens, so we don't want it to dangle if the same cursor is reused for another seek	2025-11-18 15:19:01 +02:00
Jussi Saurio	d398f12471	triggers: subprograms shouldnt commit or use the transaction opcode	2025-11-18 15:19:01 +02:00
Jussi Saurio	be6f8ab8b3	state.end_statement() should not be called separately in cases where abort() already does it	2025-11-18 15:19:01 +02:00
Jussi Saurio	7a12e184a8	Only reset FK violation counter if stmt was rolled back In the case of trigger subprograms the statement didn't roll back, since the parent program will roll it back.	2025-11-18 15:19:01 +02:00
Jussi Saurio	770c6eef9f	triggers: subprograms dont use transactions	2025-11-18 15:19:01 +02:00
Jussi Saurio	70267f8710	triggers: add translation logic for INSERT triggers	2025-11-18 15:19:01 +02:00
Jussi Saurio	e28301dc2e	triggers: add translation logic for UPDATE triggers	2025-11-18 15:19:01 +02:00
Jussi Saurio	516dae5b6a	triggers: add translation logic for DELETE triggers	2025-11-18 15:19:01 +02:00
Jussi Saurio	5b037b0f75	resolve labels for RowSetRead insn	2025-11-18 15:19:01 +02:00
Jussi Saurio	7d1543fcc5	triggers: take triggers into account in optimizer decision - optimize the select plan used for the RowSet in DELETE - require ephemeral table when UPDATE involves triggers	2025-11-18 15:19:01 +02:00
Jussi Saurio	78ce3c8658	triggers: add capability for DeletePlan to write the write set into a RowSet first This is needed for safe DELETE when there are DELETE triggers on the affected table.	2025-11-18 15:19:01 +02:00
Jussi Saurio	e60e37da7d	triggers: add execution plumbing to translation and vdbe layers	2025-11-18 15:19:01 +02:00
Jussi Saurio	3d00686f48	triggers: translation functions for DDL	2025-11-18 12:18:07 +02:00
Jussi Saurio	d4b487eebc	triggers: add in-memory schema entries	2025-11-18 12:14:27 +02:00
Preston Thorpe	e61234d522	Merge 'translate/insert: Implement INSERT OR REPLACE' from Preston Thorpe This PR implements support for `INSERT OR REPLACE INTO t`. For `OR IGNORE`, we currently rewrite this internally to an `ON CONFLICT DO NOTHING`, and I was hopeful we could do this with OR REPLACE, however it seems SQLite actually deletes the row and then proceeds to insert, so we could not simply rewrite this to an `ON CONFLICT DO UPDATE SET col=excluded.col`, as this would result in differing rowid's when compared to SQLite. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #3972	2025-11-17 18:50:15 -05:00
PThorpe92	56f35ad4cd	cargo fmt	2025-11-17 12:22:55 -05:00
PThorpe92	c3185d0b8c	Properly handle foreign keys for INSERT OR REPLACE	2025-11-17 12:19:33 -05:00
Pere Diaz Bou	8848775a71	core/mvcc/cursor: rowid don't seek first rowid rowid should only try to use the current's position. So if we are not pointing to a `Loaded` row, then it should return None	2025-11-17 16:17:52 +01:00

1 2 3 4 5 ...

10954 Commits