turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-26 20:44:23 +01:00

Author	SHA1	Message	Date
Jussi Saurio	592ba41137	Add assertion forbidding duplicate cursor keys	2025-05-29 01:04:45 +03:00
Jussi Saurio	77ce4780d9	Fix ProgramBuilder::cursor_ref not having unique keys Currently we have this: program.alloc_cursor_id(Option<String>, CursorType)` where the String is the table's name or alias ('users' or 'u' in the query). This is problematic because this can happen: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` There are two cursors, both with identifier 't'. This causes a bug where the program will use the same cursor for both the main query and the subquery, since they are keyed by 't'. Instead introduce `CursorKey`, which is a combination of: 1. `TableInternalId`, and 2. index name (Option<String> -- in case of index cursors. This should provide key uniqueness for cursors: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` here the first 't' will have a different `TableInternalId` than the second `t`, so there is no clash.	2025-05-29 00:59:24 +03:00
Jussi Saurio	85316d8419	Merge 'clear page cache on transaction failure' from Pere Diaz Bou This is the first step towards rollback, since we still don't spill pages with WAL, we can simply invalidate page cache in case of failure. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1599	2025-05-28 23:14:44 +03:00
krishvishal	fb1d53b0ec	Fix test. off by one.	2025-05-29 00:22:38 +05:30
krishvishal	5b57efd894	A couple more tests to test this case.	2025-05-29 00:10:59 +05:30
krishvishal	e3bc78f7e4	Fix unreachable panic when calling `serialize` on Value::Integer(0) by handling `SerialTypeKind` `ConstInt0` and `ConstInt1` in `Record::serialize()` - Changed `test_serialize_integers` to accomodate this change.	2025-05-29 00:08:37 +05:30
Pere Diaz Bou	28bd24b7d4	clear page cache on transaction failure This is the first step towards rollback, since we still don't spill pages with WAL, we can simply invalidate page cache in case of failure.	2025-05-28 15:54:28 +02:00
Jussi Saurio	dad1e6293b	Btree: fix cursor record state not being updated in insert_into_page() overwrite_cell() requires that the cursor state is pointing to a valid record, but this was not currently set properly.	2025-05-28 16:54:00 +03:00
Jussi Saurio	7ab243dc4e	Merge 'Make WhereTerm::consumed a Cell<bool>' from Jussi Saurio Currently in the main translation logic after planning and optimization, we don't _really_ need to pass a `&mut Vec<WhereTerm>` around anymore, except for the fact that virtual table constraint resolution is done ad- hoc in `init_loop()`. Even there, the only thing we mutate is `WhereTerm::consumed` which is a boolean indicating that the term has been "used up" by the optimizer and shouldn't be evaluated as a normal where clause condition anymore. In the upcoming branch for WHERE clause subqueries, I want to store immutable references to WHERE clause expressions in `Resolver`, but this is unfortunately not possible if we still use the aforementioned mutable references. Hence, we can temporarily make `WhereTerm::consumed` a `Cell<bool>` which allows us to pass an immutable reference to `init_loop()`, and the `Cell` can be removed once the virtual table constraint resolution is moved to an earlier part of the query processing pipeline. Closes #1597	2025-05-28 11:14:40 +03:00
Jussi Saurio	73e806ad84	Make WhereTerm::consumed a Cell<bool> Currently in the main translation logic after planning and optimization, we don't _really_ need to pass a &mut Vec<WhereTerm> around anymore, except for the fact that virtual table constraint resolution is done ad-hoc in `init_loop()`. Even there, the only thing we mutate is `WhereTerm::consumed` which is a boolean indicating that the term has been "used up" by the optimizer and shouldn't be evaluated as a normal where clause condition anymore. In the upcoming branch for WHERE clause subqueries, I want to store immutable references to WHERE clause expressions in `Resolver`, but this is unfortunately not possible if we still use the aforementioned mutable references. Hence, we can temporarily make `WhereTerm::consumed` a `Cell<bool>` which allows us to pass an immutable reference to `init_loop()`, and the `Cell` can be removed once the virtual table constraint resolution is moved to an earlier part of the query processing pipeline.	2025-05-28 11:02:39 +03:00
Jussi Saurio	51605ad2a4	Use lifetimes in walk_expr() to guarantee that child expr has same lifetime as parent expr	2025-05-28 10:56:30 +03:00
Jussi Saurio	a9ae1af75c	Fix: init_limit() in wrong place for Delete	2025-05-27 21:26:31 +03:00
Jussi Saurio	3c587b91b5	Add comment on init_limit()	2025-05-27 21:19:28 +03:00
Jussi Saurio	4e9d9a2470	Fix LIMIT handling Currently we have some usages of LIMIT where the actual limit counter is initialized next to the DecrJumpZero instruction, and then `program.mark_last_insn_constant()` is used to hoist the counter initialization to the beginning of the program. This is very fragile, and already FROM clause subquery handling works around this with a hack (removed in this PR), and (upcoming) WHERE clause subqueries would also run into problems because of this, because the LIMIT might need to be initialized once for every iteration of the subquery. This PR removes those usages for LIMIT, and LIMIT processing is now more intuitive: - limit counter is now initialized at the start of the query processing - a function init_limit() is extracted to do this for select/update/delete	2025-05-27 21:12:22 +03:00
Jussi Saurio	8abe5efe99	Merge 'Add Schema reference to Resolver - needed for adhoc subquery planning' from Jussi Saurio enabler for WHERE clause subquery ad-hoc planning&translation Closes #1589	2025-05-27 20:19:45 +03:00
Jussi Saurio	ad0f2bb399	Merge 'Small VDBE insn tweaks' from Jussi Saurio 1. allow calling op_null with Insn::BeginSubrtn - BeginSubrtn is identical to Null, but named differently so that its use in context is clearer 2. Insn::Return: add possibility to fallthrough on non-integer values as per sqlite spec Closes #1588	2025-05-27 20:19:31 +03:00
meteorgan	2f82762ca2	add function parse_signed_number	2025-05-28 00:33:41 +08:00
meteorgan	d9d3a5ecbb	Use the SetCookie opcode to implement user_version pragma	2025-05-28 00:31:11 +08:00
Jussi Saurio	d2a287f67f	Add Schema reference to Resolver - needed for adhoc subquery planning	2025-05-27 19:12:47 +03:00
Jussi Saurio	6914d61180	allow calling op_null with Insn::BeginSubrtn	2025-05-27 19:09:15 +03:00
Jussi Saurio	70965f4b28	Insn::Return: add possibility to fallthrough on non-integer values as per sqlite spec	2025-05-27 19:09:10 +03:00
Pekka Enberg	8d7f20b7d2	Merge 'Add libsql_wal_get_frame() API' from Pekka Enberg This pull request implements the `libsql_wal_get_frame()` API. To do that, we also introduce a `wait_for_completion()` API in I/O dispatcher. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1533	2025-05-27 18:17:32 +03:00
Pekka Enberg	59d28eac93	core: Switch Completion "is_completed" to use Cell Suggested by Jussi	2025-05-27 14:05:07 +03:00
Pekka Enberg	3250560eb8	sqlite3: Add libsql_wal_get_frame() API	2025-05-27 13:47:40 +03:00
Pekka Enberg	05df548b10	core/io: Add wait_for_completion() to I/O dispatcher	2025-05-27 13:47:40 +03:00
Jussi Saurio	a88e1c38f3	Merge 'Fix bug: op_vopen should replace cursor slot, not add new one' from Jussi Saurio Found this when reviewing #1528 locally and this was crashing ```sql INSERT INTO t SELECT * FROM generate_series(1,10,1); ``` Reason was that `op_vopen` was not replacing the already allocated cursor slot, but using `.insert()` Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #1583	2025-05-27 12:50:11 +03:00
Pere Diaz Bou	312bb5205a	Merge 'Reset idx delete state after successful finish' from Pere Diaz Bou If we don't reset the state of `IdxDelete`, next `IdxDelete` will start in `Deleting` state which is completely wrong since it should seek from the start. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1584	2025-05-27 11:31:25 +02:00
Pere Diaz Bou	a5a8a52a07	reset-idx-delete-state	2025-05-27 10:47:21 +02:00
Pekka Enberg	eca9a5b703	core/io: Switch to Arc<Completion>	2025-05-27 11:28:49 +03:00
Jussi Saurio	360b1fcdae	Fix bug: op_vopen should replace cursor slot, not add new one	2025-05-27 10:52:36 +03:00
Jussi Saurio	b72b99c973	Merge 'feature: `INSERT INTO <table> SELECT`' from Pedro Muniz Closes #1528 . - Modified `translate_select` so that the caller can define if the statement is top-level statement or a subquery. - Refactored `translate_insert` to offload the translation of multi-row VALUES and SELECT statements to `translate_select` - I did not try to change much of `populate_column_registers` as I did not want to break `translate_virtual_table_insert`. Ideally, I would want to unite this remaining logic folding `populate_column_registers` into `populate_columns_multiple_rows` and the `translate_virtual_table_insert` into `translate_insert`. But, I think this may be best suited for a separate PR. ## TODO - ~Tests~ - Done - ~Need to emit a temp table when we are selecting and inserting into the Same Table - https://github.com/sqlite/sqlite/blob/master/src/insert.c#L1369~ - Done - Optimization when table have the exact same schema - open an Issue about it - Virtual Tables do not benefit yet from this feature - open an Issue about it Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1566	2025-05-27 10:50:26 +03:00
Pekka Enberg	21535018aa	core: Don't pass page to begin_read_wal_frame() Make `begin_read_wal_frame()` a bit more generic by not requiring a page to be passed.	2025-05-27 10:02:36 +03:00
Jussi Saurio	3ba9f2ab97	Small cleanups to pager/wal/vdbe - mostly naming - Instead of using a confusing CheckpointStatus for many different things, introduce the following statuses: * PagerCacheflushStatus - cacheflush can result in either: - the WAL being written to disk and fsynced - but also a checkpoint to the main BD file, and fsyncing the main DB file Reflect this in the type. * WalFsyncStatus - previously CheckpointStatus was also used for this, even though fsyncing the WAL doesn't checkpoint. * CheckpointStatus/CheckpointResult is now used only for actual checkpointing. - Rename HaltState to CommitState (program.halt_state -> program.commit_state) - Make WAL a non-optional property in Pager * This gets rid of a lot of if let Some(...) boilerplate * For ephemeral indexes, provide a DummyWAL implementation that does nothing. - Rename program.halt() to program.commit_txn() - Add some documentation comments to structs and functions	2025-05-26 10:37:34 +03:00
pedrocarlo	1410e57112	correct union result_row or yield emission + test	2025-05-26 01:06:26 -03:00
pedrocarlo	ee93316c46	fix num_values detection + emitting correct column for temp_table + tests	2025-05-25 19:15:28 -03:00
pedrocarlo	e3fd1e589e	support using a INSERT SELECT that references the same table in both statements	2025-05-25 19:15:28 -03:00
pedrocarlo	90e3c8483d	tests with compound select	2025-05-25 19:15:28 -03:00
pedrocarlo	72c1f2f582	fix rebase issues and make code compile by cloning query type. Adjust the compound select behavior with insert	2025-05-25 19:13:40 -03:00
pedrocarlo	c8144340a0	adjust proper ordering for value insert	2025-05-25 19:12:30 -03:00
pedrocarlo	810211b3d1	passing incorrect number of values to virtual table insert	2025-05-25 19:12:30 -03:00
pedrocarlo	4bcfc8ca60	create separate function to populate multiple columns in a multi-row VALUES clause or in an INSERT INTO <table> SELECT. Virtual Table insert is broken, need to fix it still	2025-05-25 19:12:30 -03:00
pedrocarlo	bb7da39c72	remove assumption that translate_select is always called from a top-level context + adjust insert to use translate_select when needed	2025-05-25 19:12:30 -03:00
pedrocarlo	fd9e0db5cc	pass the owned ast to `translate_insert` + remove assumption of a list of values in populate_columns_insert	2025-05-25 19:02:17 -03:00
pedrocarlo	15ffdd3e51	modify translate_select to return number of result columns	2025-05-25 19:02:17 -03:00
Jussi Saurio	07fa3a9668	Rename SelectQueryType to QueryDestination	2025-05-25 21:23:04 +03:00
Jussi Saurio	d893a55c55	UNION	2025-05-25 21:23:04 +03:00
Jussi Saurio	7c07c09300	Add stable internal_id property to TableReference Currently our "table id"/"table no"/"table idx" references always use the direct index of the `TableReference` in the plan, e.g. in `SelectPlan::table_references`. For example: ```rust Expr::Column { table: 0, column: 3, .. } ``` refers to the 0'th table in the `table_references` list. This is a fragile approach because it assumes the table_references list is stable for the lifetime of the query processing. This has so far been the case, but there exist certain query transformations, e.g. subquery unnesting, that may fold new table references from a subquery (which has its own table ref list) into the table reference list of the parent. If such a transformation is made, then potentially all of the Expr::Column references to tables will become invalid. Consider this example: ```sql -- Assume tables: users(id, age), orders(user_id, amount) -- Get total amount spent per user on orders over $100 SELECT u.id, sub.total FROM users u JOIN (SELECT user_id, SUM(amount) as total FROM orders o WHERE o.amount > 100 GROUP BY o.user_id) sub WHERE u.id = sub.user_id -- Before subquery unnesting: -- Main query table_references: [users, sub] -- u.id refers to table 0, column 0 -- sub.total refers to table 1, column 1 -- -- Subquery table_references: [orders] -- o.user_id refers to table 0, column 0 -- o.amount refers to table 0, column 1 -- -- After unnesting and folding subquery tables into main query, -- the query might look like this: SELECT u.id, SUM(o.amount) as total FROM users u JOIN orders o ON u.id = o.user_id WHERE o.amount > 100 GROUP BY u.id; -- Main query table_references: [users, orders] -- u.id refers to table index 0 (correct) -- o.amount refers to table index 0 (incorrect, should be 1) -- o.user_id refers to table index 0 (incorrect, should be 1) ``` We could ofc traverse every expression in the subquery and rewrite the table indexes to be correct, but if we instead use stable identifiers for each table reference, then all the column references will continue to be correct. Hence, this PR introduces a `TableInternalId` used in `TableReference` as well as `Expr::Column` and `Expr::Rowid` so that this kind of query transformations can happen with less pain.	2025-05-25 20:26:17 +03:00
Jussi Saurio	b5ac095716	Fix off-by-one error in max_frame after WAL load	2025-05-25 19:34:51 +03:00
Jussi Saurio	f388bc571e	Merge 'xConnect for virtual tables to query core db connection' from Preston Thorpe Re-Opening #1076 because it had bit-rotted to a point of no return. However it has improved. Now with Weak references and no incrementing Rc strong counts. This also includes a better test extension that returns info about the other tables in the schema. ![image](https://github.com/user- attachments/assets/4292dc9c-121e-4ba2-8a51-4533bbcf2afd) (theme doesn't show rows column) Closes #1366	2025-05-25 14:37:38 +03:00
Jussi Saurio	621ae60ab5	Merge 'Reconstruct WAL frame cache when WAL is opened' from Jussi Saurio Fixes #1567 Probably also fixes #1485 Currently we are simply unable to read any WAL frames from disk once a fresh process w/ Limbo is opened, since we never try to read anything from disk unless we already have it in our in-memory frame cache. This commit implements a crude way of reading entire WAL into memory as a single buffer and reconstructing the frame cache. Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #1570	2025-05-25 14:35:47 +03:00

1 2 3 4 5 ...

2735 Commits