turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-19 09:34:18 +01:00

Author	SHA1	Message	Date
Jussi Saurio	cc2e14b11c	Read page 1 from pager always, no separate db_header	2025-06-24 14:41:49 -03:00
Nils Koch	2827b86917	chore: fix clippy warnings	2025-06-23 19:52:13 +01:00
pedrocarlo	74beac5ea8	ephemeral table for update when rowid is being update	2025-06-20 16:28:10 -03:00
Jussi Saurio	c69047106c	Merge 'Implement RowData opcode' from meteorgan The `RowData` opcode is required to implement #1575. I haven't found a ideal way to test this PR independently, but I verified its functionality while working on #1575(to be committed soon), and it performs effectively. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1756	2025-06-20 21:58:47 +03:00
meteorgan	5c7d1423e7	Support indent for Goto opcode when executing explain	2025-06-17 23:55:39 +08:00
meteorgan	4f742a3a0f	Implement RowData opcode	2025-06-16 22:07:35 +08:00
Pekka Enberg	882c5ca168	Merge 'Simple integrity check on btree' from Pere Diaz Bou This PR adds support for the instruction `IntegrityCk` which performs an integrity check on the contents of a single table. Next PR I will try to implement the rest of the integrity check where we would check indexes containt correct amount of data and some more. <img width="1151" alt="image" src="https://github.com/user- attachments/assets/29d54148-55ba-480f-b972-e38587f0a483" /> Closes #1719	2025-06-16 13:46:26 +03:00
Pekka Enberg	90c1e3fc06	Switch Connection to use Arc instead of Rc Connection needs to be Arc so that bindings can wrap it with `Mutex` for multi-threading.	2025-06-16 10:43:19 +03:00
pedrocarlo	8dbf09bb42	betters instrumentation for btree operations	2025-06-11 23:34:32 -03:00
Pere Diaz Bou	9383ba207d	introduce integrity_check pragma	2025-06-11 11:14:29 +02:00
Jussi Saurio	da2437408e	get_new_rowid(): fix off by one - rowids start at 1	2025-06-10 14:16:26 +03:00
Jussi Saurio	2bac140d73	Remove SeekOp::EQ and encode eq_only in LE&GE - needed for iteration direction aware equality seeks	2025-06-10 14:16:26 +03:00
Pere Diaz Bou	77b6896eae	implement lazy record and rowid in cursor This also comments save_context for now	2025-06-10 14:16:26 +03:00
Pekka Enberg	c6ef19396d	Merge 'Add support for pragma table-valued functions' from Piotr Rżysko This PR adds support for table-valued functions for PRAGMAs (see the [PRAGMA functions section](https://www.sqlite.org/pragma.html)). Additionally, it introduces built-in table-valued functions. I considered using extensions for this, but there are several reasons in favor of a dedicated mechanism: * It simplifies the use of internal functions, structs, etc. For example, when implementing `json_each` and `json_tree`, direct access to internals was necessary: https://github.com/tursodatabase/limbo/pull/1088 * It avoids FFI overhead. [Benchmarks](https://github.com/piotrrzysko/li mbo/blob/pragma_vtabs_bench/core/benches/pragma_benchmarks.rs) on my hardware show that `pragma_table_info()` implemented as an extension is 2.5× slower than the built-in version. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1642	2025-06-04 09:08:10 +03:00
Piotr Rzysko	6300deb77f	Move VTabOpaqueCursor to vtab module	2025-06-01 07:45:57 +02:00
pedrocarlo	0757109676	instrument trace_insn	2025-05-30 11:33:22 -03:00
Pere Diaz Bou	d4f1b8e068	update i64::MAX comment	2025-05-30 14:02:05 +02:00
Pere Diaz Bou	da4190a23e	Convert u64 rowid to i64 Rowids can be negative, therefore let's swap to i64	2025-05-30 13:07:31 +02:00
Jussi Saurio	5632a6046e	Merge 'Fix: allow DeferredSeek on more than one cursor per program' from Jussi Saurio Found while fuzzing nested subqueries. Since subqueries result in nested plans, it quickly revealed that there can be multiple `DeferredSeek` instructions issued for different cursors, but our `ProgramState` only supported one at a time. Closes #1610	2025-05-30 09:39:23 +03:00
Jussi Saurio	69133b3b2e	Fix: allow DeferredSeek on more than one cursor per program	2025-05-29 16:05:47 +03:00
Jussi Saurio	cc405dea7e	Use new TableReferences struct everywhere	2025-05-29 11:44:56 +03:00
Jussi Saurio	77ce4780d9	Fix ProgramBuilder::cursor_ref not having unique keys Currently we have this: program.alloc_cursor_id(Option<String>, CursorType)` where the String is the table's name or alias ('users' or 'u' in the query). This is problematic because this can happen: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` There are two cursors, both with identifier 't'. This causes a bug where the program will use the same cursor for both the main query and the subquery, since they are keyed by 't'. Instead introduce `CursorKey`, which is a combination of: 1. `TableInternalId`, and 2. index name (Option<String> -- in case of index cursors. This should provide key uniqueness for cursors: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` here the first 't' will have a different `TableInternalId` than the second `t`, so there is no clash.	2025-05-29 00:59:24 +03:00
Jussi Saurio	360b1fcdae	Fix bug: op_vopen should replace cursor slot, not add new one	2025-05-27 10:52:36 +03:00
Jussi Saurio	3ba9f2ab97	Small cleanups to pager/wal/vdbe - mostly naming - Instead of using a confusing CheckpointStatus for many different things, introduce the following statuses: * PagerCacheflushStatus - cacheflush can result in either: - the WAL being written to disk and fsynced - but also a checkpoint to the main BD file, and fsyncing the main DB file Reflect this in the type. * WalFsyncStatus - previously CheckpointStatus was also used for this, even though fsyncing the WAL doesn't checkpoint. * CheckpointStatus/CheckpointResult is now used only for actual checkpointing. - Rename HaltState to CommitState (program.halt_state -> program.commit_state) - Make WAL a non-optional property in Pager * This gets rid of a lot of if let Some(...) boilerplate * For ephemeral indexes, provide a DummyWAL implementation that does nothing. - Rename program.halt() to program.commit_txn() - Add some documentation comments to structs and functions	2025-05-26 10:37:34 +03:00
Pekka Enberg	e3f71259d8	Rename OwnedValue -> Value We have not had enough merge conflicts for a while so let's do a tree-wide rename.	2025-05-15 09:59:46 +03:00
pedrocarlo	3526a206e4	support Unique properly by creating a vec of auto indices	2025-05-14 11:34:39 -03:00
Piotr Rzysko	977b6b331a	Fix memory leak caused by unclosed virtual table cursors The following code reproduces the leak, with memory usage increasing over time: ``` #[tokio::main] async fn main() { let db = Builder::new_local(":memory:").build().await.unwrap(); let conn = db.connect().unwrap(); conn.execute("SELECT load_extension('./target/debug/liblimbo_series');", ()) .await .unwrap(); loop { conn.execute("SELECT * FROM generate_series(1,10,2);", ()) .await .unwrap(); } } ``` After switching to the system allocator, the leak becomes detectable with Valgrind: ``` 32,000 bytes in 1,000 blocks are definitely lost in loss record 24 of 24 at 0x538580F: malloc (vg_replace_malloc.c:446) by 0x62E15FA: alloc::alloc::alloc (alloc.rs:99) by 0x62E172C: alloc::alloc::Global::alloc_impl (alloc.rs:192) by 0x62E1530: allocate (alloc.rs:254) by 0x62E1530: alloc::alloc::exchange_malloc (alloc.rs:349) by 0x62E0271: new<limbo_series::GenerateSeriesCursor> (boxed.rs:257) by 0x62E0271: open_GenerateSeriesVTab (lib.rs:19) by 0x425D8FA: limbo_core::VirtualTable::open (lib.rs:732) by 0x4285DDA: limbo_core::vdbe::execute::op_vopen (execute.rs:890) by 0x42351E8: limbo_core::vdbe::Program::step (mod.rs:396) by 0x425C638: limbo_core::Statement::step (lib.rs:610) by 0x40DB238: limbo::Statement::execute::{{closure}} (lib.rs:181) by 0x40D9EAF: limbo::Connection::execute::{{closure}} (lib.rs:109) by 0x40D54A1: example::main::{{closure}} (example.rs:26) ``` Interestingly, when using mimalloc, neither Valgrind nor mimalloc’s internal statistics report the leak.	2025-05-05 21:26:23 +02:00
Pere Diaz Bou	c808863256	test update with index	2025-05-01 11:44:23 +03:00
Pere Diaz Bou	a30241ca91	Add state machine for op_idx_delete + DeleteState simplification DeleteState had a bit too many unnecessary states so I removed them. Usually we care about having a different state when I/O is triggered requiring a state to be stored for later. Furthermore, there was a bug with op_idx_delete where if balance is triggered, op_idx_delete wouldn't be re-entrant. So a state machine was added to prevent that from happening.	2025-04-29 14:58:20 +03:00
Jussi Saurio	c3441f9685	vdbe: move comments if instructions were moved around in emit_constant_insns()	2025-04-24 11:05:21 +03:00
Jussi Saurio	0f5c791784	vdbe: refactor label resolution to account for insn offsets changing	2025-04-24 11:05:21 +03:00
Jussi Saurio	9dadc58194	Add support for Insn::Once	2025-04-14 11:23:37 +03:00
Jussi Saurio	d286a56e15	refactor: fold Async/Await insns into a single instruction	2025-04-14 09:40:20 +03:00
Levy A.	d210ee1497	cargo fmt	2025-04-12 18:55:48 -03:00
Levy A.	5c0b112125	fix: return null when parameter is unbound	2025-04-12 17:43:04 -03:00
Pekka Enberg	31f0d174d7	core/vdbe: Move `exec_*()` funtions to execute.rs	2025-04-10 09:42:03 +03:00
Pekka Enberg	5906d7971a	core/vdbe: Clean up imports	2025-04-10 09:25:15 +03:00
Jussi Saurio	3e42a62cd0	Add SeekLE/SeekLT operations to VDBE	2025-04-09 10:14:29 +03:00
Jussi Saurio	ed929d2a0a	Merge 'Properly handle insertion of indexed columns' from Preston Thorpe ```console limbo> create index p on products(price); limbo> explain insert into products (name,price) values ('huh',32); addr opcode p1 p2 p3 p4 p5 comment ---- ----------------- ---- ---- ---- ------------- -- ------- 0 Init 0 17 0 0 Start at 17 1 OpenWriteAsync 0 273 0 0 2 OpenWriteAwait 0 0 0 0 3 String8 0 3 0 huh 0 r[3]='huh' 4 Integer 32 4 0 0 r[4]=32 5 OpenWriteAsync 1 297 0 0 6 OpenWriteAwait 0 0 0 0 7 NewRowId 0 1 0 0 8 MakeRecord 2 3 5 0 r[5]=mkrec(r[2..4]) 9 InsertAsync 0 5 1 0 10 InsertAwait 0 0 0 0 11 Copy 4 6 0 0 r[6]=r[4] 12 Copy 1 7 0 0 r[7]=r[1] 13 MakeRecord 6 2 8 0 r[8]=mkrec(r[6..7]) 14 IdxInsertAsync 1 8 6 0 key=r[8] 15 IdxInsertAwait 1 0 0 0 16 Halt 0 0 0 0 17 Transaction 0 1 0 0 write=true 18 Null 0 2 0 0 r[2]=NULL 19 Goto 0 1 0 0 ``` When an insert happens, we need to be sure to insert into the index btree as well. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1265	2025-04-09 10:13:59 +03:00
PThorpe92	4b9b6c969b	Parse schema rows after extensions are loaded	2025-04-08 20:10:47 -04:00
PThorpe92	1f29307fe8	Support proper index handling when doing insertions	2025-04-08 08:55:14 -04:00
PThorpe92	a2b9d8d371	Use Correct flag on idx insert to prevent seeking	2025-04-05 11:19:09 -04:00
Pekka Enberg	c592e27dca	core/vdbe: Move explain() to last method in Program ...it's the least interesting one and we should have `step()` at the top.	2025-04-02 18:55:35 +03:00
Pere Diaz Bou	7e4b57f2e2	VDBE with direct function dispatch This PR is unapologetically stolen from @vmg's implementation in Vitess implemented here https://github.com/vitessio/vitess/pull/12369. If you want a more in depth explanation of how this works you can read the [blog post he carefully wrote](https://planetscale.com/blog/faster-interpreters-in-go-catching-up-with-cpp). In limbo we have a huge problem with [register spilling](https://en.wikipedia.org/wiki/Register_allocation), this can be easily observed with the prolog of `Program::step` before: ```llvm start: %e.i.i304.i = alloca [0 x i8], align 8 %formatter.i305.i = alloca [64 x i8], align 8 %buf.i306.i = alloca [24 x i8], align 8 %formatter.i259.i = alloca [64 x i8], align 8 ..................... these are repeated for hundreds of lines ..................... %formatter.i52.i = alloca [64 x i8], align 8 %buf.i53.i = alloca [24 x i8], align 8 %formatter.i.i = alloca [64 x i8], align 8 %buf.i.i = alloca [24 x i8], align 8 %_87.i = alloca [48 x i8], align 8 %_82.i = alloca [24 x i8], align 8 %_73.i = alloca [24 x i8], align 8 %_66.i8446 = alloca [24 x i8], align 8 %_57.i = alloca [24 x i8], align 8 %_48.i = alloca [24 x i8], align 8 ``` After these changes we completely remove the need of register spilling (yes that is the complete prolog): ```llvm start: %self1 = alloca [80 x i8], align 8 %pager = alloca [8 x i8], align 8 %mv_store = alloca [8 x i8], align 8 store ptr %0, ptr %mv_store, align 8 store ptr %1, ptr %pager, align 8 %2 = getelementptr inbounds i8, ptr %state, i64 580 %3 = getelementptr inbounds i8, ptr %state, i64 576 %4 = getelementptr inbounds i8, ptr %self, i64 16 %5 = getelementptr inbounds i8, ptr %self, i64 8 %6 = getelementptr inbounds i8, ptr %self1, i64 8 br label %bb1, !dbg !286780 ``` When it comes to branch prediction, we don't really fix a lot because thankfully rust already compiles `match` expressions to a jump table: ```llvm %insn = getelementptr inbounds [0 x %"vdbe::insn::Insn"], ptr %self657, i64 0, i64 %index, !dbg !249527 %332 = load i8, ptr %insn, align 8, !dbg !249528, !range !38291, !noundef !14 switch i8 %332, label %default.unreachable26674 [ i8 0, label %bb111 i8 1, label %bb101 i8 2, label %bb100 i8 3, label %bb110 ... i8 104, label %bb5 i8 105, label %bb16 i8 106, label %bb14 ], !dbg !249530 ``` Some results ---- ``` function dispatch: Execute `SELECT 1`/limbo_execute_select_1 time: [29.498 ns 29.548 ns 29.601 ns] change: [-3.6125% -3.3592% -3.0804%] (p = 0.00 < 0.05) main: Execute `SELECT 1`/limbo_execute_select_1 time: [33.789 ns 33.832 ns 33.878 ns] ```	2025-04-02 14:55:37 +02:00
Pekka Enberg	6199c3994a	Merge 'Create plan for Update queries' from Preston Thorpe closes #1186, or at least works towards it by implementing an actual Plan for update queries instead of translating everything inline. This allows for actually using index's, rewriting const expressions, pushing predicates instead of hardcoding a full scan in the translation. ### TODOs: 1. `RETURNING` clause/result columns 2. `OFFSET` clauses 3. on conflict ### LIMIT: By supporting `LIMIT` directly in update queries, we'll have to put the tests outside of the compatibility tests, maybe in the CLI tests. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1189	2025-04-01 17:33:32 +03:00
Pekka Enberg	169864456e	Merge 'Fix IdxCmp insn comparisons' from Jussi Saurio We never hit bugs due to these because of 1. not having multi column indexes in our TCL test databases, 2. otherwise not really having Rust tests involving indexes, and 3. `IdxLt` and `IdxLe` not actually being used anywhere yet Also as @PThorpe92 pointed out there are some nuances to the comparison logic we may need to eventually implement regarding comparisons with uneven number of keys: https://github.com/sqlite/sqlite/blob/master/src/vdbeaux.c#L4719 Reviewed-by: Preston Thorpe (@PThorpe92) Closes #1215	2025-03-31 10:40:01 +03:00
Jussi Saurio	42e25d23dd	Fix IdxCmp insn comparisons	2025-03-30 23:01:41 +03:00
PThorpe92	516e443a2b	Fix use index cursor id in emitter and prevent reinsert pk on update	2025-03-30 12:15:25 -04:00
Ihor Andrianov	a234aa3647	remove vec cloning from json agg functions	2025-03-30 19:10:15 +03:00
Ihor Andrianov	40bb867d54	clippy	2025-03-30 19:01:16 +03:00

... 2 3 4 5 6 ...

675 Commits