turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-19 08:04:19 +01:00

Author	SHA1	Message	Date
Pekka Enberg	9492a29d47	Merge 'Fix performance regression' from Jussi Saurio Closes #2440 ## Fix 1 Do not start a read transaction when a SELECT is not going to access the database, which means we can avoid checking whether the schema has changed. ## Fix 2 Add a field `accesses_db` to `Program` and `Statement` so we can avoid even checking for `SchemaUpdated` errors when it's not possible to get one. ## Fix 3 Avoid doing any work in `commit_txn` when not in a transaction. This optimization is only enabled when `mv_store.is_none()`, because MVCC has its own logic and this doesn't work with MVCC enabled, and honestly I'm too tired to find out why. Left an inline comment about it, though. ```sql Execute `SELECT 1`/limbo_execute_select_1 time: [21.440 ns 21.513 ns 21.586 ns] change: [-60.766% -60.616% -60.453%] (p = 0.00 < 0.05) Performance has improved. ``` Effect is even more dramatic in CI where the latency is down over 80% Closes #2441	2025-08-05 16:30:18 +03:00
Jussi Saurio	1feb5ba2d3	perf/vdbe: avoid doing work in commit_txn if not in txn	2025-08-05 15:25:28 +03:00
Jussi Saurio	3f633247f7	perf/stmt: avoid checking for SchemaUpdated errors if it's impossible	2025-08-05 15:10:55 +03:00
Jussi Saurio	c498196c7b	fix/perf: fix regression in SELECT 1 benchmark Do not start a read transaction when a SELECT is not going to access the database, which means we can avoid checking whether the schema has changed.	2025-08-05 15:10:55 +03:00
Pere Diaz Bou	474f0d8bbc	core/mvcc: implement exists	2025-08-05 13:34:51 +02:00
Jussi Saurio	a66b56678d	Merge 'Reprepare Statements when Schema changes' from Pedro Muniz Closes #1967 To support this I had to change how we did `epilogue` similarly to how SQLite does it. SQLite first declares a `beginWriteOperation` when some statement is going to necessitate a Write Transaction. And as we now need to pass the current schema cookie to `epilogue` it was easier to call epilogue only in one location (like we do with prologue), and just have each statement declare their intentions separately. This allows us to not have to pass the Schema around just to do the epilogue. I believe this is something that @jussisaurio would be interested in. ~Also had to disable the MVCC test, as it was extremely buggy for me.~ Just disabled reprepare statements for MVCC Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2214	2025-08-05 00:01:14 +03:00
pedrocarlo	f1df9a909e	state machine for 'rewind'	2025-08-04 12:59:52 -03:00
pedrocarlo	e9c3f0d55b	disable schema reprepare for MVCC	2025-08-04 12:32:34 -03:00
pedrocarlo	0e3e64878c	workaround the fact that to reparse schema we have to avoid falling into a reprepared statement loop	2025-08-04 12:32:34 -03:00
pedrocarlo	266a7e1c66	do not error in `op_transaction` if page 1 was not allocated	2025-08-04 12:32:34 -03:00
pedrocarlo	0779c23bbf	fix merge conflicts	2025-08-04 12:32:34 -03:00
pedrocarlo	d2019e95f3	pass schema to epilogue for schema_version checking + do not Pragma Schema Version in `open_with_flags` to avoid infinite loop in reprepare. Just access the database header directly	2025-08-04 12:32:34 -03:00
pedrocarlo	736748cdf7	Simplify program epilogue by tracking the transaction mode and rollback status in the ProgramBuilder and then calling epilogue just once	2025-08-04 12:32:34 -03:00
pedrocarlo	c567636deb	Adjust Transaction OpCode to accept schema cookie + check if cookie changed	2025-08-04 12:32:34 -03:00
pedrocarlo	f8eb4ba14d	implement reprepare for statements	2025-08-04 12:32:34 -03:00
pedrocarlo	54636241c2	store Sql String inside `Program` for reprepare	2025-08-04 12:32:34 -03:00
Jussi Saurio	506bb5f67f	Merge 'Direct schema mutation – add instruction' from Levy A. Resolves #2378. ``` `ALTER TABLE _ RENAME TO _`/limbo_rename_table/ time: [15.645 ms 15.741 ms 15.850 ms] Found 12 outliers among 100 measurements (12.00%) 8 (8.00%) high mild 4 (4.00%) high severe `ALTER TABLE _ RENAME TO _`/sqlite_rename_table/ time: [34.728 ms 35.260 ms 35.955 ms] Found 15 outliers among 100 measurements (15.00%) 8 (8.00%) high mild 7 (7.00%) high severe ``` <img width="1000" height="199" alt="image" src="https://github.com/user-attachments/assets/ad943355-b57d-43d9-8a84-850461b8af41" /> Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2399	2025-08-04 16:55:38 +03:00
Pere Diaz Bou	f26e442597	core/mvcc: fix new rowid next rowid was being tracked globally for all tables and restarted to 0 every time database was opened	2025-08-04 12:31:17 +02:00
Levy A.	b9a3a93ef0	fix: clippy	2025-08-02 20:06:05 -03:00
Levy A.	b14a11a2fd	fix: change `name` for schema btree + fix benchmark	2025-08-02 17:17:36 -03:00
Jussi Saurio	130e1f80ea	fix/vdbe: call seek_to_last() only once in op_new_rowid	2025-08-02 14:18:58 +03:00
Pekka Enberg	2c05a3e787	Merge 'perf/vdbe: remove eager cloning in op_comparison' from Jussi Saurio Shaves off about 100-200ms of runtime from TPC-H `19.sql` Closes #2385	2025-08-02 10:01:47 +03:00
Jussi Saurio	be1456f7cb	Merge 'use state machine for NoConflict opcode' from Mikaël Francoeur This will save some work when yielding to IO. Previously, on every invocation, if the record was a packed record, we parsed it and iterated through the values to check for nulls. Now, the pre-seeking work is done only once. Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2394	2025-08-02 09:37:00 +03:00
Levy A.	1e177053cb	feat: add `RenameTable` instruction direct schema mutation, no reparsing	2025-08-01 21:11:25 -03:00
Mikaël Francoeur	81412b4a17	use state machine for NoConflict opcode	2025-08-01 17:29:57 -04:00
rajajisai	f6d43df46f	Merge branch 'tursodatabase:main' into issue/2077	2025-08-01 15:20:36 -04:00
rajajisai	d09dd4170b	Format code	2025-08-01 11:59:57 -07:00
Mikaël Francoeur	444a7bb5ac	wrap doc	2025-08-01 14:53:46 -04:00
rajajisai	30c059483e	Parse value as float if it cannot be parsed as integer(when the value cannot fit in i64)	2025-08-01 10:49:40 -07:00
rajajisai	7e84148883	Fix integer overflow check in number parser	2025-08-01 10:10:02 -07:00
Pere Diaz Bou	764523a8bb	core/mvcc: fix tests with state machines	2025-08-01 15:48:09 +02:00
Pere Diaz Bou	c3f00475eb	state_machine: rename transition -> step	2025-08-01 13:56:57 +02:00
Jussi Saurio	d58d71ad1b	perf/vdbe: remove eager cloning in op_comparison	2025-08-01 14:04:56 +03:00
Pere Diaz Bou	0f70e7101f	core/state_machine: move state_machine to its own file	2025-08-01 12:49:32 +02:00
Pere Diaz Bou	27757ab4eb	core/mvcc commit_txn generic state machinery Unfortunately it seems we are never reaching the point to remove state machines, so might as well make it easier to make. There are two points that must be highlighted: 1. There is a `StateTransition` trait implemented like: ```rust pub trait StateTransition { type State; type Context; fn transition<'a>(&mut self, context: &Self::Context) -> Result<TransitionResult>; fn finalize<'a>(&mut self, context: &Self::Context) -> Result<()>; fn is_finalized(&self) -> bool; } ``` where there exists `transition` which tries to move state forward, and `finalize` which marks the state machine as "finalized" so that no other call to finalize will forward the state; it will panic instead. 2. Before, we would store the state of a state machine inside the callee's struct, but I'm proposing we do something different where the callee will return the state machine and the caller will be responsible for advancing it. This way we don't need to track many reset operations in case of failures or rollbacks, and instead we could simply drop a state machine and all other nested state machines will drop in a cascade.	2025-08-01 12:36:02 +02:00
Pere Diaz Bou	5ad7d10790	core/mvcc: fix use of rwlock	2025-08-01 10:38:41 +02:00
Pere Diaz Bou	c4318cac36	core/mvcc: fix tests	2025-08-01 10:38:41 +02:00
Pere Diaz Bou	49a00ff338	core/mvcc: load table's rowid on initialization We need to load rowids into mvcc's store in order before doing any read in case there are rows. This has a performance penalty for now as expected because we should, ideally, scan for row ids lazily instead.	2025-08-01 10:38:41 +02:00
Pere Diaz Bou	b399ddea1b	core/mvcc: begin pager read txn on mvcc begin_txn	2025-08-01 10:38:41 +02:00
Pere Diaz Bou	b4ac38cd25	core/mvcc: persist writes on mvcc commit On Mvcc `commit_txn` we need to persist changes to database, for this case we re-use pager's semantics of transactions: 1. If there are no conflicts, we start `pager.begin_write_txn` 2. `pager.end_txn`: We flush changes to WAL 3. We finish Mvcc transaction by marking rows with new timestamp.	2025-08-01 10:38:41 +02:00
Jussi Saurio	d465abeced	Merge 'Open a temporary on-disk file for ephemeral tables' from Jussi Saurio Closes #2219 ## What Ephemeral tables and indexes should use a temporary database file instead of being backed only by memory. ## Why This makes them able to spill to disk when necessary when their page cache is nearing its memory limit. However, they should spill directly to the temporary database file without WAL journaling, since a WAL is not necessary (or even desirable) for ephemeral tables. Spilling is not implemented yet for any use case - this is just an enabler for it. ## Implementation details - Create random filename using `io.generate_random_number()` in platform-specific temporary directory - Make `pager.wal` an optional property again, removing `DummyWAL` - Remove `FileMemoryStorage` as it is never used Closes #2315	2025-08-01 11:06:08 +03:00
Jussi Saurio	7259751eba	Merge 'Support the OFFSET clause for Compound select' from meteorgan Closes #2376	2025-08-01 10:18:13 +03:00
Jussi Saurio	e147494642	pager: make WAL optional again and remove DummyWAL	2025-08-01 10:14:35 +03:00
Jussi Saurio	8c6293ebb7	VDBE: use temporary on-disk file for OpenEphemeral	2025-08-01 10:14:01 +03:00
meteorgan	6262ff4267	support offset for values	2025-08-01 00:46:46 +08:00
pedrocarlo	543cdb3e2c	underscoring completions and IOResult to avoid warning messages	2025-07-31 11:51:17 -03:00
Preston Thorpe	bd9df6262f	Merge 'IN queries' from Glauber Costa Implement IN queries. It is currently a todo!(), but my main motivation is that scavenging for EXPLAINs, that pattern, at least in simple queries like SELECT ... IN (1,2,3) uses the AddImm instruction we just added. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2342	2025-07-31 10:00:18 -04:00
Glauber Costa	9e8ba5263b	Implement the AddImm opcode It is a simple opcode. The hard part was finding a sqlite statement that uses it =)	2025-07-31 08:08:07 -05:00
Jussi Saurio	9e1fca2eba	vdbe: disallow checkpointing in interactive tx	2025-07-31 13:16:33 +03:00
Jussi Saurio	f619556344	Merge 'Direct `DatabaseHeader` reads and writes – `with_header` and `with_header_mut`' from Levy A. This PR introduces two methods to pager. Very much inspired by `with_schema` and `with_schema_mut`. `Pager::with_header` and `Pager::with_header_mut` will give the closure a shared and a unique reference, respectively, that are transmuted references from the `PageRef` buffer. This PR also adds type-safe wrappers for `Version`, `PageSize`, `CacheSize` and `TextEncoding`, as they have special in-memory representations. Writing the `DatabaseHeader` is just a single `memcpy` now. ```rs pub fn write_database_header(&self, header: &DatabaseHeader) { let buf = self.as_ptr(); buf[0..DatabaseHeader::SIZE].copy_from_slice(bytemuck::bytes_of(header)); } ``` `HeaderRef` and `HeaderRefMut` are used in the `with_header*` methods, but can also be used on their own when there are multiple reads and writes to the header, where putting everything in a closure would add too much nesting. Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2234	2025-07-31 10:02:47 +03:00
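
Note on commit 27757ab4eb (caller-driven state machines): the pattern described in that message can be sketched roughly as below. This is a minimal illustration, not the code in the repository. The trait is simplified (the `State` associated type and the unused lifetime parameters are dropped), and the `TransitionResult` variants, the string error type, `CountUp`, `StepSize`, `start_count` and `run_to_completion` are all hypothetical names made up for the example.

```rust
// Hypothetical result type: the commit message does not show the real variants.
#[derive(Debug, PartialEq)]
pub enum TransitionResult {
    Continue, // more work remains; call `transition` again
    Done,     // the machine has reached its final state
}

// Assumption: a plain String error stands in for the project's error type.
pub type Result<T> = std::result::Result<T, String>;

// Simplified version of the trait quoted in the commit message.
pub trait StateTransition {
    type Context;
    fn transition(&mut self, context: &Self::Context) -> Result<TransitionResult>;
    fn finalize(&mut self, context: &Self::Context) -> Result<()>;
    fn is_finalized(&self) -> bool;
}

// Toy context: how much progress each step makes.
pub struct StepSize(pub u32);

// Toy machine: counts up to `target` in increments taken from the context.
pub struct CountUp {
    current: u32,
    target: u32,
    finalized: bool,
}

impl StateTransition for CountUp {
    type Context = StepSize;

    fn transition(&mut self, context: &StepSize) -> Result<TransitionResult> {
        assert!(!self.finalized, "transition called after finalize");
        self.current = self.current.saturating_add(context.0).min(self.target);
        if self.current == self.target {
            Ok(TransitionResult::Done)
        } else {
            Ok(TransitionResult::Continue)
        }
    }

    fn finalize(&mut self, _context: &StepSize) -> Result<()> {
        // As in the commit message: a second finalize panics.
        assert!(!self.finalized, "finalize called twice");
        self.finalized = true;
        Ok(())
    }

    fn is_finalized(&self) -> bool {
        self.finalized
    }
}

// The callee only builds and returns the machine; it keeps no state itself.
pub fn start_count(target: u32) -> CountUp {
    CountUp { current: 0, target, finalized: false }
}

// The caller owns the machine and advances it. On an error the machine is
// simply dropped (along with anything nested in it); nothing has to be reset.
pub fn run_to_completion<M: StateTransition>(mut machine: M, context: &M::Context) -> Result<u32> {
    let mut steps = 0;
    loop {
        steps += 1;
        if machine.transition(context)? == TransitionResult::Done {
            machine.finalize(context)?;
            return Ok(steps);
        }
    }
}
```

A caller would write something like `run_to_completion(start_count(5), &StepSize(2))`. Because the machine is an ordinary owned value, abandoning the operation is just a matter of letting it go out of scope.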

1070 Commits