turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-22 02:34:20 +01:00

Author	SHA1	Message	Date
pedrocarlo	1f67d69e8e	forgot to set the state to NewTrunk if we have more leaf pages than free entries	2025-07-17 15:09:52 -03:00
pedrocarlo	b80218324d	fix merge conflicts	2025-07-17 12:25:31 -03:00
pedrocarlo	7b8eec90bd	edit state machine in Btree for freeing pages + Pager state machine for free_page	2025-07-17 12:24:43 -03:00
pedrocarlo	5771d1a00e	disable wal sync timeout on checkpoint	2025-07-17 12:24:43 -03:00
Pekka Enberg	1fc6126157	core/storage: Allocate page1 lazily for MVCC transactions	2025-07-17 14:13:22 +03:00
Levy A.	d0e26db01a	use lock for database schema	2025-07-16 13:54:39 -03:00
Levy A.	4c77d771ff	only copy schema on writes	2025-07-16 13:54:36 -03:00
Jussi Saurio	bb0c017d9f	Merge 'btree: fix trying to go upwards when we are already at the end of the entire btree' from Jussi Saurio ## What does this fix This PR fixes an issue with BTree upwards traversal logic where we would try to go up to a parent node in `next()` even though we are at the very end of the btree. This behavior can leave the cursor incorrectly positioned at an interior node when it should be at the right edge of the rightmost leaf. ## Why doesn't it cause problems on main This bug is masked on `main` by every table `insert()` (wastefully) calling `find_cell()`: - `op_new_rowid` called, let's say the current max rowid is `666`. Cursor is left pointing at `666`. - `insert()` is called with rowid `667`, cursor is currently pointing at `666`, which is incorrect. - `find_cell()` does a binary search every time, and hence somewhat accidentally positions the cursor correctly _after_ `666` so that the insert goes to the correct place ## Why was this issue found in #1988, I am removing `find_cell()` entirely in favor of always performing a seek to the correct location - and skipping `seek` when it is not required, saving us from wasting a binary search on every insert - but this change means that we need to call `next()` after `op_new_rowid` to have the cursor positioned correctly at the new insertion slot. Doing this surfaces this upwards traversal bug in that PR branch. ## Details of solution - Store `cell_count` together with `cell_idx` in pagestack, so that chlidren can know whether their parents have reached their end without doing IO - To make this foolproof, pin pages on `PageStack` so the page cache cannot evict them during tree traversal - `cell_indices` renamed to `node_states` since it now carries more information (cell index AND count, instead of just index) Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2005	2025-07-16 19:44:21 +03:00
Diego Reis	b86674adbb	Remove cache clearing in cacheflush	2025-07-16 11:11:52 -03:00
Jussi Saurio	8558675c4c	page cache: pin pages on the stack	2025-07-16 17:09:05 +03:00
Diego Reis	817ad8d50f	Separate user-callable cacheflush from internal cacheflush logic Cacheflush should only spill pages to WAL as non-commit frames, without checkpointing nor syncing. Check SQLite's sqlite3PagerFlush	2025-07-16 11:08:50 -03:00
Diego Reis	0e9771ac07	refactor: Change redundant "Status" enums to IOResult Let's unify the semantics of "something done" or yields I/O into a single type	2025-07-15 20:56:18 -03:00
Diego Reis	d0af54ae77	refactor: Change CursorResult to IOResult The reasoning here is to treat I/O operations (Either is "Done" or yields to IO) with the same generic type.	2025-07-15 20:52:25 -03:00
meteorgan	b42a1ef272	minor improvements based on PR comments	2025-07-15 16:34:07 +08:00
meteorgan	f123c77ee8	fix set page_size in pager	2025-07-15 16:34:07 +08:00
meteorgan	a6faab17e9	fix query page size	2025-07-15 16:34:07 +08:00
meteorgan	cf126824de	Support set page size	2025-07-15 16:34:07 +08:00
Pekka Enberg	55cf9c8f02	Merge 'Add async header accessor functionality' from Zaid Humayun This PR addresses https://github.com/tursodatabase/turso/issues/1828 in a phased manner. Making database header access async in one PR will be complicated. This PR ports adds an async API to `header_accessor.rs` and ports over some of `pager.rs` to use this API. This will allow gradual porting over of all call sites. Once all call sites are ported over, one mechanical rename will fix everything in the repo so we don't have any `<header_name>_async` functions. Also, porting header accessors over from sync to async would be a good way to get introduced to the Limbo codebase for first time contributors. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1966	2025-07-14 13:08:29 +03:00
Nils Koch	1a91966c7e	fix clippy errors for rust 1.88.0 (manual fix)	2025-07-12 18:58:55 +03:00
Nils Koch	828d4f5016	fix clippy errors for rust 1.88.0 (auto fix)	2025-07-12 18:58:41 +03:00
Zaid Humayun	90a5a53b0e	Added Async Header Accessor API's This commit introduces async header accessor API's in addition to the sync ones. Allows gradual porting instead of one big shot PR.	2025-07-12 09:38:18 +05:30
pedrocarlo	367002fb72	rename `change_schema` to `schema_did_change`	2025-07-07 11:58:16 -03:00
pedrocarlo	d8ad4a27f8	only finish appending frames when we are done in cacheflush	2025-07-07 11:53:45 -03:00
pedrocarlo	b85687658d	change instrumentation level to INFO	2025-07-07 11:53:45 -03:00
pedrocarlo	4639a4565f	change max_frame count only after wal sync in cacheflush	2025-07-07 11:53:45 -03:00
pedrocarlo	7c10ac01e6	`do_allocate_page` should return a `Result`	2025-07-07 11:53:45 -03:00
pedrocarlo	711b1ef114	make all `run_once` be run under statement or connection so that rollback is called	2025-07-07 11:51:25 -03:00
pedrocarlo	5559c45011	more instrumentation + write counter should decrement if pwrite fails	2025-07-07 11:50:21 -03:00
pedrocarlo	897426a662	add error tracing to relevant functions + rollback transaction in step_end_write_txn + make move_to_root return result	2025-07-07 11:50:21 -03:00
pedrocarlo	db005c81a0	add option to disable wal checkpoint	2025-07-03 12:04:17 -03:00
Pere Diaz Bou	5d856499c4	move update schema global on commit and not on rollback txn	2025-07-03 12:36:48 +02:00
Pere Diaz Bou	c799396c3d	rollback schema in connection	2025-07-03 12:36:48 +02:00
Pere Diaz Bou	65a7fe13cf	remove lock from private schema copy	2025-07-03 12:36:48 +02:00
Pere Diaz Bou	abf1699dd2	set scheam version and update shared schema in txn	2025-07-03 12:36:48 +02:00
KaguraMilet	aca08238d8	fix buffer pool is not thread safe problem	2025-07-01 16:06:55 +08:00
Pere Diaz Bou	486c4b69fb	WAL record db_size frame on commit last frame `db_size` is `>0` in case of last frame written of a transaction. This is necessary as we need to know -- while recovering wal contents -- that we have read a transaction fully instead of treating every frame as its own transaction.	2025-06-27 16:21:48 +02:00
Pekka Enberg	5791ab9dff	Merge 'Cache `reserved_space` and `page_size` values at Pager init to prevent doing redundant IO' from Krishna Vishal ### Problem Profiling revealed that `usable_space()` calls were consuming 60% of total execution time for simple SELECT queries, making Limbo approximately `6x` slower than SQLite for SELECT operations. The bottleneck was caused by `usable_space()` performing expensive I/O operations on every call to read `page_size` and `reserved_space` from the database header, despite `page_size` values being effectively immutable after database initialization. Only `reserved_space` is allowed to increase in SQLite. Evidence: https://share.firefox.dev/44tCUIy ### Solution Implemented OnceCell-based caching for both page_size and reserved_space values in the Pager struct: `page_size: OnceCell<u16>` - Page size is immutable after database initialization per SQLite specification `reserved_space: OnceCell<u8>` - Reserved space rarely changes and only grows, safe to cache ### Performance Impact Benchmark results: Simple SELECT query time reduced from ~2.89ms to ~1.29ms (~55% improvement) Closes #1852	2025-06-27 16:40:14 +03:00
Pere Diaz Bou	8e0f8041ed	properly set database header contents on initialization After moving page1 write to be async I moved the contents update to wrong place. This should fix it.	2025-06-27 11:44:11 +02:00
Krishna Vishal	cda1ab8d76	Use `OnceCell` instead of `OnceLock`.	2025-06-27 13:32:03 +05:30
Krishna Vishal	af2ab87810	Cache `reserved_space` and `page_size` values at Pager init. We use `OnceLock` for this. TODO: Invalidate reserved_space when we make functionality the to change it.	2025-06-27 12:51:11 +05:30
Pere Diaz Bou	aa93b70a96	empty -> unitialized	2025-06-26 17:59:23 +02:00
Pere Diaz Bou	4d80b8237d	write page1 on database initialization Page 1 must be initialized and written as soon as possible without marking page as dirty.	2025-06-26 14:44:23 +02:00
Pekka Enberg	a48198ec60	Merge '`Rollback` simple support' from Pere Diaz Bou Support for simple interactive rollback like: ```sql create table t (x); insert into t values (1); begin; insert into t values (2); rollback; select * from t; ``` This PR also fixes some other issues I found while debugging: * Checkpoint would never `clear_dirty` on pages in page cache. * Auto commit for interactive transactions was not respected so any `insert` after `begin` would flush frames regardless of `auto_commit` state. * `max_frame` on wal shared state was being updated after every `append_frame` which was incorrect, as another transaction would be able to use that new `max_frame` even tho the transaction could've rolled back. Instead we update the private copy of `max_frame` and only update it at the end. Follow up for later are savepoints which require implementing a subjournal to track savepoints and their modified pages. Closes #1825	2025-06-25 20:02:09 +03:00
Pere Diaz Bou	c02337c8cc	clear dirty pages on rollback	2025-06-25 14:01:53 +02:00
Pere Diaz Bou	a3ad138df8	checkpoint clear dirty page if it was on cache	2025-06-25 14:01:53 +02:00
Pere Diaz Bou	22f9cd695d	commit_txn track rollback case	2025-06-25 14:00:57 +02:00
Pere Diaz Bou	bdd2010df3	autocommit rollback	2025-06-25 14:00:57 +02:00
Jussi Saurio	27b3ecf599	core/db&pager: fix locking for initializing empty database When `struct Database` is constructed, store `is_empty` as an `Arc<AtomicBool>` - the value is true if: 1. DB size is zero 2. WAL has no frames When `struct Pager` is constructed, this `Arc` is simply cloned. When any connection runs a transaction it will first check `is_empty`, and if the DB is empty, it will lock `init_lock` and then check `is_empty` again, and if it's still true, it allocates page1 and stores `false` in the `is_empty` `AtomicBool` and drops the lock. --- Note that Limbo can currently have a zero DB and a WAL with frames, as we have no special logic for folding page1 to the main DB file during initialization. Page 1 allocation currently happens on the first transaction (read or write, due to having to support `select * from sqlite_schema` on an empty DB; we should really check how SQLite actually does this.).	2025-06-25 14:45:21 +03:00
Jussi Saurio	480f0a04b5	make clippy happy about mutating database_size immediately after default construction	2025-06-24 14:41:50 -03:00
Diego Reis	1921fcb943	Add comments to clarify current behaviour	2025-06-24 14:41:50 -03:00

1 2 3 4 5 ...

276 Commits