This patch pushes unsafe Send and Sync down to individual components instead of declaring them at the Database level. This makes it easier for us to fix thread-safety incrementally while preventing developers from adding more thread-unsafe code.
This adds support for running the simulator under Miri to detect UB.
There are a few things to note about Miri and its limitations:
- It has limited `libc` coverage, so it's not really possible to have Miri help with `UringIO`/`UringFile` or `UnixIO`/`UnixFile`. That's a big gap ☹️
- It **can** work for `GenericIO`/`GenericFile`, which only uses `std`
- It can't call external C libraries, so even using `sqlite` is out (hence adding `--disable-integrity-check` to the simulator for Miri use)
- It runs on nightly, so there are a few new lints that don't exist on turso's pinned version of rustc
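For reference, the invocation shape looks roughly like this (the simulator package name is a placeholder; only the `--disable-integrity-check` flag comes from this patch):
```console
$ rustup +nightly component add miri
$ cargo +nightly miri run -p <simulator-package> -- --disable-integrity-check
```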
Some questions I have about this PR:
- I made `GenericFile::{lock_file,unlock_file}` no-ops so I could use `GenericIO`. This isn't great, but if/when you update from Rust 1.88.0 to 1.89.0, `std::fs::File::{lock,lock_shared,unlock}` will be stabilized and available. Should I note that as a TODO or something?
- Previously, the sim runner shelled out to `git` to get stuff like the
current git hash and the repo directory. For Miri, that's out, and so is
`git2`. Unfortunately, `gix` is also out since it has a required
dependency that uses inline assembly, which Miri doesn't like. I wrote a
hacky shim that uses only std to look for `.git` and find the hash that
HEAD is pointing to. It doesn't deal with stuff like packed-refs or the
repo being a secondary one made with `git worktree`. I'm happy to
support that, but wanted to hear from maintainers before doing more
work.
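For concreteness, a minimal sketch of that kind of std-only shim (names invented; it deliberately shares the limitations listed above):
```rust
use std::fs;
use std::path::{Path, PathBuf};

// Walk up from `start` to find a plain `.git` directory (no worktree support).
fn find_git_dir(start: &Path) -> Option<PathBuf> {
    start.ancestors().map(|d| d.join(".git")).find(|p| p.is_dir())
}

// Resolve the commit hash HEAD points to, handling only loose refs.
fn head_hash(git_dir: &Path) -> Option<String> {
    let head = fs::read_to_string(git_dir.join("HEAD")).ok()?;
    let head = head.trim();
    match head.strip_prefix("ref: ") {
        // Symbolic ref, e.g. "ref: refs/heads/main": read the loose ref file.
        Some(r) => Some(fs::read_to_string(git_dir.join(r)).ok()?.trim().to_string()),
        // Detached HEAD: the file already contains the hash.
        None => Some(head.to_string()),
    }
}
```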
Two UB occurrences I already found:
- `TursoRwLock::read` used `AtomicU64::compare_exchange_weak`, which is (evidently) [allowed to spuriously fail](https://doc.rust-lang.org/std/sync/atomic/struct.AtomicU64.html#method.compare_exchange_weak) in exchange for perf. Miri forces this behavior, which triggers trivial read deadlocks even with zero readers/writers. I changed it to `compare_exchange`, but I'm not an atomics expert.
- Uninitialized read in non-Unix
`core::storage::buffer_pool::arena::alloc`. This is a simple one,
resolved by using `std::alloc::alloc_zeroed` instead of
`std::alloc::alloc`
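To make the `compare_exchange_weak` point concrete, here's a minimal sketch of the read-acquire pattern (names and bit layout invented, not the actual `TursoRwLock`):
```rust
use std::sync::atomic::{AtomicU64, Ordering};

const WRITER: u64 = 1 << 63; // hypothetical writer bit

struct LockWord(AtomicU64);

impl LockWord {
    fn read(&self) -> bool {
        let mut cur = self.0.load(Ordering::Relaxed);
        loop {
            if cur & WRITER != 0 {
                return false; // a writer genuinely holds the lock
            }
            // `compare_exchange` fails only when the value really changed,
            // and the Err arm hands back the observed value so we can retry.
            // A caller that treats a *spurious* `compare_exchange_weak`
            // failure as contention will see deadlocks that aren't there.
            match self.0.compare_exchange(cur, cur + 1, Ordering::Acquire, Ordering::Relaxed) {
                Ok(_) => return true,        // reader count bumped
                Err(actual) => cur = actual, // raced; retry with fresh value
            }
        }
    }
}
```
And the shape of the arena fix (function name invented):
```rust
use std::alloc::{alloc_zeroed, Layout};

// Zero-initialized memory makes reads of not-yet-written bytes defined;
// plain `alloc` returns uninitialized memory that Miri flags on first read.
// Caller must pass a non-zero size and free with the same layout.
unsafe fn new_chunk(size: usize) -> *mut u8 {
    let layout = Layout::from_size_align(size, std::mem::align_of::<usize>()).unwrap();
    alloc_zeroed(layout)
}
```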
Moving forward, I'd be interested in potentially getting the tests to
run in Miri, too. `tokio` looks like a good example of a project with
partial coverage that runs it where they can. They have some extra test
config to allow as many as possible to run under Miri, with
appropriately scaled-down parameter values since Miri is super slow.
Closes#3720
I've added a trait called `CursorTrait`. I know it's not a good name for now, but I didn't know what to rename the enum `Cursor` to. This trait wraps all the common functionality, plus some functionality that is still too specific and needs to be fixed.
This is needed in order to have layered cursors where, for example, MvccCursor will need a fallback BTreeCursor.
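As a rough sketch of the shape (method names invented for illustration):
```rust
/// Common cursor operations; the real trait wraps more than this.
trait CursorTrait {
    fn rewind(&mut self);
    fn next(&mut self) -> bool;
    fn rowid(&self) -> Option<i64>;
}

/// A layered cursor: MVCC lookups would consult in-memory version state
/// first and delegate to an inner B-tree cursor for checkpointed rows.
struct MvccCursor<Inner: CursorTrait> {
    fallback: Inner,
}

impl<Inner: CursorTrait> CursorTrait for MvccCursor<Inner> {
    fn rewind(&mut self) {
        self.fallback.rewind();
    }
    fn next(&mut self) -> bool {
        // Real logic would merge MVCC versions with B-tree rows;
        // plain delegation just shows the layering.
        self.fallback.next()
    }
    fn rowid(&self) -> Option<i64> {
        self.fallback.rowid()
    }
}
```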
Closes#3660
If we don't clear the dirty pages, we will initiate a rollback. During the rollback, we will attempt to clear the whole page cache, which will then panic because there will still be dirty pages from the failed writev.
Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com>
Closes#3189
- add index root pages to list of root pages to check
- check for dangling (unused) pages
```console
$ cargo run wut.db
turso> .mode list
turso> pragma integrity_check;
Page 3: never used
Page 4: never used
Page 7: never used
Page 8: never used
```
```console
$ sqlite3 wut.db 'pragma integrity_check;'
*** in database main ***
Page 3: never used
Page 4: never used
Page 7: never used
Page 8: never used
```
Closes#3613
Yield is a completion that does not allocate any inner state. By design it is completed from the start and has no errors. This allows a lightweight yield without taking any locks or heap-allocating inner state.
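Roughly, the idea looks like this (names invented, not the actual completion API):
```rust
use std::sync::Arc;

struct IoState {
    done: bool, // real code would hold a result, waker, etc.
}

enum Completion {
    Yield,            // complete from the start, cannot fail
    Io(Arc<IoState>), // ordinary completions carry shared inner state
}

impl Completion {
    fn is_completed(&self) -> bool {
        match self {
            Completion::Yield => true,           // nothing to lock or allocate
            Completion::Io(state) => state.done, // normal path inspects state
        }
    }
}
```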
This PR adds a proper program abort in case of an unfinished statement reset or interruption.
Also, this PR makes rollback methods non-failing, because otherwise it is usually unclear to their callers what happened (if a rollback failed, what is the state of the statement/connection/transaction?).
Reviewed-by: Preston Thorpe <preston@turso.tech>
Closes#3591
Fixes a page cache staleness issue where connections could incorrectly believe the database hasn't changed after checkpointing. This can happen when writes following a checkpoint result in the same `max_frame` value, causing connections to miss updates since they only checked `max_frame` to detect changes.
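As a hypothetical illustration of why the single counter is ambiguous (types and fields invented, not the actual fix):
```rust
// A checkpoint can reset the WAL, after which new writes grow it back to
// the same `max_frame`; pairing the frame count with a checkpoint sequence
// number distinguishes the two states.
#[derive(Clone, Copy, PartialEq)]
struct WalSnapshot {
    checkpoint_seq: u64,
    max_frame: u64,
}

fn db_changed(last_seen: WalSnapshot, current: WalSnapshot) -> bool {
    last_seen != current
}
```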
Closes#3502
This is a follow-up to PR #3457, which requires users to opt in to enable encryption. This patch:
- Makes appropriate changes to the Whopper and encryption throughput tests
- Updates the Rust bindings to pass the encryption options properly
- Adds a test for the Rust bindings
To use encryption in the Rust bindings, one needs to do:
```rust
let opts = EncryptionOpts {
    hexkey: "b1bbfda...02a5669fc76327".to_string(),
    cipher: "aegis256".to_string(),
};
let builder = Builder::new_local(&db_file)
    .experimental_encryption(true)
    .with_encryption(opts.clone());
let db = builder.build().await.unwrap();
```
We will remove the `experimental_encryption` flag once the feature is stable.
Closes#3532
The page cache implementation uses a pre-allocated vector (`entries`) with fixed capacity, along with a custom hash map and freelist. This design requires an expensive upfront allocation when creating a new connection, which severely impacted performance in workloads that open many short-lived connections (e.g., our concurrent write benchmarks that create a new connection per transaction).
Therefore, replace the pre-allocated vector with an intrusive doubly-linked list. This eliminates the page cache initialization overhead from connection establishment and also reduces memory usage to only the entries that are actually used. Furthermore, the approach allows us to grow the page cache with much less overhead.
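A minimal sketch of the intrusive layout (field names invented):
```rust
use std::ptr;

// The prev/next links live inside each entry, so there is no pre-allocated
// side vector and an empty cache costs only a few words.
struct PageCacheEntry {
    page_id: usize,
    prev: *mut PageCacheEntry,
    next: *mut PageCacheEntry,
    // ... page data would live here
}

struct PageCache {
    head: *mut PageCacheEntry, // most recently used
    tail: *mut PageCacheEntry, // least recently used, evicted first
    len: usize,
}

impl PageCache {
    fn new() -> Self {
        // No upfront allocation at connection open.
        PageCache {
            head: ptr::null_mut(),
            tail: ptr::null_mut(),
            len: 0,
        }
    }
}
```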
The patch improves the concurrent write throughput benchmark by 4x in single-threaded runs.
Before:
```
$ write-throughput --threads 1 --batch-size 100 -i 1000 --mode concurrent
Running write throughput benchmark with 1 threads, 100 batch size, 1000 iterations, mode: Concurrent
Database created at: write_throughput_test.db
Thread 0: 100000 inserts in 3.82s (26173.63 inserts/sec)
```
After:
```
$ write-throughput --threads 1 --batch-size 100 -i 1000 --mode concurrent
Running write throughput benchmark with 1 threads, 100 batch size, 1000 iterations, mode: Concurrent
Database created at: write_throughput_test.db
Thread 0: 100000 inserts in 0.90s (110848.46 inserts/sec)
```
Closes#3456
SQLite has a crazy easter egg where, at the 1 GiB file offset, it creates a `PENDING_BYTE_PAGE` that is used only by the VFS layer and is never read or written.
To properly test this, I took inspiration from the SQLite testing framework and defined a helper method that is conditionally compiled with the `test_helper` feature enabled.
https://github.com/sqlite/sqlite/blob/7e38287da43ea3b661da3d8c1f431aa907d648c9/src/main.c#L4327
As the `PENDING_BYTE` is normally at the 1 GiB mark, I created a function that modifies the static `PENDING_BYTE` atomic to whatever value we want. This means we can test this unusual behaviour at any DB file size we want.
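In spirit, the helper looks something like this (names invented, not the actual code):
```rust
use std::sync::atomic::{AtomicU64, Ordering};

// Normally the pending byte sits at the 1 GiB mark; a test-only setter lets
// us exercise PENDING_BYTE_PAGE handling on arbitrarily small files.
static PENDING_BYTE: AtomicU64 = AtomicU64::new(0x4000_0000); // 1 GiB

#[cfg(feature = "test_helper")]
pub fn set_pending_byte(offset: u64) {
    PENDING_BYTE.store(offset, Ordering::SeqCst);
}

// Page numbers are 1-based, matching SQLite's convention.
pub fn pending_byte_page(page_size: u64) -> u64 {
    PENDING_BYTE.load(Ordering::SeqCst) / page_size + 1
}
```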
`fuzz_pending_byte_database` is the test that fuzzes different pending byte offsets and runs an integrity check at the end to confirm we are compatible with SQLite.
Closes#2749
<img width="1100" height="740" alt="image" src="https://github.com/user-attachments/assets/06eb258f-b4b4-47bf-85f9-df1cf411e1df" />
Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com>
Closes#3431
**Handle table ID / rootpages properly for both checkpointed and non-
checkpointed tables**
Table ID is an opaque identifier that is only meaningful to the MV
store.
Each checkpointed MVCC table corresponds to a single B-tree on the
pager,
which naturally has a root page.
**We cannot use root page as the MVCC table ID directly because:**
- We assign table IDs during MVCC commit, but
- we commit pages to the pager only during checkpoint
which means the root page is not easily knowable ahead of time.
**Hence:**
- MVCC table IDs are always negative
- sqlite_schema rows will have a negative rootpage column if the table has not been checkpointed yet
- on checkpoint, when the table is allocated a real root page, we update the row in sqlite_schema and in the MV store's internal mapping
**On recovery:**
- All sqlite_schema tables are read directly from disk and assigned
`table_id = -1 * root_page` -- root_page on disk must be positive
- Logical log is deserialized and inserted into MV store
- Schema changes from logical_log are captured into the DB's global
schema
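As a tiny illustration of the ID rule in the first recovery bullet (helper name invented):
```rust
fn table_id_for_checkpointed(root_page: i64) -> i64 {
    // root_page on disk must be positive; the MVCC table ID is its negation.
    assert!(root_page > 0);
    -root_page
}
```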
**Note about recovery:**
I changed MVCC recovery to happen on DB initialization, which should prevent any races, so there's no need for `recover_lock`, right @pereman2?
Closes#3419