turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-19 01:24:20 +01:00

Author	SHA1	Message	Date
PThorpe92	e28a38abc5	Fix wal tag safety issues, and add debug assertion that we are reading the proper frames	2025-08-20 17:28:48 -04:00
PThorpe92	d2c3ba14c8	Remove inefficient vec in WAL for tracking pages present in frame cache	2025-08-20 17:28:18 -04:00
PThorpe92	00f2a0f216	Performance improvements to checkpointing. prevent serializing I/O	2025-08-20 17:26:54 -04:00
Pekka Enberg	c2208a542a	Merge 'Initial pass to support per page encryption' from Avinash Sajjanshetty This patch adds support for per page encryption. The code is of alpha quality, was to test my hypothesis. All the encryption code is gated behind a `encryption` flag. To play with it, you can do: ```sh cargo run --features encryption -- database.db turso> PRAGMA key='turso_test_encryption_key_123456'; turso> CREATE TABLE t(v); ``` Right now, most stuff is hard coded. We use AES GCM 256. This information is not stored anywhere, but in future versions we will start saving this info in the file. When writing to disk, we will generate a cryptographically secure random salt, use that to encrypt the page. Then we will store the authentication tag and the salt in the page itself. To accommodate this encryption hardcodes reserved space of 28 bytes. Once the key is set in the connection, we propagate that information to pager and the WAL, to encrypt / decrypt when reading from disk. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2567	2025-08-20 11:11:24 +03:00
Avinash Sajjanshetty	40a209c000	simplify feature flag usage for encryption	2025-08-20 12:49:38 +05:30
Avinash Sajjanshetty	bd9b4bbfd2	encrypt/decrypt when writing/reading from DB	2025-08-20 11:47:23 +05:30
Avinash Sajjanshetty	94d38be1a2	Set reserved_space to 28 for encrypted databases We will use this space to store nonce and tag	2025-08-20 11:39:09 +05:30
Avinash Sajjanshetty	a6e9237c94	Set encryption key in pager and WAL	2025-08-20 11:39:09 +05:30
pedrocarlo	46c756b130	clear locked on pages when completion errors	2025-08-19 17:29:57 -03:00
pedrocarlo	de1811dea7	abort completions on error	2025-08-19 10:48:21 -03:00
pedrocarlo	ab3b68e360	change completion callbacks to take a Result param + create separate functions to declare a completion errored	2025-08-19 10:48:21 -03:00
Jussi Saurio	33eb730ef8	finish_read_page() never fails, so it does not need to return Result	2025-08-19 15:44:16 +03:00
pedrocarlo	aa7f8fbfc4	Tripwire for Wal Syncing + set syncing to true before calling sync	2025-08-18 15:34:20 -03:00
Nikita Sivukhin	71bbc36f61	adjust log message	2025-08-15 13:19:46 +04:00
Nikita Sivukhin	2361587d81	cargo fmt	2025-08-15 13:18:50 +04:00
Nikita Sivukhin	9b08f8c0f2	set in-memory WAL information according to last commited frame - before that pages_in_frames, frame_cache and last_checksum was set to the latest written (not commited!) frame in the WAL found on disk	2025-08-15 13:13:33 +04:00
PThorpe92	5a7c9325e2	Add assertion for expected write amount in writev callback	2025-08-14 10:53:16 -04:00
Jussi Saurio	bd8c6f3c7c	make PageSize more robust: only accept literal '1' value if it comes directly from db header	2025-08-14 12:40:58 +03:00
Jussi Saurio	a2a88e2c69	Make exception for page size literal value 1	2025-08-14 12:40:58 +03:00
Jussi Saurio	bb21bd93da	Use type-safe PageSize newtype for pager.page_size	2025-08-14 12:40:58 +03:00
Jussi Saurio	fd72a2ff20	Fix: do computations on usable_space as usize, not as u16 Otherwise page size 65536 will not work as casting to u16 will make it wrap around to 0.	2025-08-13 17:20:29 +03:00
pedrocarlo	8d48be0e61	cleanup	2025-08-13 10:24:55 +03:00
pedrocarlo	925d4b15e1	adjust more pager IO returns	2025-08-13 10:24:55 +03:00
pedrocarlo	a7f5912e7d	adjust wal checkpoint to return completions	2025-08-13 10:24:55 +03:00
pedrocarlo	2e68296107	create `IOCompletions`	2025-08-13 10:24:55 +03:00
PThorpe92	c098a48924	Remove clear dirty from checkpoint state machine in sratch page	2025-08-11 16:40:05 -04:00
PThorpe92	688851c97a	Take instead of clone vec of buffers in write_pages_vectored	2025-08-11 16:18:43 -04:00
PThorpe92	e2896d2f95	Minor improvement in flush api	2025-08-11 16:01:41 -04:00
PThorpe92	1f554c2707	Cleanup some minor checkpointing issues	2025-08-11 15:40:36 -04:00
PThorpe92	f81bc3236a	Use a single packed u64 for LimboRwLock and relax memory ordering	2025-08-09 14:08:25 -04:00
PThorpe92	66964fd8d2	Add documentation and comments to new buffer pool, add get_frame api	2025-08-08 10:55:28 -04:00
PThorpe92	dffa47b048	Use temp buffer for wal header	2025-08-08 10:55:27 -04:00
PThorpe92	4ffb273b53	Adjust IO to use new buffer pool and buffer API	2025-08-08 10:55:26 -04:00
Preston Thorpe	7a793b818d	Merge 'perf: a few small insert optimizations' from Jussi Saurio 1. We spend a lot of time in `cell_get_raw_region` in the balancing routine, and especially calling `contents.page_type()` there a lot, so extract a version that can take some precomputed arguments so those don't have to be redundantly computed multiple times for successive calls where those values are going to be the same 2. Avoid calling `self.usable_space()` in a loop in `insert_into_page()`. 3. Avoid accessing `pages_in_frames` lock if we're not going to modify it main improvement is to the "insert 100 rows" bench which ends up doing balancing a lot: ``` Insert rows in batches/limbo_insert_1_rows time: [22.856 µs 24.342 µs 27.496 µs] change: [-3.3579% +15.495% +67.671%] (p = 0.62 > 0.05) No change in performance detected. Benchmarking Insert rows in batches/limbo_insert_10_rows: Collecting 100 samples in estim Insert rows in batches/limbo_insert_10_rows time: [32.196 µs 32.604 µs 32.981 µs] change: [+1.3253% +2.9177% +4.5863%] (p = 0.00 < 0.05) Performance has regressed. Insert rows in batches/limbo_insert_100_rows time: [89.425 µs 92.105 µs 96.304 µs] change: [-18.317% -13.605% -9.1022%] (p = 0.00 < 0.05) Performance has improved. ``` Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2483	2025-08-07 21:33:30 -04:00
Jussi Saurio	1fe32dadf3	PageContent: make read_x/write_x methods private and add dedicated methods Problem: A very easy source of bugs is to mistakenly use e.g. PageContent::read_u16() instead of PageContent::read_u16_no_offset(). The difference between the two is that `read_u16()` adds 100 bytes to the requested byte offset if and only if the page in question is page 1, which contains a 100-byte database header. Case in point: see #2491. Observation: In all of the cases where we want to read from or write to a page "header-sensitively", those reads/writes are to so-called "well known offsets", e.g. specific bytes in a btree page header. In all other cases, the "no-offset" versions, i.e. the ones taking the absolute byte offset as parameter, should be used. Solution: 1. Make all the offset-sensitive versions (read_u16() and friends) private methods of `PageContent`. 2. Expose dedicated methods for things like updating rightmost pointer, updating fragmented bytes count and so on, and use them instead of the plain read/write methods universally.	2025-08-07 17:00:06 +03:00
Jussi Saurio	4b27cc0d46	btree: add fast path version of cell_get_raw_region	2025-08-07 09:57:56 +03:00
Jussi Saurio	c98136c8c4	btree: use new cell start helper method in cell_get_raw_region	2025-08-07 09:37:33 +03:00
Jussi Saurio	3db25cf84c	perf/btree: add method for getting raw offset of cell payload start	2025-08-07 09:34:05 +03:00
PThorpe92	f6a68cffc2	Remove RefCell from IO and Page apis	2025-08-05 16:24:49 -04:00
PThorpe92	914c10e095	Remove Clone impl for Buffer and PageContent	2025-08-05 14:26:53 -04:00
Jussi Saurio	a28e64bfdd	cleanup: remove unused page uptodate flag	2025-08-05 14:25:42 +03:00
Jussi Saurio	5b84ad6b0f	Merge 'Update defragment page to defragment in-place' from João Severo Change original code from doing a full copy of the original buffer to modify the buffer in-place using a temporary vector with offsets. Closes #2258	2025-08-05 11:22:22 +03:00
Nikita Sivukhin	2e23230e79	extend raw WAL API with few more methods - try_wal_watermark_read_page - try to read page from the DB with given WAL watermark value - wal_changed_pages_after - return set of unique pages changed after watermark WAL position	2025-08-04 16:55:50 +04:00
Joao Severo	785c3fdb03	Split into two methods	2025-08-02 13:24:12 +03:00
pedrocarlo	543cdb3e2c	underscoring completions and IOResult to avoid warning messages	2025-07-31 11:51:17 -03:00
Jussi Saurio	7d082ab614	small fix after header accessor refactor	2025-07-31 10:05:52 +03:00
Jussi Saurio	f619556344	Merge 'Direct `DatabaseHeader` reads and writes – `with_header` and `with_header_mut`' from Levy A. This PR introduces two methods to pager. Very much inspired by `with_schema` and `with_schema_mut`. `Pager::with_header` and `Pager::with_header_mut` will give to the closure a shared and unique reference respectively that are transmuted references from the `PageRef` buffer. This PR also adds type-safe wrappers for `Version`, `PageSize`, `CacheSize` and `TextEncoding`, as they have special in-memory representations. Writing the `DatabaseHeader` is just a single `memcpy` now. ```rs pub fn write_database_header(&self, header: &DatabaseHeader) { let buf = self.as_ptr(); buf[0..DatabaseHeader::SIZE].copy_from_slice(bytemuck::bytes_of(header)); } ``` `HeaderRef` and `HeaderRefMut` are used in the `with_header*` methods, but also can be used on its own when there are multiple reads and writes to the header, where putting everything in a closure would add too much nesting. Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2234	2025-07-31 10:02:47 +03:00
PThorpe92	ade1c182de	Add is_full method to checkpoint batch	2025-07-30 19:42:54 -04:00
PThorpe92	693b71449e	Clean up writev batching and apply suggestions	2025-07-30 19:42:53 -04:00
PThorpe92	ef69df7258	Apply review suggestions	2025-07-30 19:42:53 -04:00

1 2 3 4 5 ...

278 Commits