turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-19 17:34:19 +01:00

Author	SHA1	Message	Date
pedrocarlo	aa7f8fbfc4	Tripwire for Wal Syncing + set syncing to true before calling sync	2025-08-18 15:34:20 -03:00
PThorpe92	605b3e233f	Move in memory reset logic to before truncation in wal restart	2025-08-16 17:42:22 -04:00
PThorpe92	eae68f3669	Update stale in memory wal header after restarting log	2025-08-16 17:36:48 -04:00
PThorpe92	2c526c4c37	Add io_yield_x macros to reduce boilerplate	2025-08-16 16:14:00 -04:00
Jussi Saurio	c2e89f94f8	Change more page size panics to corrupt errors	2025-08-14 12:40:58 +03:00
Jussi Saurio	f94fa2bbbe	salt tweak	2025-08-14 12:40:58 +03:00
Jussi Saurio	c75e4c1092	Fix non-4096 page sizes by making WAL header lazy	2025-08-14 12:40:58 +03:00
Nikita Sivukhin	a78992e309	add assertion with more context	2025-08-13 15:23:18 +04:00
pedrocarlo	187ffb5698	adjust test	2025-08-13 10:24:55 +03:00
pedrocarlo	56905fc340	refactor `checkpoint` in pager	2025-08-13 10:24:55 +03:00
pedrocarlo	a7f5912e7d	adjust wal checkpoint to return completions	2025-08-13 10:24:55 +03:00
pedrocarlo	10ad43d9e0	sync should return completion	2025-08-13 10:24:55 +03:00
pedrocarlo	a7873d8907	remove write counters from Wal impls	2025-08-13 10:24:55 +03:00
Jussi Saurio	43628e2a9a	Merge 'IO Cleanups to use and ' from Pedro Muniz Depends on #2512 . This is an attempt to remove all the noise from my IO completions refactor into separate PRs Closes #2566	2025-08-13 10:24:15 +03:00
pedrocarlo	85e86d427b	cleanups - use io.block in many functions and `return_if_io`	2025-08-13 08:32:38 +03:00
Nikita Sivukhin	1ac3d632ca	support checkpoint from remote	2025-08-12 19:53:04 +04:00
Pekka Enberg	0285fddd85	Merge 'Fix max_frame determination and comments in WAL checkpointing' from Preston Thorpe in #2521, I messed up and introduced improper calculation of the current checkpoint's max safe frame (mostly due to incorrect comments that I had left on the method). The confusion partially stems from our lack of Busy handling at the moment, but essentially when determining the max safe frame for all readers, for passive mode we cannot simply `break` out of the loop when we find a reader with a lower read mark than we have, because _another_ reader might have an even _lower_ read mark, and we could proceed with the first mark < shared_max. And for !passive modes, we still attempt to backfill with the same lower frame, we just return `Busy` at the end, after backfilling what we can (we just don't reset the log for restart/truncate). Most of the changes in this PR is just the renaming the fields of Checkpoint Result, because the names were confusing Closes #2560	2025-08-12 18:24:59 +03:00
Nikita Sivukhin	1d8ed9aa55	add methods for read/write schema cookie	2025-08-12 17:29:56 +04:00
PThorpe92	c3b536957f	Fix max_frame determination and comments in WAL checkpointing	2025-08-12 08:57:35 -04:00
PThorpe92	c098a48924	Remove clear dirty from checkpoint state machine in sratch page	2025-08-11 16:40:05 -04:00
PThorpe92	e2896d2f95	Minor improvement in flush api	2025-08-11 16:01:41 -04:00
PThorpe92	1f554c2707	Cleanup some minor checkpointing issues	2025-08-11 15:40:36 -04:00
PThorpe92	5061617363	Fix bug in checkpoitn that sets nbackfills and lets readers skip mapping the wal	2025-08-09 16:43:05 -04:00
PThorpe92	0ba52a489e	Fix check in CheckpointState::Start for work after determining safe frame	2025-08-09 15:45:47 -04:00
PThorpe92	23f30a481a	Add tests for Full checkpoint mode	2025-08-09 15:45:47 -04:00
PThorpe92	cd7fb2f329	Add Full checkpoint mode in WAL	2025-08-09 15:45:45 -04:00
PThorpe92	01f281bae5	Fix doc test/comment	2025-08-09 14:41:15 -04:00
PThorpe92	3d518baef5	Fix assertion to make sure a reader or writer exists when unlock is called	2025-08-09 14:40:03 -04:00
PThorpe92	f81bc3236a	Use a single packed u64 for LimboRwLock and relax memory ordering	2025-08-09 14:08:25 -04:00
PThorpe92	faf248df03	Add more docs and comments for TempBufferCache	2025-08-08 10:55:28 -04:00
PThorpe92	66964fd8d2	Add documentation and comments to new buffer pool, add get_frame api	2025-08-08 10:55:28 -04:00
PThorpe92	4ffb273b53	Adjust IO to use new buffer pool and buffer API	2025-08-08 10:55:26 -04:00
Jussi Saurio	c5bdbe306d	perf/wal: avoid accessing pages_in_frames unless necessary	2025-08-07 10:27:11 +03:00
Nikita Sivukhin	b612259a3a	more friendly copmletely runtime agnostic turso-sync-engine crate	2025-08-06 19:26:55 +04:00
PThorpe92	f6a68cffc2	Remove RefCell from IO and Page apis	2025-08-05 16:24:49 -04:00
Jussi Saurio	13219dbf87	Merge 'extend raw WAL API with few more methods' from Nikita Sivukhin This PR extends raw WAL API with few methods which will be helpful for offline-sync: 1. `try_wal_watermark_read_page` - try to read page from the DB with given WAL watermark value\ * Usually, WAL max_frame is set automatically to the latest value (`shared.max_frame`) when transaction is started and then this "watermark" is preserved throughout whole transaction * New method allows to simulate "read from the past" by controlling frame watermark explicitly * There is an alternative to implement some API like `start_read_session(frame_watermark: u64)` - but I decided to expose just single method to simplify the logic and reduce "surface" of actions which can be executed in this "controllable" manner * Also, for simplicity, now `try_wal_watermark_read_page` always read data from disk and bypass any cached values (and also do not populate the cache) 2. `wal_changed_pages_after` - return set of unique pages changed after watermark WAL position in the current WAL session With these 2 methods we can implement `REVERT frame_watermark` logic which will just fetch all changed pages first, and then revert them to the previous value by using `try_wal_watermark_read_page` and `wal_insert_frame` methods (see `test_wal_api_revert_pages` test). Note, that if there were schema changes - than `REVERT` logic described above can bring connection to the inconsistent state, as it will preserve schema information in memory and will still think that table exist (while it can be reverted). This should be considered by any consumer of this new methods. Closes #2433	2025-08-04 23:53:46 +03:00
Nikita Sivukhin	9694366645	add one more assert	2025-08-04 17:23:34 +04:00
Nikita Sivukhin	76bdf0c1ab	small fixes	2025-08-04 17:02:53 +04:00
Nikita Sivukhin	2e23230e79	extend raw WAL API with few more methods - try_wal_watermark_read_page - try to read page from the DB with given WAL watermark value - wal_changed_pages_after - return set of unique pages changed after watermark WAL position	2025-08-04 16:55:50 +04:00
Jussi Saurio	4f3f66d55e	fix/wal: remove start_pages_in_frames_hack to prevent checkpoint data loss We have some kind of transaction-local hack (`start_pages_in_frames`) for bookkeeping how many pages are currently in the in-memory WAL frame cache, I assume for performance reasons or whatever. `wal.rollback()` clears all the frames from `shared.frame_cache` that the rollbacking tx is allowed to clear, and then truncates `shared.pages_in_frames` to however much its local `start_pages_in_frames` value was. In `complete_append_frame`, we check if `frame_cache` has that key (page) already, and if not, we add it to `pages_in_frames`. However, `wal.rollback()` never _removes_ the key (page) if its value is empty, so we can end up in a scenario where the `frame_cache` key for `page P` exists but has no frames, and so `page P` does not get added to `pages_in_frames` in `complete_append_frame`. This leads to a checkpoint data loss scenario: - transaction rolls back, has start_pages_in_frames=0, so truncates shared pages_in_frames to an empty vec. let's say `page P` key in `frame_cache` still remains but it has no frames. - The next time someone commits a frame for `page P`, it does NOT get added to `pages_in_frames` because `frame_cache` has that key - At some point, a PASSIVE checkpoint checkpoints `n` frames, but since `pages_in_frames` does not have `page P`, it doesn't actually checkpoint it and all the "checkpointed" frames are simply thrown away - very similar to the scenario in #2366 Remove the `start_pages_in_frames` hack entirely and just make `pages_in_frames` effectively the same as `frame_cache.keys`. I think we could also just get rid of `pages_in_frames` and just use `frame_cache.contains_key(p)` but maybe Pere can chime in here	2025-08-04 10:35:12 +03:00
Pere Diaz Bou	c807b035c5	core/mvcc: fix tests again had to create connections for every different txn	2025-08-01 10:44:19 +02:00
Jussi Saurio	2233bb41c3	Merge 'fix/wal: reset ongoing checkpoint state when checkpoint fails' from Jussi Saurio ## What The following sequence of actions is possible: ```sql -- TRUNCATE checkpoint fails during WAL restart, -- but OngoingCheckpoint.state is still left at Done for conn 0 Connection 0(op=23): PRAGMA wal_checkpoint(TRUNCATE) Connection 0(op=23) Checkpoint TRUNCATE: OK: false, wal_page_count: NULL, checkpointed_count: NULL -- TRUNCATE checkpoint succeeds for conn 1 Connection 1(op=26): PRAGMA wal_checkpoint(TRUNCATE) Connection 1(op=26) Checkpoint TRUNCATE: OK: true, wal_page_count: 0, checkpointed_count: 0 -- Conn 0 now does a PASSIVE checkpoint, and immediately thinks -- it's in the Done state, and thinks it checkpointed 17 frames. -- since mode is PASSIVE, it now thinks both the WAL and the DB have those 17 frames -- so the first 17 frames of the WAL can be ignored from now on. Connection 0(op=27): PRAGMA wal_checkpoint(PASSIVE) Connection 0(op=27) Checkpoint PASSIVE: OK: true, wal_page_count: 0, checkpointed_count: 17 -- Connection 0 starts a txn with min=18 (ignore first 17 frames in WAL), -- and deletes rowid=690, which becomes WAL frame number 1 Connection 0(op=28): DELETE FROM test_table WHERE id = 690 begin_read_tx(min=18, max=0, slot=1, max_frame_in_wal=0) -- Connection 1 starts a txn with min=18 (ignore first 17 frames in WAL), -- and inserts rowid=1128, which becomes WAL frame number 2 Connection 1(op=28): INSERT INTO test_table (id, text) VALUES (1128, text_560) begin_read_tx(min=18, max=1, slot=1, max_frame_in_wal=1) -- Connection 0 again starts tx with min=18, and performs a read, and two wrong things happen: -- 1. it doesn't see row 690 as deleted, because it's in WAL frame 1, which it ignores -- 2. it doesn't see the new row 1128, because it's in WAL frame 2, which it ignores Connection 0(op=29): SELECT * FROM test_table begin_read_tx(min=18, max=2, slot=1, max_frame_in_wal=2) ``` ## Fix Reset `ongoing_checkpoint.state` to `Start` when checkpoint fails. Issue found in #2364 . Reviewed-by: bit-aloo (@Shourya742) Closes #2380	2025-08-01 11:28:04 +03:00
Jussi Saurio	e147494642	pager: make WAL optional again and remove DummyWAL	2025-08-01 10:14:35 +03:00
Jussi Saurio	e6528f2664	fix/wal: reset ongoing checkpoint state when checkpoint fails	2025-08-01 08:39:34 +03:00
Jussi Saurio	eeceefe49d	Merge 'fix/wal: only rollback WAL if txn was write + fix start state for WalFile' from Jussi Saurio Closes #2363 ## What The following sequence of actions is possible: ``` Some committed frames already exist in the WAL. shared.pages_in_frames.len() > 0. Brand new connection does this: BEGIN ^-- deferred, no read tx started yet, so its `self.start_pages_in_frames` is `0` because it's a brand new WalFile instance ROLLBACK <-- calls `wal.rollback()` and truncates `shared.pages_in_frames` to length `0` PRAGMA wal_checkpoint(); ^-- because `pages_in_frames` is empty, it doesnt actually checkpoint anything but still sets shared.max_frame to 0, causing effectively data loss ``` ## Fix - Only call `wal.rollback()` for write transactions - Set `start_pages_in_frames` correctly so that this doesn't happen even if a regression starts calling `wal.rollback()` again Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2366	2025-07-31 16:16:20 +03:00
Jussi Saurio	62e804480e	fix/wal: make db_changed check detect cases where max frame happens to be the same	2025-07-31 14:37:33 +03:00
Jussi Saurio	e88707c6fd	fix/wal: only rollback WAL if txn was write	2025-07-31 14:18:43 +03:00
Jussi Saurio	39dec647a7	fix/wal: reset page cache when another connection checkpointed in between	2025-07-31 12:44:22 +03:00
PThorpe92	2e741641e6	Add test to assert we are backfilling all the rows properly with vectored writes	2025-07-30 19:42:54 -04:00
PThorpe92	ade1c182de	Add is_full method to checkpoint batch	2025-07-30 19:42:54 -04:00

1 2 3 4 5

208 Commits