turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-04 00:44:19 +01:00

Author	SHA1	Message	Date
Pekka Enberg	02f4bc39b3	Merge 'Reanimate MVCC' from Pekka Enberg Bit-rot happened. Bring MVCC back from the dead. Closes #2136	2025-07-18 11:22:49 +03:00
Jussi Saurio	9722aab8a1	clappy	2025-07-18 11:01:57 +03:00
Jussi Saurio	347a9152a6	Merge 'Replace verbose IO Completion methods with helpers' from Preston Thorpe one of the last remnants of some original verbosity Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2156	2025-07-18 10:52:17 +03:00
Jussi Saurio	bbd7f32d80	Merge 'Fix rollback for TxErrors' from Diego Reis Fixes #2153. Not so sure if SQLite doesn't rollback in more cases, we should definitively check this out. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2154	2025-07-18 10:49:29 +03:00
Jussi Saurio	7e1755ac6f	Merge 'Use pread and pwrite in run_once' from Ihor Andrianov pread and pwrite is usually less instructions then seek and read. Also added possibility for io to retry if AGAIN error happens. And made write to wait for Event::writable Reviewed-by: Preston Thorpe (@PThorpe92) Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2010	2025-07-18 10:48:26 +03:00
Iaroslav Zeigerman	28ff170e14	improve sorter settings in the fuzz test	2025-07-18 07:41:15 +02:00
Iaroslav Zeigerman	76e748146b	rebase	2025-07-18 07:30:08 +02:00
Iaroslav Zeigerman	f6f1d076da	verify that records remain unchanged after sorting	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	d9751212d7	make a fuzz sorter test	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	20bdbd5ca5	address suggestions	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	edf2be1432	fix conflicts	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	6a609398fe	cosmetic fix	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	a88b828268	Fix clippy	2025-07-18 07:28:36 +02:00
Iaroslav Zeigerman	fd042ac4c8	Use IOResult insteaed of CursorResult	2025-07-18 07:28:36 +02:00
Iaroslav Zeigerman	78f3bf3475	Core: Introduce external sorting	2025-07-18 07:28:36 +02:00
Diego Reis	9ade79b3ad	Do not rollback in TxError Fixes #2153. Not so sure if SQLite doesn't rollback in more cases, we should definitively check this out.	2025-07-18 01:11:51 -03:00
PThorpe92	a0a948101d	Last cleanups of old completion api	2025-07-17 23:47:43 -04:00
PThorpe92	dced94aec6	Replace verbose completions with new helpers	2025-07-17 23:47:21 -04:00
PThorpe92	44d7570272	Add helper/convenience methods for creating io completions	2025-07-17 23:46:55 -04:00
Jussi Saurio	2f2ecb3576	microsoft paperclip	2025-07-17 23:48:31 +03:00
Jussi Saurio	483dc27539	Merge 'make most instrumentation levels to be Debug or Trace instead' from Pedro Muniz Span creation in debug mode is very slow and impacts our ability to run the Simulator faster. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2146	2025-07-17 23:45:07 +03:00
Jussi Saurio	68427c9b31	Merge '`make_from_btree` should wait for IO to complete' from Pedro Muniz Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2144	2025-07-17 23:44:29 +03:00
Jussi Saurio	1b52b5c764	Merge 'chore: update rust to version 1.88.0' from Nils Koch This PR updates to version Rust 1.88.0 ([Release notes](https://releases.rs/docs/1.88.0/)) and fixes all the clippy errors that come with the new Rust version. This is possible in the latest Rust version: ```rust if let Some(foo) = bar && foo.is_cool() { ... } ``` There are three complications in the migration (so far): - A BUNCH of Clippy warnings (mostly fixed in https://github.com/tursodatabase/limbo/pull/1827) - Windows cross compilation failed; linking `advapi32` on windows fixes it - Since Rust 1.87.0, advapi32 is not linked by default anymore ([Release notes](https://github.com/rust- lang/rust/blob/master/RELEASES.md#compatibility-notes-1), [PR](https://github.com/rust-lang/rust/pull/138233)) - Rust is more strict with FFIs and aligning pointers now. CI checks failed with error below - Fixed in https://github.com/tursodatabase/turso/pull/2064 ``` thread 'main' panicked at core/ext/vtab_xconnect.rs:64:25: misaligned pointer dereference: address must be a multiple of 0x8 but is 0x7ffd9d901554 ``` Closes #1807	2025-07-17 23:35:33 +03:00
pedrocarlo	c15f1e02d3	make most instrumentation levels to be Debug or Trace instead. Span creation in debug mode is very slow and impacts our ability to run the Simulator fast enough	2025-07-17 16:48:24 -03:00
Pekka Enberg	ddc0144dcf	core/mvcc: Adjust cursor tests	2025-07-17 21:43:07 +03:00
pedrocarlo	9690eb41c2	`make_from_btree` should wait for IO to complete if we do not want to use a state machine	2025-07-17 15:34:42 -03:00
Jussi Saurio	9726b95beb	Merge 'translate/create index: fix wrong collations' from Jussi Saurio We were passing the table columns' collations (all of them) in order, instead of the index column collations. Two issues: 1. This is wrong 2. There's now an assertion in the Sorter that actually panics if the length of sort order and collations is not the same Closes #2140	2025-07-17 21:25:11 +03:00
pedrocarlo	1f67d69e8e	forgot to set the state to NewTrunk if we have more leaf pages than free entries	2025-07-17 15:09:52 -03:00
Jussi Saurio	a45ac11462	translate/create index: fix wrong collations	2025-07-17 21:07:48 +03:00
Jussi Saurio	e56325bf05	Merge 'Implement IO latency correctly in simulator' from Pedro Muniz Closes #1998. Now I am queuing IO to be run at some later point in time. Also Latency for some reason is slowing the simulator a looot for some runs. This PR also adds a StateMachine variant in Balance as now `free_pages` is correctly an asynchronous function. With this change, we now need a state machine in the `Pager` so that `free_pages` can be reentrant. Lastly, I removed a timeout in `checkpoint_shutdown` as it was triggering constantly due to the slightly increased latency. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1943	2025-07-17 21:05:17 +03:00
Jussi Saurio	49b9a69c40	fix/btree: fix insert_into_cell() logic During running simulations for #1988 I ran into a post-balance validation error where the correct divider cell could not be found from the parent. This was caused by divider cell insertion happening this way: - First divider cell caused overflow - Second technically had space to fit, so we didn't add it to overflow cells I looked at SQLite source, and it seems SQLite always adds the cell to overflow cells if there are existing overflow cells: ```c if( pPage->nOverflow \|\| sz+2>pPage->nFree ){ ...add to overflow cells... } ``` So, I changed our implementation to do the same, which fixed the balance validation issue. However, then I ran into another issue: A cell inserted during balancing in the `edit_page()` stage was added to overflow cells, which should not happen. The reason for this was the changed logic in `insert_into_page()`, outlined above. It looks like SQLite doesn't use `insert_into_cell()´ in its implementation of `page_insert_array()` which explains this. For simplicity, I made a second version of `insert_into_cell()` called `insert_into_cell_during_balance()` which allows regular cell insertion despite existing overflow cells, since the existing overflow cells are what caused the balance to happen in the first place and will be cleared as soon as `edit_page()` is done.	2025-07-17 18:26:14 +03:00
pedrocarlo	b80218324d	fix merge conflicts	2025-07-17 12:25:31 -03:00
pedrocarlo	46a7d20c12	clippy	2025-07-17 12:24:43 -03:00
pedrocarlo	4a13286d62	modify clock to use simulated time instead	2025-07-17 12:24:43 -03:00
pedrocarlo	7b8eec90bd	edit state machine in Btree for freeing pages + Pager state machine for free_page	2025-07-17 12:24:43 -03:00
pedrocarlo	5771d1a00e	disable wal sync timeout on checkpoint	2025-07-17 12:24:43 -03:00
pedrocarlo	dc5f73887e	refactor to require `Arc<Completion>` in file traits so that we can delay IO calls correctly	2025-07-17 12:24:43 -03:00
Pekka Enberg	aa84daabf1	core/storage: Fix BTreeCursor::rowid() with MVCC	2025-07-17 16:23:31 +03:00
Pekka Enberg	cef0195b42	core/storage: Fix BTreeCursor::record() for MVCC Respect immutable record invalidation.	2025-07-17 16:12:40 +03:00
Pekka Enberg	962987e9a1	core/mvcc: Fix MVCC cursor traversal Add an explicit rewind() to move to the beginning. Change forward() semantics so that after first forward() call, you are pointing to the first row, which matches the get_next_record() semantics in B-tree cursor.	2025-07-17 16:12:40 +03:00
Pekka Enberg	2b1ee907a9	core/vdbe: Fix op_new_rowid() with MVCC	2025-07-17 14:13:22 +03:00
Pekka Enberg	72df538a76	core/storage: Add MVCC asertion to BTreeCursor::seek_to_last()	2025-07-17 14:13:22 +03:00
Pekka Enberg	3aca9c54c7	core/storage: Fix BTreeCursor::record() with MVCC	2025-07-17 14:13:22 +03:00
Pekka Enberg	8e338d3e7a	core/vdbe: Fix SetCookie when MVCC is enabled	2025-07-17 14:13:22 +03:00
Pekka Enberg	1fc6126157	core/storage: Allocate page1 lazily for MVCC transactions	2025-07-17 14:13:22 +03:00
Jussi Saurio	01ad75ecd0	page cache: temporarily increase default size until WAL spill is implemented	2025-07-17 12:28:44 +03:00
Jussi Saurio	5a2efa3077	Merge 'refactor/btree&vdbe: fold index key info (sort order, collations) into a single struct' from Jussi Saurio These are nearly always used together in some form, so it makes sense to colocate them, and it also makes many code paths simpler, as we don't separately pass `collations` and `key_sort_order` around As a side effect, as the bitfield-based `IndexKeySortOrder` is removed, we now remove the arbitrary 64 column restriction for indexes, see e.g. this sim failure which fails to 64+ index columns (not sure why it uses an index if they are disabled): https://github.com/tursodatabase/turso/actions/runs/16339391964/job/4615 8045158 Closes #2131	2025-07-17 11:55:56 +03:00
Jussi Saurio	e8199cb26c	btree/vdbe: fold index key info (sort order, collations) into a single struct These are nearly always used together in some form, so it makes sense to colocate them, and it also makes many code paths simpler.	2025-07-17 10:58:43 +03:00
Pekka Enberg	99cdcf5348	Merge 'core: Copy-on-write for in-memory schema' from Levy A. <img height="400" alt="image" src="https://github.com/user- attachments/assets/bdd5c0a8-1bbb-4199-9026-57f0e5202d73" /> <img height="400" alt="image" src="https://github.com/user- attachments/assets/7ea63e58-2ab7-4132-b29e-b20597c7093f" /> We were copying the schema preemptively on each `Database::connect`, now the schema is shared until a change needs to be made by sharing a single `Arc` and mutating it via `Arc::make_mut`. This is faster as reduces memory usage. Closes #2022	2025-07-17 10:46:46 +03:00
Pekka Enberg	ae4dcbad0f	Merge 'Async IO: registration of file descriptors' from Preston Thorpe ### Async IO performance, part 0 Relatively small and focused PR that mainly does two things, will add a .md document of the proposed/planned improvements to the io_uring module to fully revamp our async IO. 1. Registration of file descriptors. At startup, by calling `io_uring_register_files_sparse` we can allocate an array in shared kernel/user space by calling register_files_sparse which initializes each slot to `-1`, and when we open a file we call `io_uring_register_files_update`, providing an index into this array and `fd`. Then for the IO submission, we can reference the index into this array instead of the fd, saving the kernel the work of looking up the fd in the process file table, incrementing the reference count, doing the operation, then finally decrementing the refcount. Instead the kernel can just index into the array and do the operation. This especially provides an improvement for cases like this, where files are open for long periods of time, which the kernel will perform many operations on. The eventual goal of this, is to use Fixed read/write operations, where both the file descriptor and the underlying buffer is registered with the kernel. There is another branch continuing this work, that introduces a buffer pool that memlock's one large 32MB arena mmap and tries to use that wherever possible. These Fixed operations are essentially the "holy grail" of io_uring performance (for file operations). 2. !Vectored IO This is kind of backwards, because the goal is to indeed implement proper vectored IO and I'm removing some of the plumbing in this PR, but currently we have been using `Writev`/`Readv`, while never submitting > 1 iovec at a time. Writes to the WAL, especially, would benefit immensely from vectored IO, as it is append-only and therefore all writes are contiguous. Regular checkpointing/cache flushing to disk can also be adapted to aggregate these writes and submit many in a single system call/opcode. Until this is implemented, the bookkeeping and iovecs are unnecessary noise/overhead, so let's temporarily remove them and revert to normal `read`/`write` until they are needed and it can be designed from scratch. 3. Flags `setup_single_issuer` hints to the kernel that `IOURING_ENTER` calls will all be sent from a single thread, and `setup_coop_taskrun` removes some unnecessary kernel interrupts for providing cqe's which most single threaded applications do not need. Both these flags demonstrate modest improvement of performance. Closes #2127	2025-07-17 08:47:44 +03:00

1 2 3 4 5 ...

3449 Commits