turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-30 06:24:21 +01:00

Author	SHA1	Message	Date
Iaroslav Zeigerman	76e748146b	rebase	2025-07-18 07:30:08 +02:00
Iaroslav Zeigerman	f6f1d076da	verify that records remain unchanged after sorting	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	d9751212d7	make a fuzz sorter test	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	20bdbd5ca5	address suggestions	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	edf2be1432	fix conflicts	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	6a609398fe	cosmetic fix	2025-07-18 07:28:37 +02:00
Iaroslav Zeigerman	a88b828268	Fix clippy	2025-07-18 07:28:36 +02:00
Iaroslav Zeigerman	fd042ac4c8	Use IOResult instead of CursorResult	2025-07-18 07:28:36 +02:00
Iaroslav Zeigerman	78f3bf3475	Core: Introduce external sorting	2025-07-18 07:28:36 +02:00
Jussi Saurio	2f2ecb3576	microsoft paperclip	2025-07-17 23:48:31 +03:00
Jussi Saurio	483dc27539	Merge 'make most instrumentation levels to be Debug or Trace instead' from Pedro Muniz Span creation in debug mode is very slow and impacts our ability to run the Simulator faster. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2146	2025-07-17 23:45:07 +03:00
Jussi Saurio	68427c9b31	Merge '`make_from_btree` should wait for IO to complete' from Pedro Muniz Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2144	2025-07-17 23:44:29 +03:00
Jussi Saurio	1b52b5c764	Merge 'chore: update rust to version 1.88.0' from Nils Koch This PR updates to version Rust 1.88.0 ([Release notes](https://releases.rs/docs/1.88.0/)) and fixes all the clippy errors that come with the new Rust version. This is possible in the latest Rust version: ```rust if let Some(foo) = bar && foo.is_cool() { ... } ``` There are three complications in the migration (so far): - A BUNCH of Clippy warnings (mostly fixed in https://github.com/tursodatabase/limbo/pull/1827) - Windows cross compilation failed; linking `advapi32` on windows fixes it - Since Rust 1.87.0, advapi32 is not linked by default anymore ([Release notes](https://github.com/rust-lang/rust/blob/master/RELEASES.md#compatibility-notes-1), [PR](https://github.com/rust-lang/rust/pull/138233)) - Rust is more strict with FFIs and aligning pointers now. CI checks failed with error below - Fixed in https://github.com/tursodatabase/turso/pull/2064 ``` thread 'main' panicked at core/ext/vtab_xconnect.rs:64:25: misaligned pointer dereference: address must be a multiple of 0x8 but is 0x7ffd9d901554 ``` Closes #1807	2025-07-17 23:35:33 +03:00
pedrocarlo	c15f1e02d3	make most instrumentation levels to be Debug or Trace instead. Span creation in debug mode is very slow and impacts our ability to run the Simulator fast enough	2025-07-17 16:48:24 -03:00
pedrocarlo	9690eb41c2	`make_from_btree` should wait for IO to complete if we do not want to use a state machine	2025-07-17 15:34:42 -03:00
Jussi Saurio	9726b95beb	Merge 'translate/create index: fix wrong collations' from Jussi Saurio We were passing the table columns' collations (all of them) in order, instead of the index column collations. Two issues: 1. This is wrong 2. There's now an assertion in the Sorter that actually panics if the length of sort order and collations is not the same Closes #2140	2025-07-17 21:25:11 +03:00
pedrocarlo	1f67d69e8e	forgot to set the state to NewTrunk if we have more leaf pages than free entries	2025-07-17 15:09:52 -03:00
Jussi Saurio	a45ac11462	translate/create index: fix wrong collations	2025-07-17 21:07:48 +03:00
Jussi Saurio	e56325bf05	Merge 'Implement IO latency correctly in simulator' from Pedro Muniz Closes #1998. Now I am queuing IO to be run at some later point in time. Also Latency for some reason is slowing the simulator a looot for some runs. This PR also adds a StateMachine variant in Balance as now `free_pages` is correctly an asynchronous function. With this change, we now need a state machine in the `Pager` so that `free_pages` can be reentrant. Lastly, I removed a timeout in `checkpoint_shutdown` as it was triggering constantly due to the slightly increased latency. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1943	2025-07-17 21:05:17 +03:00
Jussi Saurio	49b9a69c40	fix/btree: fix insert_into_cell() logic During running simulations for #1988 I ran into a post-balance validation error where the correct divider cell could not be found from the parent. This was caused by divider cell insertion happening this way: - First divider cell caused overflow - Second technically had space to fit, so we didn't add it to overflow cells I looked at SQLite source, and it seems SQLite always adds the cell to overflow cells if there are existing overflow cells: ```c if( pPage->nOverflow || sz+2>pPage->nFree ){ ...add to overflow cells... } ``` So, I changed our implementation to do the same, which fixed the balance validation issue. However, then I ran into another issue: A cell inserted during balancing in the `edit_page()` stage was added to overflow cells, which should not happen. The reason for this was the changed logic in `insert_into_page()`, outlined above. It looks like SQLite doesn't use `insert_into_cell()` in its implementation of `page_insert_array()` which explains this. For simplicity, I made a second version of `insert_into_cell()` called `insert_into_cell_during_balance()` which allows regular cell insertion despite existing overflow cells, since the existing overflow cells are what caused the balance to happen in the first place and will be cleared as soon as `edit_page()` is done.	2025-07-17 18:26:14 +03:00
pedrocarlo	b80218324d	fix merge conflicts	2025-07-17 12:25:31 -03:00
pedrocarlo	46a7d20c12	clippy	2025-07-17 12:24:43 -03:00
pedrocarlo	4a13286d62	modify clock to use simulated time instead	2025-07-17 12:24:43 -03:00
pedrocarlo	7b8eec90bd	edit state machine in Btree for freeing pages + Pager state machine for free_page	2025-07-17 12:24:43 -03:00
pedrocarlo	5771d1a00e	disable wal sync timeout on checkpoint	2025-07-17 12:24:43 -03:00
pedrocarlo	dc5f73887e	refactor to require `Arc<Completion>` in file traits so that we can delay IO calls correctly	2025-07-17 12:24:43 -03:00
Jussi Saurio	01ad75ecd0	page cache: temporarily increase default size until WAL spill is implemented	2025-07-17 12:28:44 +03:00
Jussi Saurio	5a2efa3077	Merge 'refactor/btree&vdbe: fold index key info (sort order, collations) into a single struct' from Jussi Saurio These are nearly always used together in some form, so it makes sense to colocate them, and it also makes many code paths simpler, as we don't separately pass `collations` and `key_sort_order` around. As a side effect, as the bitfield-based `IndexKeySortOrder` is removed, we now remove the arbitrary 64 column restriction for indexes, see e.g. this sim failure which fails due to 64+ index columns (not sure why it uses an index if they are disabled): https://github.com/tursodatabase/turso/actions/runs/16339391964/job/46158045158 Closes #2131	2025-07-17 11:55:56 +03:00
Jussi Saurio	e8199cb26c	btree/vdbe: fold index key info (sort order, collations) into a single struct These are nearly always used together in some form, so it makes sense to colocate them, and it also makes many code paths simpler.	2025-07-17 10:58:43 +03:00
Pekka Enberg	99cdcf5348	Merge 'core: Copy-on-write for in-memory schema' from Levy A. We were copying the schema preemptively on each `Database::connect`; now the schema is shared until a change needs to be made by sharing a single `Arc` and mutating it via `Arc::make_mut`. This is faster, as it reduces memory usage. Closes #2022	2025-07-17 10:46:46 +03:00
Pekka Enberg	ae4dcbad0f	Merge 'Async IO: registration of file descriptors' from Preston Thorpe ### Async IO performance, part 0 Relatively small and focused PR that mainly does two things, will add a .md document of the proposed/planned improvements to the io_uring module to fully revamp our async IO. 1. Registration of file descriptors. At startup, by calling `io_uring_register_files_sparse` we can allocate an array in shared kernel/user space by calling register_files_sparse which initializes each slot to `-1`, and when we open a file we call `io_uring_register_files_update`, providing an index into this array and `fd`. Then for the IO submission, we can reference the index into this array instead of the fd, saving the kernel the work of looking up the fd in the process file table, incrementing the reference count, doing the operation, then finally decrementing the refcount. Instead the kernel can just index into the array and do the operation. This especially provides an improvement for cases like this, where files are open for long periods of time, which the kernel will perform many operations on. The eventual goal of this, is to use Fixed read/write operations, where both the file descriptor and the underlying buffer is registered with the kernel. There is another branch continuing this work, that introduces a buffer pool that memlock's one large 32MB arena mmap and tries to use that wherever possible. These Fixed operations are essentially the "holy grail" of io_uring performance (for file operations). 2. !Vectored IO This is kind of backwards, because the goal is to indeed implement proper vectored IO and I'm removing some of the plumbing in this PR, but currently we have been using `Writev`/`Readv`, while never submitting > 1 iovec at a time. Writes to the WAL, especially, would benefit immensely from vectored IO, as it is append-only and therefore all writes are contiguous. Regular checkpointing/cache flushing to disk can also be adapted to aggregate these writes and submit many in a single system call/opcode. Until this is implemented, the bookkeeping and iovecs are unnecessary noise/overhead, so let's temporarily remove them and revert to normal `read`/`write` until they are needed and it can be designed from scratch. 3. Flags `setup_single_issuer` hints to the kernel that `IOURING_ENTER` calls will all be sent from a single thread, and `setup_coop_taskrun` removes some unnecessary kernel interrupts for providing cqe's which most single threaded applications do not need. Both these flags demonstrate modest improvement of performance. Closes #2127	2025-07-17 08:47:44 +03:00
Pekka Enberg	d2158ff201	Merge 'Clean up AST unparsing, remove `ToSqlString`' from Levy A. Enables formatting `Expr::Column` by adding the context to `ToTokens` instead of creating a new unparsing implementation for each node. `ToTokens` implemented for: - [x] `UpdatePlan` - [x] `Plan` - [x] `JoinedTable` - [x] `SelectPlan` - [x] `DeletePlan` Reviewed-by: Pedro Muniz (@pedrocarlo) Closes #1949	2025-07-17 08:44:31 +03:00
PThorpe92	ad2ae3e22f	Use fallback to regular fd if file registration is unavailable in io_uring	2025-07-16 23:08:46 -04:00
PThorpe92	fb78cdade0	Increase ring size from 128 -> 512	2025-07-16 22:44:20 -04:00
PThorpe92	4d09f1ab65	Enable coop_taskrun flag to disable excessive interrupts for completions	2025-07-16 22:43:44 -04:00
PThorpe92	95c343586c	Enable single_issuer flag for io_uring to signal submissions from single thread	2025-07-16 22:42:40 -04:00
PThorpe92	9dfadf7872	Add registered file descriptors to io_uring IO module	2025-07-16 22:41:47 -04:00
Nils Koch	8dc066503e	chore: fix clippy errors	2025-07-16 19:34:42 +01:00
Pekka Enberg	e6c3a5a9b8	Merge 'rename operation_xxx to change_xxx to make naming more consistent' from Nikita Sivukhin This PR renames CDC table column names to use "change"-centric terminology and avoid using `operation_xxx` column names. Just a small refactoring to bring more consistency, as `turso-db` refers to the feature as capture data changes and the word "operation" does not appear there. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2120	2025-07-16 20:08:23 +03:00
Pekka Enberg	af182d9895	Merge 'btree: fix post-balancing seek bug in delete path' from Jussi Saurio Aftermath of seek-related refactor in #2065, which you can read for background. The change in this PR is documented pretty well inline - if we receive a `TryAdvance` seek result when seeking after balancing, we need to - well - try to advance. Closes #2116 Closes #2115	2025-07-16 20:08:15 +03:00
Levy A.	8e8f1682df	add `with_schema_mut` removes all repeated `Arc::make_mut`	2025-07-16 13:54:39 -03:00
Levy A.	d0e26db01a	use lock for database schema	2025-07-16 13:54:39 -03:00
Levy A.	4c77d771ff	only copy schema on writes	2025-07-16 13:54:36 -03:00
Jussi Saurio	bb0c017d9f	Merge 'btree: fix trying to go upwards when we are already at the end of the entire btree' from Jussi Saurio ## What does this fix This PR fixes an issue with BTree upwards traversal logic where we would try to go up to a parent node in `next()` even though we are at the very end of the btree. This behavior can leave the cursor incorrectly positioned at an interior node when it should be at the right edge of the rightmost leaf. ## Why doesn't it cause problems on main This bug is masked on `main` by every table `insert()` (wastefully) calling `find_cell()`: - `op_new_rowid` called, let's say the current max rowid is `666`. Cursor is left pointing at `666`. - `insert()` is called with rowid `667`, cursor is currently pointing at `666`, which is incorrect. - `find_cell()` does a binary search every time, and hence somewhat accidentally positions the cursor correctly _after_ `666` so that the insert goes to the correct place ## Why was this issue found in #1988, I am removing `find_cell()` entirely in favor of always performing a seek to the correct location - and skipping `seek` when it is not required, saving us from wasting a binary search on every insert - but this change means that we need to call `next()` after `op_new_rowid` to have the cursor positioned correctly at the new insertion slot. Doing this surfaces this upwards traversal bug in that PR branch. ## Details of solution - Store `cell_count` together with `cell_idx` in pagestack, so that children can know whether their parents have reached their end without doing IO - To make this foolproof, pin pages on `PageStack` so the page cache cannot evict them during tree traversal - `cell_indices` renamed to `node_states` since it now carries more information (cell index AND count, instead of just index) Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2005	2025-07-16 19:44:21 +03:00
Nikita Sivukhin	97b82fe6d8	rename operation_xxx to change_xxx to make naming more consistent	2025-07-16 20:16:24 +04:00
Levy A.	714225b9f0	remove `ToSqlString` trait	2025-07-16 12:16:34 -03:00
Levy A.	6fe2505425	add more `ToTokens` impls	2025-07-16 12:16:31 -03:00
Levy A.	373a4a26c4	fix: comma function	2025-07-16 12:16:28 -03:00
Levy A.	765b90aeb9	feat: implement `ToTokens` for `UpdatePlan`	2025-07-16 12:16:23 -03:00
Diego Reis	b86674adbb	Remove cache clearing in cacheflush	2025-07-16 11:11:52 -03:00

3427 Commits