turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-29 22:14:23 +01:00

Author	SHA1	Message	Date
PThorpe92	0a56d23402	Use u64 for file offsets in IO and calculate such offsets in u64	2025-08-28 09:44:00 -04:00
Pekka Enberg	2ea4354afe	Merge 'Improve integrity check' from Nikita Sivukhin - check free list trunk and pages - use shared hash map to check for duplicate references for pages - properly check overflow pages Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2816	2025-08-28 16:06:15 +03:00
Pekka Enberg	878147b931	Merge 'translate/insert: Improve string format performance' from Pere Diaz Bou Rust's `fmt!` is slow af for the simplest of cases, let's just create strings with a known size and skip all the fmt stuff. Closes #2832	2025-08-28 14:36:09 +03:00
Pekka Enberg	a5322d2ab7	Merge 'core/schema: get_dependent_materialized_views_unnormalized' from Pere Diaz Bou If we get a table name for in memory structure, it's safe to assume it's already normalized. Closes #2830	2025-08-28 14:34:06 +03:00
Pekka Enberg	eba90e1c5e	Merge 'core/util: emit literal, cow instead of replace' from Pere Diaz Bou Closes #2828	2025-08-28 14:33:04 +03:00
Pere Diaz Bou	964422375e	translate/insert: string fmt perf improvements	2025-08-28 13:22:54 +02:00
Pere Diaz Bou	48e5ad7a55	core/schema: get_dependent_materialized_views_unnormalized If we get a table name for in memory structure, it's safe to assume it's already normalized.	2025-08-28 13:11:40 +02:00
Pere Diaz Bou	84c5c4e581	core/util: emit literal, cow instead of replace	2025-08-28 12:59:55 +02:00
Pere Diaz Bou	082f18c073	core/translate: sanitize_string fast path improvement	2025-08-28 12:57:28 +02:00
Pekka Enberg	b4eba8b456	Merge 'Fix planner alias and table name handling' from closes: #2817 ``` turso> select * from simple_view as "v" where "v"."x" = 1; ┌───┐ │ x │ ├───┤ │ 1 │ └───┘ turso> select "s"."x" from simple as "s" where "s"."x" = 1; ┌───┐ │ x │ ├───┤ │ 1 │ └───┘ ``` Reviewed-by: Preston Thorpe <preston@turso.tech> Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2818	2025-08-28 07:39:47 +03:00
Pekka Enberg	591914a63b	Merge 'Introduce and propagate `IOContext` as required' from Avinash Sajjanshetty I added `IOContext` to the `DatabaseStorage` IO trait; this struct will carry the context required for encryption (or checksums). This lets us set the encryption from outside and let the IO layer handle it properly. Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2812	2025-08-28 07:38:52 +03:00
themixednuts	79a9f4743e	fix: planner alias and table name	2025-08-27 18:13:03 -05:00
Nikita Sivukhin	1c0efcfbff	fix clippy	2025-08-27 23:22:21 +04:00
Nikita Sivukhin	09d4590ece	fix compilation	2025-08-27 23:19:26 +04:00
Nikita Sivukhin	ae705445bf	improve integrity check - check free list trunk and pages - use shared hash map to check for duplicate references for pages - properly check overflow pages	2025-08-27 23:14:21 +04:00
Avinash Sajjanshetty	2c0842ff52	Set and propagate `IOContext` as required	2025-08-27 22:05:01 +05:30
Avinash Sajjanshetty	9e663c7f46	Add IOContext to carry encryption/checksum ctx	2025-08-27 21:33:05 +05:30
Pekka Enberg	bf7f80a937	core/io: Switch Unix I/O to use libc::pwrite() We use libc elsewhere for fault injection reasons, so let's do this call-site too.	2025-08-27 17:56:23 +03:00
Pekka Enberg	2921033b28	core/storage: Micro-optimize Pager::commit_dirty_pages() There's no need to call io.now() unless debug tracing is on. Let's micro-optimize commit_dirty_pages() to avoid the unnecessary call.	2025-08-27 11:12:43 +03:00
Avinash Sajjanshetty	caa00e31f8	Use `Cell` instead of `RefCell` because it's nice	2025-08-26 20:00:13 +05:30
Jussi Saurio	66d00915d7	Merge 'Improve documentation of page pinning' from Jussi Saurio Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2797	2025-08-26 17:18:25 +03:00
Pekka Enberg	26ba09c45f	Revert "Merge 'Remove double indirection in the Parser' from Pedro Muniz" This reverts commit `71c1b357e4`, reversing changes made to `6bc568ff69` because it actually makes things slower.	2025-08-26 14:58:21 +03:00
Jussi Saurio	bf58d179db	Improve documentation of page pinning	2025-08-26 10:13:25 +03:00
Pekka Enberg	3176df64a2	Merge 'Fix: return NULL for rowid() when cursor's null flag is on' from Jussi Saurio Fixes TPC-H query 13 from returning an incorrect result. In this specific case, we were returning non-null `IdxRowid` values for the right-hand side table even when there was no match with the left-hand side table, meaning the join produced matches even in cases where there shouldn't have been any. Closes #2794 Closes #2795	2025-08-26 09:33:49 +03:00
Jussi Saurio	e52f807c7d	Fix: return NULL for rowid() when cursor's null flag is on Fixes TPC-H query 13 from returning an incorrect result. In this specific case, we were returning non-null `IdxRowid` values for the right-hand side table even when there was no match with the left-hand side table, meaning the join produced matches even in cases where there shouldn't have been any. Closes #2794	2025-08-26 09:08:48 +03:00
Pekka Enberg	114ece0375	Merge 'Make fill_cell_payload() safe for async IO and cache spilling' from Jussi Saurio ## Make fill_cell_payload() safe for async IO and cache spilling ### Problems: 1. fill_cell_payload() is not re-entrant because it can yield IO on allocating a new overflow page, resulting in losing some of the input data. 2. fill_cell_payload() in its current form is not safe for cache spilling because the previous overflow page in the chain of allocated overflow pages can be evicted by a spill caused by the next overflow page allocation, invalidating the page pointer and causing corruption. 3. fill_cell_payload() uses raw pointers and `unsafe` as a workaround from a previous time when we used to clone `WriteState`, resulting in hard-to-read code. ### Solutions: 1. Introduce a new substate to the fill_cell_payload state machine to handle re-entrancy wrt. allocating overflow pages. 2. Always pin the current overflow page so that it cannot be evicted during the overflow chain construction. Also pin the regular page the overflow chain is attached to, because it is immediately accessed after fill_cell_payload is done. 3. Remove all explicit usages of `unsafe` from `fill_cell_payload` (although our pager is ofc still extremely unsafe under the hood :] ) Note that solution 2 addresses a problem that arose in the development of page cache spilling, which is not yet implemented, but will be soon. ### Miscellanea: 1. Renamed a bunch of variables to be clearer 2. Added more comments about what is happening in fill_cell_payload Closes #2737	2025-08-26 08:36:46 +03:00
Pekka Enberg	6e78c23ce7	Merge 'Remove Windows IO in place of Generic IO' from Preston Thorpe Generic IO and Windows IO were identical, since we don't do anything Windows-specific (maybe when someone eventually wants to implement an IO back-end for the Windows `IO_RING` API, we can bring it back :) This uses the exact impl of `WindowsIO`, simply renaming it to Generic IO, because I don't think GenericIO had ever been used. Reviewed-by: Pedro Muniz (@pedrocarlo) Closes #2790	2025-08-26 08:33:04 +03:00
Pekka Enberg	8f11311473	Merge 'Improve encryption API' from Avinash Sajjanshetty This patch brings a bunch of quality of life improvements to encryption: 1. Previously, we let any string be used as a key. I have updated the `PRAGMA hexkey=''` to get the key in hex. I have also renamed it from `key`, because that will be used to get the passphrase 2. Added `PRAGMA cipher` so that users can now select which cipher they want to use (for now, either `aegis256` or `aes256gcm`) 3. We now set the encryption context when both cipher and key are set I also updated tests to reflect this. Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2779	2025-08-26 08:32:29 +03:00
pedrocarlo	d3240844ec	refactor Core to remove the double indirection	2025-08-25 22:59:31 -03:00
PThorpe92	8c64b772e7	Use previous WindowsIO impl as generic IO	2025-08-25 19:04:14 -04:00
PThorpe92	177c717f25	Remove windows IO in place of Generic IO	2025-08-25 18:47:21 -04:00
PThorpe92	2d661e3304	Apply review suggestions, add logging	2025-08-25 16:56:43 -04:00
PThorpe92	748e339f68	Make clippy happy	2025-08-25 16:52:34 -04:00
PThorpe92	1b514e6d0f	Only checkpoint final remaining DB connection, and use Truncate mode	2025-08-25 16:52:29 -04:00
Pekka Enberg	e57f59d744	Merge 'Fix several issues with integrity_check' from Jussi Saurio Things that were just wrong: 1. No pages other than the root page were checked, because no looping was done. Add a loop. 2. Rightmost child page was never added to page stack. Add it. New integrity check features: - Add overflow pages to stack as well - Check that no page is referenced more than once in the tree Closes #2781	2025-08-25 19:05:32 +03:00
Pekka Enberg	6baa4cd1c0	Merge 'DBSP projection' from Pekka Enberg This PR implements the ProjectOperator for DBSP circuits. Closes #2773	2025-08-25 19:05:20 +03:00
Pekka Enberg	e3ffc82a1d	core/incremental: Fix expression compiler to use new parser	2025-08-25 17:48:20 +03:00
Glauber Costa	ffab4a89a2	addressed review comments from Jussi	2025-08-25 17:48:17 +03:00
Glauber Costa	097510216e	implement the projector operator for DBSP My goal with this patch is to be able to implement the ProjectOperator for DBSP circuits using VDBE for expression evaluation. Not doing so is dangerous for the following reason: we will end up with different, subtle, and incompatible behavior between SQLite expressions if they are used in views versus outside of views. In fact, even our prototype had them: our projection tests, which used to pass, were actually wrong =) (sqlite would return something different if those functions were executed outside the view context) For optimization reasons, we single out trivial expressions: they don't have to go through VDBE. Trivial expressions are expressions that only involve Columns, Literals, and simple operators on elements of the same type. Even type coercion takes this out of the realm of trivial. Everything that is not trivial is then translated with translate_expr - in the same way SQLite will, and then compiled with VDBE. We can, over time, make this process much better. There are essentially infinite opportunities for optimization here. But for now, the main warts are: * VDBE execution needs a connection * There is no good way in VDBE to pass parameters to a program. * It is almost trivial to pollute the original connection. For example, we need to issue HALT for the program to stop, but seeing that halt will usually cause the program to try and halt the original program. Subprograms, like the ones we use in triggers, are a possible solution, but they are much more expensive to execute, especially given that our execution would essentially have to have a program with no other role than to wrap the subprogram. Therefore, what I am doing is: * There is an in-memory database inside the projection operator (an obvious optimization is to share it with all projection operators). * We obtain a connection to that database when the operator is created * We use that connection to execute our VDBE, which offers a clean, safe and isolated way to execute the expression. * We feed the values to the program manually by editing the registers directly.	2025-08-25 17:48:17 +03:00
Glauber Costa	38def26704	Add expr_compiler To be used in DBSP-based projections. This will compile an expression to VDBE bytecode and execute it. To do that we need to add a new type of Expression, which we call a Register. This is a way for us to pass parameters to a DBSP program which will be not columns or literals, but inputs from the DBSP deltas.	2025-08-25 17:48:17 +03:00
Glauber Costa	911b4c38a6	do not ignore silent failures from view creation We have an issue at the moment that when a materialized view fails to be created, we just swallow the error and leave the database in a funny state. We have can_create_view() to detect those issues early, but not all errors can be detected that early.	2025-08-25 17:48:17 +03:00
Jussi Saurio	8cae10f744	Fix several issues with integrity_check Things that were just wrong: 1. No pages other than the root page were checked, because no looping was done. Add a loop. 2. Rightmost child page was never added to page stack. Add it. New integrity check features: - Add overflow pages to stack as well - Check that no page is referenced more than once in the tree	2025-08-25 16:51:57 +03:00
PThorpe92	37a7ec7477	Update append_frames_vectored to use new encryption_ctx and apply review	2025-08-25 09:50:57 -04:00
PThorpe92	daea841b47	Minor adjustments/comments to wal append_frames_vectored method	2025-08-25 09:47:06 -04:00
PThorpe92	0239088718	Use new append_frames_vectored WAL method to flush pager cache and commit write tx	2025-08-25 09:47:06 -04:00
PThorpe92	46e288ac26	Add append_frames_vectored to WAL api In addition to the existing `append_frame` which will write an individual frame to the WAL, we add a method `append_frames_vectored` that takes N frames and the db size which will need to be set for the last (commit) frame, and it calculates the checksums and submits them as a single `pwritev` call, reducing the number of syscalls needed for each write operation.	2025-08-25 09:47:01 -04:00
Avinash Sajjanshetty	40b7e3bf5a	rename `cipher` to `cipher_mode` for consistency	2025-08-25 19:16:15 +05:30
Pekka Enberg	1a4a53e6ea	Merge 'core/io: Fix build on Android and iOS' from Pekka Enberg Commit `ebe6aa0d28` ("adjust cfg for unix and linux IO") adjusted the I/O conditional compilation, but forgot that Android and iOS are also part of Unix target family. Fixes #2500 Closes #2776	2025-08-25 15:34:52 +03:00
Pekka Enberg	3f5878243f	Merge 'Remove unnecessary argument from Pager::end_tx()' from Nikita Sivukhin No need to pass `disable` flag to the `end_tx` method as it has that info from connection itself Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2777	2025-08-25 15:34:41 +03:00
Preston Thorpe	040ceba2d6	Merge 'WAL txn: fix reads from DB file' from Nikita Sivukhin - Transaction which was started with max_frame = 0 and max_frame_read_lock_index = 0 can write to the WAL and in this case it needs to read data back from WAL and not the DB file. - Without cache spilling it's hard to reproduce this issue for the turso-db now, but I found this issue with sync-engine, which does weird stuff with the WAL that "simulates" cache spilling behaviour to some extent. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2735	2025-08-25 08:34:17 -04:00

4487 Commits