turso

mirror of https://github.com/aljazceru/turso.git synced 2026-02-09 10:14:21 +01:00

Author	SHA1	Message	Date
TcMits	14eb8a8ffe	move check code into parser	2025-08-22 16:28:56 +07:00
Jussi Saurio	cf64452139	Merge 'sim: remove "run_once faults"' from Jussi Saurio This kind of fault does not semantically represent anything real, since we already have fault injection for every concrete IO operation like reading, writing, syncing and so forth. Moreover, having this feature is the direct cause of the false positive simulator failure as reported in issue #2727. There, a "run_once fault" happened immediately after we fsynced following an INSERT, which caused the simulator to think the INSERT failed, and later a sim assertion failed because the on-disk database had 1 more row than it thought it would. Closes #2727 Reviewed-by: Pekka Enberg <penberg@iki.fi> Closes #2728	2025-08-22 10:30:36 +03:00
Jussi Saurio	cc643362a4	sim: remove "run_once faults" This kind of fault does not semantically represent anything real, since we already have fault injection for every concrete IO operation like reading, writing, syncing and so forth. Moreover, having this feature is the direct cause of the false positive simulator failure as reported in issue #2727. There, a "run_once fault" happened immediately after we fsynced following an INSERT, which caused the simulator to think the INSERT failed, and later a sim assertion failed because the on-disk database had 1 more row than it thought it would.	2025-08-22 10:13:06 +03:00
Pekka Enberg	b9bb859271	Merge 'Switch to new parser in core' from Levy A. Integrate #2381 to core. Resolves #2337. Reviewed-by: Lâm Hoàng Phúc (@TcMits) Closes #2650	2025-08-22 10:06:37 +03:00
Levy A.	34724a0571	fix: proper parser offset tracking	2025-08-22 02:11:53 -03:00
Pekka Enberg	156352fabc	Merge ' should not return a Completion when there is a page cache hit' from Pedro Muniz Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2723	2025-08-22 07:33:28 +03:00
Pekka Enberg	b4b156cca9	Merge 'github: Remove Intel Mac support' from Pekka Enberg They're very close to end of life and we're having macOS runner availability problems. Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2726	2025-08-22 07:33:18 +03:00
Pekka Enberg	3547bd1093	github: Remove Intel Mac support They're very close to end of life and we're having macOS runner availability problems.	2025-08-21 23:59:15 +03:00
Pekka Enberg	f1bde3c12c	Merge 'github: Reduce Python build matrix' from Pekka Enberg Test with the earliest and latest supported versions, but skip the ones in the middle to reduce our Python CI build matrix. Closes #2724	2025-08-21 23:50:19 +03:00
Jussi Saurio	4edbf3aac0	Merge 'Page cache truncate' from Nikita Sivukhin Add `truncate` method in the page cache which remove all entries which reference pages greater than new DB size. This will be used in the sync engine as in its case DB size can shrink when we "rebase" changes from remote to local. It stands on the #2707 because touch few files from that PR Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2711	2025-08-21 23:34:39 +03:00
Levy A.	ee12ef9fb5	remove unnecessary `Box<ast::Select>`	2025-08-21 17:20:25 -03:00
Pekka Enberg	b5dafe2f07	github: Reduce Python build matrix Test with the earliest and latest supported versions, but skip the ones in the middle to reduce our Python CI build matrix.	2025-08-21 22:50:51 +03:00
Levy A.	8a610a776f	refactor: Result type alias	2025-08-21 15:24:01 -03:00
Levy A.	07975603d3	fix: incorrect sql statement in parser test	2025-08-21 15:24:01 -03:00
Levy A.	d9c7cbf7f2	fix: report TK_ILLEGAL as error	2025-08-21 15:24:01 -03:00
Levy A.	0138aade31	tidy error reporting	2025-08-21 15:24:01 -03:00
Levy A.	8f198a3086	fix create view columns length check	2025-08-21 15:24:01 -03:00
Levy A.	a86a066a91	fix order by function clause	2025-08-21 15:24:01 -03:00
Levy A.	4ba1304fb9	complete parser integration	2025-08-21 15:23:59 -03:00
Levy A.	c6b032de63	feat: add AST formating and checking	2025-08-21 15:19:17 -03:00
Levy A.	f8daf2ff95	fix: copy TokenType's `as_str` from the old parser	2025-08-21 15:19:17 -03:00
Levy A.	b0537d02c7	fix: SQLite operators are left associative	2025-08-21 15:19:16 -03:00
Levy A.	40fce89238	fix: empty variables are allowed	2025-08-21 15:19:16 -03:00
Levy A.	8f10f2f240	fix number parser the added test case was failing	2025-08-21 15:19:16 -03:00
Levy A.	dd39f47cda	feat: add materialized view + implement essential methods for core	2025-08-21 15:19:16 -03:00
Levy A.	186e2f5d8e	switch to new parser	2025-08-21 15:19:16 -03:00
pedrocarlo	6b0ed08465	`read_page` should return No Completion when have a page cache hit	2025-08-21 14:39:24 -03:00
Pekka Enberg	ae8b1eb00d	Merge 'core/io: Don't open file as non-blocking in Unix backend' from Pekka Enberg The Unix backend is a syscall()-based, blocking implementation. The O_NONBLOCK adds nothing. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2708	2025-08-21 19:13:39 +03:00
Nikita Sivukhin	f6729ccc36	fix clippy	2025-08-21 18:27:54 +04:00
Nikita Sivukhin	89e180c2a8	add few tests	2025-08-21 18:27:54 +04:00
Nikita Sivukhin	94f2e5a4b9	add truncate method to the page cache	2025-08-21 18:27:54 +04:00
Jussi Saurio	05f4cec34d	Merge 'Wal api checkpoint seq' from Nikita Sivukhin This PR adds information about checkpoint sequence number to the WAL raw API. Will be used in the sync engine. Depends on the #2699 Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2707	2025-08-21 17:05:45 +03:00
Jussi Saurio	d125daf1f2	Merge 'Use more structured approach in translate_insert' from Jussi Saurio Closes #2686 Note: this PR also incorporates #2700, which I cannot merge separately because it reveals the bug described in #2686, which nothing on `main` currently detects. _(Also as background, the way this all started was: I was trying to enable `UNIQUE` and `PRIMARY KEY` usage in `simulator` in #2641 , but couldn't because it would constantly fail due to #2686.)_ --- I'll admit this PR went in a different direction than I had envisioned. I was trying to debug and minimally fix #2686, but was finding it increasingly hard to understand the flow of `translate_insert` and fix this specific issue without breaking something else. So, I thought it might be benefit from restructuring. --- ## Functional changes - Fixes #2686. * If an index contained a `rowid` alias column, we were inserting `NULL` into the index instead of the actual integer value. * The root cause of this is that SQLite does insert `NULL` in place of the rowid alias column into the table, presumably to save space. * This is not a problem for tables as `SELECT`ing the rowid alias column will always be mapped into `Insn::Rowid`, but it is a major problem for indexes as index lookups will never find anything. ## Code structure changes The responsibility of holding the information about what to insert is now contained in these new data structures: ```rust /// Represents how a table should be populated during an INSERT. #[derive(Debug)] struct Insertion<'a> { /// The integer key ("rowid") provided to the VDBE. key: InsertionKey<'a>, /// The column values that will be fed to the MakeRecord instruction to insert the row. /// If the table has a rowid alias column, it will also be included in this record, /// but a NULL will be stored for it. col_mappings: Vec<ColumnMapping<'a>>, /// The register that will contain the record built using the MakeRecord instruction. record_reg: usize, } #[derive(Debug)] enum InsertionKey<'a> { /// Rowid is not provided by user and will be autogenerated. Autogenerated { register: usize }, /// Rowid is provided via the 'rowid' keyword. LiteralRowid { value_index: Option<usize>, register: usize, }, /// Rowid is provided via a rowid alias column. RowidAlias(ColumnMapping<'a>), } /// Represents how a column in a table should be populated during an INSERT. /// In a vector of InsertionMapping, the index of a given InsertionMapping is /// the position of the column in the table. #[derive(Debug)] struct ColumnMapping<'a> { /// Column definition column: &'a Column, /// Index of the value to use from a tuple in the insert statement. /// This is needed because the values in the insert statement are not necessarily /// in the same order as the columns in the table, nor do they necessarily contain /// all of the columns in the table. /// If None, a NULL will be emitted for the column, unless it has a default value. /// A NULL rowid alias column's value will be autogenerated. value_index: Option<usize>, /// Register where the value will be stored for insertion into the table. register: usize, } ``` --- This gets rid of a few things that are a bit hard to follow in the current implementation: 1. Needing to keep track of "the last rowid explicit value" and other weird edge case code related to rowids ```rust // <old code> // In case when both rowid and rowid-alias column provided in the query - turso-db overwrite rowid with latest value from the list // As we iterate by column in natural order of their definition in scheme, // we need to track last value_index we wrote to the rowid and overwrite rowid register only if new value_index is greater let mut last_rowid_explicit_value = None; ... // <more old code> // When inserting a single row, SQLite writes the value provided for the rowid alias column (INTEGER PRIMARY KEY) // directly into the rowid register and writes a NULL into the rowid alias column. let write_directly_to_rowid_reg = mapping.column.is_rowid_alias; let write_reg = if write_directly_to_rowid_reg { if last_rowid_explicit_value.is_some_and(\|x\| x > value_index) { continue; } last_rowid_explicit_value = Some(value_index); column_registers_start // rowid always the first register in the array for insertion record } else { column_register }; ``` Instead when the `Insertion` struct is constructed, it will simply overwrite `InsertionKey` if a rowid reference is encountered multiple times, so we naturally use the "last seen" rowid value. Moreover, `InsertionKey` is always translated first, so we need no special logic for it: ```rust // <new code> translate_key(program, insertion, &mut translate_value_fn, resolver)?; for col in insertion.col_mappings.iter() { translate_column( program, col.column, col.register, col.value_index, &mut translate_value_fn, resolver, )?; } ``` --- 2. Needing to keep track of registers in the main execution flow: ```rust // <old code> // allocate a register for each column in the table. if not provided by user, they will simply be set as null. // allocate an extra register for rowid regardless of whether user provided a rowid alias column. let num_cols = btree_table.columns.len(); let rowid_and_columns_start_register = program.alloc_registers(num_cols + 1); let columns_start_register = rowid_and_columns_start_register + 1; ``` Now the main execution flow just uses these methods on `Insertion`: ```rust // Create and insert the record program.emit_insn(Insn::MakeRecord { start_reg: insertion.first_col_register(), count: insertion.col_mappings.len(), dest_reg: insertion.record_register(), index_name: None, }); program.emit_insn(Insn::Insert { cursor: cursor_id, key_reg: insertion.key_register(), record_reg: insertion.record_register(), flag: InsertFlags::new(), table_name: table_name.to_string(), }); ``` --- The translation of a row now uses the information in `Insertion` and the implementation is shared between the "insert single row" and "insert multiple rows" cases: ```rust /// Translate the key and the columns of the insertion. /// This function is called by both [translate_rows_single] and [translate_rows_multiple], /// each providing a different [translate_value_fn] implementation, because for multiple rows /// we need to emit the values in a loop, from either an ephemeral table or a coroutine, /// whereas for the single row the translation happens in a single pass without looping. fn translate_rows_base<'short, 'long: 'short>( program: &mut ProgramBuilder, insertion: &'short Insertion<'long>, mut translate_value_fn: impl FnMut(&mut ProgramBuilder, usize, usize) -> Result<()>, resolver: &Resolver, ) -> Result<()> { translate_key(program, insertion, &mut translate_value_fn, resolver)?; for col in insertion.col_mappings.iter() { translate_column( program, col.column, col.register, col.value_index, &mut translate_value_fn, resolver, )?; } Ok(()) } ``` Which gets rid of the duplication in `populate_columns_single_row` and `populate_columns_multiple_rows` in the old implementation. Reviewed-by: Nikita Sivukhin (@sivukhin) Closes #2687	2025-08-21 16:50:03 +03:00
Jussi Saurio	cc28b8833e	Fix condition that checks table.cols against number of provided values	2025-08-21 16:40:10 +03:00
Jussi Saurio	b5bd31a47b	Remove old unused data structures and functions	2025-08-21 16:40:10 +03:00
Jussi Saurio	ac56d5bb67	Use new datastructures and functions in translate_insert	2025-08-21 16:40:10 +03:00
Jussi Saurio	88c4eae63e	Add functions for constructing and translating Insertions	2025-08-21 16:40:10 +03:00
Jussi Saurio	630441e270	Add new Insertion datastructures	2025-08-21 16:40:10 +03:00
Jussi Saurio	215485d403	Add Table::get_column_by_name method	2025-08-21 16:40:10 +03:00
Jussi Saurio	2caea349a9	Add regression test for #2686	2025-08-21 16:40:10 +03:00
Jussi Saurio	dd2e0ea596	Fix: always emit rowid when column is rowid alias SQLite does not store the rowid alias column in the record at all when it is a rowid alias, because the rowid is always stored anyway in the record header.	2025-08-21 16:40:10 +03:00
Jussi Saurio	f9ad43a3a3	Merge 'Fix: all indexes need to be updated if the rowid changes' from Jussi Saurio Found when running simulator in #2641 All indexes store the rowid as the last column, so whenever the rowid of a given row changes the index entry must also be deleted and reinserted with the new index. Reviewed-by: Nikita Sivukhin (@sivukhin) Closes #2712	2025-08-21 16:40:03 +03:00
Nikita Sivukhin	d7e47c1268	fix bug - continue checkpoint as usual even if frames range is degenerate	2025-08-21 17:37:19 +04:00
Jussi Saurio	22be35e790	Add regression test	2025-08-21 16:31:12 +03:00
Jussi Saurio	e224bb15a8	Fix incorrect UNIQUE constraint failure behavior in UPDATE UPDATE should skip over the UNIQUE constraint failure if the existing row it found during the check has the same rowid as the row we are currently updating	2025-08-21 16:30:34 +03:00
Jussi Saurio	e7fbeba120	Merge 'Fix: in UPDATE, insert rowid into index instead of NULL' from Jussi Saurio Same deal as #2700, except this time in UPDATE. Nothing tests this on `main` so not caught. I will later put #2641 into mergeable condition so it will catch all of these going forward. Reviewed-by: Nikita Sivukhin (@sivukhin) Closes #2710	2025-08-21 15:49:55 +03:00
Jussi Saurio	9d44e97a7a	Fix: all indexes need to be updated if the rowid changes	2025-08-21 15:48:46 +03:00
Jussi Saurio	4ceadcca78	Merge 'Remove hardcoded flag usage in DBHeader for encryption' from Avinash Sajjanshetty Previously, we just hardcoded the reserved space with encryption flag. This patch removes that and sets the reserved space if a key was specified during a creation of db Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #2706	2025-08-21 15:46:41 +03:00
Jussi Saurio	d1647390c4	Merge 'add remove_file method to the IO' from Nikita Sivukhin Closes #2705	2025-08-21 15:46:16 +03:00
Jussi Saurio	bec2c4bc79	Merge 'properly execute pragmas - they may require some IO' from Nikita Sivukhin - for example CDC pragma execute create table under the hood Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #2701	2025-08-21 15:45:49 +03:00

1 2 3 4 5 ...

7976 Commits