turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-19 01:24:20 +01:00

Author	SHA1	Message	Date
Pavan-Nambi	7e8dabaee5	make comparison case sensitive	2025-10-12 18:02:03 +05:30
Jussi Saurio	a52dbb7842	Handle table ID / rootpages properly for both checkpointed and non-checkpointed tables Table ID is an opaque identifier that is only meaningful to the MV store. Each checkpointed MVCC table corresponds to a single B-tree on the pager, which naturally has a root page. We cannot use root page as the MVCC table ID directly because: - We assign table IDs during MVCC commit, but - we commit pages to the pager only during checkpoint which means the root page is not easily knowable ahead of time. Hence, we: - store the mapping between table id and btree rootpage - sqlite_schema rows will have a negative rootpage column if the table has not been checkpointed yet.	2025-09-30 16:53:12 +03:00
Jussi Saurio	35b584f050	Merge 'core: change root_page to i64' from Pere Diaz Bou Closes #3454	2025-09-30 12:50:23 +03:00
Pere Diaz Bou	0f631101df	core: change page idx type from usize to i64 MVCC is like the annoying younger cousin (I know because I was him) that needs to be treated differently. MVCC requires us to use root_pages that might not be allocated yet, and the plan is to use negative root_pages for that case. Therefore, we need i64 in order to fit this change.	2025-09-29 18:38:43 +02:00
Nikita Sivukhin	af9d5ab52e	fix bug	2025-09-29 12:28:27 +04:00
Nikita Sivukhin	7a1da051d0	remove usage of expr.to_string() and fix normalize util test	2025-09-29 11:36:12 +04:00
Nikita Sivukhin	00114d9674	remove quotes handling from normalization util	2025-09-29 11:00:19 +04:00
Nikita Sivukhin	f3f9219795	completely remove usage of enum variants	2025-09-26 13:01:49 +04:00
Nikita Sivukhin	fdf8ca88fd	introduce exact(...) function - because enum variant will disappear	2025-09-26 13:01:49 +04:00
PThorpe92	864d113037	Remove dumb comment	2025-09-20 14:38:47 -04:00
Glauber Costa	0b3317d449	extract columns from all tables in case of joins. Our code for view needs to extract the list of columns used in the view. We currently extract only from "the base table", but once we have joins, we need a more complex structure, that keeps the mapping of (tables, columns). This actually affects both views and materialized views: for views, the queries with joins work just fine, because views are just aliases for a query. But the list of columns returned by pragma table_info on the view is incorrect. We add a test to make sure it is fixed. For materialized views, we add extensive tests to make sure that the columns are extracted correctly.	2025-09-19 03:57:11 -05:00
Jussi Saurio	5aa788691b	Merge 'Fix math functions compatibility issues' from Levy A. Adds `round`, `hex`, `unhex`, `abs`, `lower`, `upper`, `sign` and `log` (with base) to the expression fuzzer. Rounding with the precision argument still has some incompatibilities. Closes #3160	2025-09-19 09:15:11 +03:00
Pere Diaz Bou	d53c64e84b	core/schema: parse schema rows for MVCC transactions	2025-09-18 18:22:06 +02:00
PThorpe92	cfc6ec4ee5	Normalize more idents	2025-09-17 15:53:06 -04:00
Levy A.	fcc03d2f6a	fix: compatible math functions also add math functions to fuzzer	2025-09-17 00:23:27 -03:00
Glauber Costa	3565e7978a	Add an index to the dbsp internal table And also change the schema of the main table. I have come to see the current key-value schema as inadequate for non-aggregate operators. Calculating Min/Max, for example, doesn't fit in this schema because we have to be able to track existing values and index them. Another alternative is to keep one table per operator type, but this quickly leads to an explosion of tables.	2025-09-15 22:30:48 -05:00
Jussi Saurio	396091044e	store tx_mode in conn.mv_tx otherwise op_transaction works completely wrong because each separate insert statement overrides the tx_mode to Write	2025-09-14 21:59:08 +03:00
Jussi Saurio	bb74b2eaf9	Merge 'Refactor parseschema' from Jussi Saurio Extracts out duplicated logic from `Schema::make_from_btree()` and `parse_schema_rows()` Closes #3015	2025-09-11 13:35:36 +03:00
Jussi Saurio	f17997fc5d	Extract methods for populating indices/views from schema	2025-09-11 09:51:46 +03:00
Jussi Saurio	07944e23b5	Extract common logic for handling sqlite_schema rows	2025-09-11 09:45:40 +03:00
Levy A.	4070e05cd2	fix: math function parameter conversion	2025-09-10 20:49:30 -03:00
Glauber Costa	08b2e685d5	Persistence for DBSP-based materialized views This fairly long commit implements persistence for materialized views. It is hard to split because of all the interdependencies between components, so it is one big thing. This commit message will at least try to go into details about the basic architecture. Materialized Views as tables ============================ Materialized views are now a normal table - whereas before they were a virtual table. By making a materialized view a table, we can reuse all the infrastructure for dealing with tables (cursors, etc). One of the advantages of doing this is that we can create indexes on view columns. Later, we should also be able to write those views to separate files with ATTACH write. Materialized Views as Zsets =========================== The contents of the table are a ZSet: rowid, values, weight. Readers will notice that because of this, the usage of the ZSet data structure dwindles throughout the codebase. The main difference between our materialized ZSet and the standard DBSP ZSet, is that obviously ours is backed by a BTree, not a Hash (since SQLite tables are BTrees) Aggregator State ================ In DBSP, the aggregator nodes also have state. To store that state, there is a second table. The table holds all aggregators in the view, and there is one table per view. That is __turso_internal_dbsp_state_{view_name}. The format of that table is similar to a ZSet: rowid, serialized_values, weight. We serialize the values because there will be many aggregators in the table. We can't rely on a particular format for the values. The Materialized View Cursor ============================ Reading from a Materialized View essentially means reading from the persisted ZSet, and enhancing that with data that exists within the transaction. Transaction data is ephemeral, so we do not materialize this anywhere: we have a carefully crafted implementation of seek that takes care of merging weights and stitching the two sets together.	2025-09-05 07:04:33 -05:00
TcMits	ddbfb6cc16	make clippy happy	2025-09-04 13:04:53 +07:00
TcMits	ce6ff74cd6	add test	2025-09-04 13:02:10 +07:00
TcMits	94b1cf9ab5	windows iterator returns no values for shorter slice	2025-09-04 12:09:21 +07:00
Pekka Enberg	1de647758f	Merge 'refactor parser fmt' from Lâm Hoàng Phúc @penberg this PR tries to clean up `turso_parser`'s `fmt` code. - `get_table_name` and `get_column_name` should return None when table/column does not exist. ```rust /// Context to be used in ToSqlString pub trait ToSqlContext { /// Given an id, get the table name /// First Option indicates whether the table exists /// /// Currently not considering aliases fn get_table_name(&self, _id: TableInternalId) -> Option<&str> { None } /// Given a table id and a column index, get the column name /// First Option indicates whether the column exists /// Second Option indicates whether the column has a name fn get_column_name(&self, _table_id: TableInternalId, _col_idx: usize) -> Option<Option<&str>> { None } // help function to handle missing table/column names fn get_table_and_column_names( &self, table_id: TableInternalId, col_idx: usize, ) -> (String, String) { let table_name = self .get_table_name(table_id) .map(|s| s.to_owned()) .unwrap_or_else(|| format!("t{}", table_id.0)); let column_name = self .get_column_name(table_id, col_idx) .map(|opt| { opt.map(|s| s.to_owned()) .unwrap_or_else(|| format!("c{col_idx}")) }) .unwrap_or_else(|| format!("c{col_idx}")); (table_name, column_name) } } ``` - remove `FmtTokenStream` because it is the same as `WriteTokenStream` - remove useless functions and simplify `ToTokens` ```rust /// Generate token(s) from AST node /// Also implements Display to make sure devs won't forget Display pub trait ToTokens: Display { /// Send token(s) to the specified stream with context fn to_tokens<S: TokenStream + ?Sized, C: ToSqlContext>( &self, s: &mut S, context: &C, ) -> Result<(), S::Error>; // Return displayer representation with context fn displayer<'a, 'b, C: ToSqlContext>(&'b self, ctx: &'a C) -> SqlDisplayer<'a, 'b, C, Self> where Self: Sized, { SqlDisplayer::new(ctx, self) } } ``` Closes #2748	2025-09-02 18:35:43 +03:00
Pekka Enberg	2addeb5a9f	Merge 'introduce `eq/contains/starts_with/ends_with_ignore_ascii_case` macros' from Lâm Hoàng Phúc depend on #2865 ```sh `ALTER TABLE _ RENAME TO _`/limbo_rename_table/ time: [10.100 ms 10.191 ms 10.283 ms] change: [-16.770% -15.559% -14.417%] (p = 0.00 < 0.05) Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild `ALTER TABLE _ RENAME COLUMN _ TO _`/limbo_rename_column/ time: [7.4829 ms 7.5492 ms 7.6128 ms] change: [-19.397% -18.093% -16.789%] (p = 0.00 < 0.05) Performance has improved. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) low mild 1 (1.00%) high mild `ALTER TABLE _ ADD COLUMN _`/limbo_add_column/ time: [5.3255 ms 5.3713 ms 5.4183 ms] change: [-24.002% -22.612% -21.195%] (p = 0.00 < 0.05) Performance has improved. Found 39 outliers among 100 measurements (39.00%) 17 (17.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild 20 (20.00%) high severe `ALTER TABLE _ DROP COLUMN _`/limbo_drop_column/ time: [5.8858 ms 5.9183 ms 5.9510 ms] change: [-16.233% -14.679% -13.083%] (p = 0.00 < 0.05) Performance has improved. Found 25 outliers among 100 measurements (25.00%) 8 (8.00%) low severe 11 (11.00%) low mild 2 (2.00%) high mild 4 (4.00%) high severe Prepare `SELECT 1`/limbo_parse_query/SELECT 1 time: [590.28 ns 591.31 ns 592.35 ns] change: [-3.7810% -3.5059% -3.2444%] (p = 0.00 < 0.05) Performance has improved. Found 7 outliers among 100 measurements (7.00%) 1 (1.00%) low severe 6 (6.00%) high mild Prepare `SELECT * FROM users LIMIT 1`/limbo_parse_query/SELECT * FROM users LIMIT 1 time: [1.2569 µs 1.2582 µs 1.2596 µs] change: [-5.0125% -4.7516% -4.4933%] (p = 0.00 < 0.05) Performance has improved. Found 7 outliers among 100 measurements (7.00%) 3 (3.00%) low severe 2 (2.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Prepare `SELECT first_name, count(1) FROM users GROUP BY first_name HAVING count(1) > 1 ORDER BY cou... time: [3.7180 µs 3.7227 µs 3.7274 µs] change: [-3.0557% -2.7642% -2.4761%] (p = 0.00 < 0.05) Performance has improved. Found 6 outliers among 100 measurements (6.00%) 2 (2.00%) low mild 4 (4.00%) high mild Execute `SELECT 1`/limbo_execute_select_1 time: [27.455 ns 27.477 ns 27.499 ns] change: [-2.9461% -2.7493% -2.5589%] (p = 0.00 < 0.05) Performance has improved. Found 3 outliers among 100 measurements (3.00%) 1 (1.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Execute `SELECT * FROM users LIMIT ?`/limbo_execute_select_rows/1 time: [410.53 ns 411.05 ns 411.54 ns] change: [-15.364% -15.133% -14.912%] (p = 0.00 < 0.05) Performance has improved. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high mild Execute `SELECT * FROM users LIMIT ?`/limbo_execute_select_rows/10 time: [2.1100 µs 2.1122 µs 2.1145 µs] change: [-11.517% -11.065% -10.662%] (p = 0.00 < 0.05) Performance has improved. Found 4 outliers among 100 measurements (4.00%) 2 (2.00%) low severe 2 (2.00%) low mild Execute `SELECT * FROM users LIMIT ?`/limbo_execute_select_rows/50 time: [9.5156 µs 9.5268 µs 9.5383 µs] change: [-10.284% -10.086% -9.8833%] (p = 0.00 < 0.05) Performance has improved. Found 3 outliers among 100 measurements (3.00%) 1 (1.00%) low severe 2 (2.00%) low mild Execute `SELECT * FROM users LIMIT ?`/limbo_execute_select_rows/100 time: [18.669 µs 18.698 µs 18.731 µs] change: [-9.5949% -9.3407% -9.1140%] (p = 0.00 < 0.05) Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low severe 1 (1.00%) high mild Execute `SELECT count() FROM users`/limbo_execute_select_count time: [7.1027 µs 7.1098 µs 7.1170 µs] change: [-43.739% -43.596% -43.469%] (p = 0.00 < 0.05) Performance has improved. Found 9 outliers among 100 measurements (9.00%) 2 (2.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe ``` Closes #2866	2025-09-02 18:35:14 +03:00
TcMits	635402fc6f	string sometimes used as identifier quoting	2025-09-02 21:35:37 +07:00
TcMits	bfff05faba	merge main	2025-09-02 18:25:20 +07:00
TcMits	33a04fbaf7	resolve conflict	2025-09-02 17:30:10 +07:00
Pekka Enberg	cfaba4ab10	Merge 'Implement libSQL's `ALTER COLUMN` extension' from Levy A. Implement `ALTER COLUMN` as described here: https://github.com/tursodatabase/libsql/blob/main/libsql-sqlite3/doc/libsql_extensions.md#altering-columns - [x] Add `ALTER COLUMN` to parser - [x] Implement `Insn::AlterColumn` - [x] Add tests Closes #2814	2025-09-02 09:06:03 +03:00
TcMits	6e87b08d64	faster type_from_name	2025-09-01 14:38:38 +07:00
TcMits	ed1fb4cabc	remove unnecessary check	2025-09-01 11:51:51 +07:00
TcMits	37f33dc45f	add eq/contains/starts_with/ends_with_ignore_ascii_case	2025-08-31 16:18:42 +07:00
William Souza	b1114734d3	support cipher and encryption key URI options	2025-08-30 10:29:41 -03:00
Levy A.	5b378e3730	feat: add `AlterColumn` instruction also refactor `RenameColumn` to reuse the logic from `AlterColumn`	2025-08-30 03:10:39 -03:00
Pekka Enberg	eba90e1c5e	Merge 'core/util: emit literal, cow instead of replace' from Pere Diaz Bou Closes #2828	2025-08-28 14:33:04 +03:00
Pere Diaz Bou	84c5c4e581	core/util: emit literal, cow instead of replace	2025-08-28 12:59:55 +02:00
TcMits	4ddfdb2a62	finish	2025-08-27 14:58:35 +07:00
Pekka Enberg	26ba09c45f	Revert "Merge 'Remove double indirection in the Parser' from Pedro Muniz" This reverts commit `71c1b357e4`, reversing changes made to `6bc568ff69` because it actually makes things slower.	2025-08-26 14:58:21 +03:00
pedrocarlo	d3240844ec	refactor Core to remove the double indirection	2025-08-25 22:59:31 -03:00
Glauber Costa	911b4c38a6	do not ignore silent failures from view creation We have an issue at the moment that when a materialized view fails to be created, we just swallow the error and leave the database in a funny state. We have can_create_view() to detect those issues early, but not all errors can be detected that early.	2025-08-25 17:48:17 +03:00
Levy A.	4ba1304fb9	complete parser integration	2025-08-21 15:23:59 -03:00
Levy A.	186e2f5d8e	switch to new parser	2025-08-21 15:19:16 -03:00
Jussi Saurio	c21d9871db	Merge 'fix pragma table_info for views' from Glauber Costa We were not generating table_info for views. This PR fixes it. We were so far storing columns as strings with just their names - since this is all we needed - but we will move now to store Columns. We need to convert the names to Column anyway for table_info to work. Closes #2625	2025-08-17 14:40:21 +03:00
PThorpe92	2c526c4c37	Add io_yield_x macros to reduce boilerplate	2025-08-16 16:14:00 -04:00
Glauber Costa	03eeabef18	fix pragma table_info for views We were not generating table_info for views. This PR fixes it. We were so far storing columns as strings with just their names - since this is all we needed - but we will move now to store Columns. We need to convert the names to Column anyway for table_info to work.	2025-08-16 08:03:57 -05:00
Glauber Costa	5ab6f78f6b	Implement views Views (non materialized) are relatively simple, since they are just query aliases. We can expand them as if they were subqueries.	2025-08-13 14:14:03 -05:00
Glauber Costa	337f27a433	rename some structures to mention materialized views A lot of the structures we have - like the ones under Schema, are specific for materialized views. In preparation to adding normal views, rename them, so things are less confusing.	2025-08-13 14:13:16 -05:00
pedrocarlo	82b75330bc	adjust `types.rs` `util.rs` `view.rs` and mvcc to bubble io	2025-08-13 10:24:55 +03:00

151 Commits