turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-25 10:54:28 +01:00

Author	SHA1	Message	Date
Pekka Enberg	9461e22c06	Merge 'Improve DBSP view serialization' from Glauber Costa Improve serialization for DBSP views. The serialization code was written organically, without much forward thinking about stability as we evolved the table and operator format. Now that this is done, we are at at point where we can actually make it suck less and take a considerable step towards making this production ready. We also add a simple version check (in the table name, because that is much easier than reading contents in parse_schema_row) to prevent views to be used if we had to do anything to evolve the format of the circuit (including the operators) Closes #3351	2025-09-26 09:18:45 +03:00
Glauber Costa	1b5e74060a	make sure that we are able to prevent views from being corrupted as we make changes to the way materialized views are generated (think adding new operators, changing the id of existing operators, etc), we will need to persist the topology of the circuit itself. This is a change that I believe to be premature. For now, it is enough to reserve the first operator id for it, and add a version number to the table name. We can just detect that something changed, and ask the user to drop the view. We can get away with it due to the fact that the views are experimental.	2025-09-25 22:52:08 -03:00
Pere Diaz Bou	91cff65e44	Merge 'Autoincrement' from Pavan Nambi fixes #1976 and #1605 ```zsh turso> DROP TABLE IF EXISTS t; CREATE TABLE t ( id INTEGER PRIMARY KEY AUTOINCREMENT, name TEXT ); turso> INSERT INTO t (name) VALUES ('A'); SELECT * FROM sqlite_sequence; ┌──────┬─────┐ │ name │ seq │ ├──────┼─────┤ │ t │ 1 │ └──────┴─────┘ turso> DROP TABLE IF EXISTS t; CREATE TABLE t ( id INTEGER PRIMARY KEY AUTOINCREMENT, name TEXT ); turso> INSERT INTO t (name) VALUES ('A'); SELECT * FROM sqlite_sequence; ┌──────┬─────┐ │ name │ seq │ ├──────┼─────┤ │ t │ 1 │ └──────┴─────┘ turso> INSERT INTO t (name) VALUES ('A'); SELECT * FROM sqlite_sequence; ┌──────┬─────┐ │ name │ seq │ ├──────┼─────┤ │ t │ 2 │ └──────┴─────┘ turso> ``` Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2983	2025-09-25 18:57:24 +02:00
Jussi Saurio	c18c44b032	fix: result columns have varying binding precedence In e.g. `SELECT x AS y, y AS x FROM t ORDER BY x;`, the `x` in the `ORDER BY` should reference t.y, which has been aliased as `x` for this query. The same goes for GROUP BY, JOIN ON etc. but NOT for WHERE. Previously we had wrong precedence in `bind_and_rewrite_expr`.	2025-09-25 08:07:37 +03:00
Pavan-Nambi	49d5141f2d	Merge remote-tracking branch 'origin/main' into cdc_fail_autoincrement	2025-09-24 18:06:02 +05:30
Jussi Saurio	726bc24e78	Support referring to rowid as _rowid_ or oid	2025-09-24 09:17:28 +03:00
Mayank Verma	7505a2a567	fix CTE alias resolution in planner	2025-09-22 13:51:48 +05:30
Pavan Nambi	47194d7658	Merge branch 'tursodatabase:main' into cdc_fail_autoincrement	2025-09-21 16:03:38 +05:30
Preston Thorpe	6b273af7e9	Merge 'translate/optimize: centralize AST/expr traversal' from Preston Thorpe Previously we were rewriting/traversing the AST in a couple different places, each of these added kinda ad-hoc as we needed them. This attempts to do the binding of column references as well as the rewriting of anonymous `Expr::Variable` -> `__param_N` that we use to maintain the order of bound variables, also normalizes the Qualified Name's. Also we previously weren't accepting Variable (or at least they wouldn't work) in places like `LIMIT ? OFFSET ?`, which this PR adds. I kinda want to keep refactoring translation a bit, and try to break plan building up into more easy-to-digest chunks.. but I will resist the urge right now as it's definitely not high priority pre-beta Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #3210	2025-09-19 08:03:39 -04:00
Glauber Costa	0b3317d449	extract columns from all tables in case of joins. Our code for view needs to extract the list of columns used in the view. We currently extract only from "the base table", but once we have joins, we need a more complex structure, that keeps the mapping of (tables, columns). This actually affects both views and materialized views: for views, the queries with joins work just fine, because views are just aliases for a query. But the list of columns returned by pragma table_info on the view is incorrect. We add a test to make sure it is fixed. For materialized views, we add extensive tests to make sure that the columns are extracted correctly.	2025-09-19 03:57:11 -05:00
PThorpe92	6f446aaf48	remove bind_column_references method and its last usages	2025-09-18 18:59:28 -04:00
PThorpe92	ffd1f87682	Centralize most of the AST traversal by binding columns and rewriting exprs together	2025-09-18 18:38:03 -04:00
Pavan-Nambi	020921f803	Merge remote-tracking branch 'upstream/main' into cdc_fail_autoincrement	2025-09-18 19:27:19 +05:30
TcMits	88119888d0	reduce allocation needed for break_predicate_at_and_boundaries	2025-09-18 10:52:29 +07:00
Piotr Rzysko	f5efcbe745	Add support for window functions Adds initial support for window functions. For now, only existing aggregate functions can be used as window functions—no specialized window-specific functions are supported yet. Currently, only the default frame definition is implemented: RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW EXCLUDE NO OTHERS.	2025-09-13 11:12:44 +02:00
Piotr Rzysko	c81cd16230	Extract QueryDestination::placeholder_for_subquery	2025-09-13 10:49:14 +02:00
Piotr Rzysko	6224cdbbd3	Support WalkControl in walk_expr_mut Now walk_expr_mut can use WalkControl to skip parts of the expression tree. This makes it consistent with walk_expr.	2025-09-13 10:49:14 +02:00
Pavan-Nambi	7191f1cc1c	Merge remote-tracking branch 'upstream/main' into cdc_fail_autoincrement	2025-09-12 15:17:12 +05:30
Jussi Saurio	e3bd00883b	Fix creation of automatic indexes indexes with the naming scheme "sqlite_autoindex_<tblname>_<number>" are automatically created when a table is created with UNIQUE or PRIMARY KEY definitions. these indexes must map to the table definition SQL in definition order, i.e. sqlite_autoindex_foo_1 must be the first instance of UNIQUE or PRIMARY KEY and so on. this commit fixes our autoindex creation / parsing so that this invariant is upheld.	2025-09-11 14:11:30 +03:00
Jussi Saurio	eb2710438c	translate: return parse error for unsupported join types	2025-09-10 15:46:52 +03:00
Pavan-Nambi	e5d3594fa2	fmt	2025-09-10 07:35:20 +05:30
Pavan-Nambi	a04bde12a9	resolve errors that came after merging	2025-09-10 07:34:59 +05:30
Glauber Costa	08b2e685d5	Persistence for DBSP-based materialized views This fairly long commit implements persistence for materialized view. It is hard to split because of all the interdependencies between components, so it is a one big thing. This commit message will at least try to go into details about the basic architecture. Materialized Views as tables ============================ Materialized views are now a normal table - whereas before they were a virtual table. By making a materialized view a table, we can reuse all the infrastructure for dealing with tables (cursors, etc). One of the advantages of doing this is that we can create indexes on view columns. Later, we should also be able to write those views to separate files with ATTACH write. Materialized Views as Zsets =========================== The contents of the table are a ZSet: rowid, values, weight. Readers will notice that because of this, the usage of the ZSet data structure dwindles throughout the codebase. The main difference between our materialized ZSet and the standard DBSP ZSet, is that obviously ours is backed by a BTree, not a Hash (since SQLite tables are BTrees) Aggregator State ================ In DBSP, the aggregator nodes also have state. To store that state, there is a second table. The table holds all aggregators in the view, and there is one table per view. That is __turso_internal_dbsp_state_{view_name}. The format of that table is similar to a ZSet: rowid, serialized_values, weight. We serialize the values because there will be many aggregators in the table. We can't rely on a particular format for the values. The Materialized View Cursor ============================ Reading from a Materialized View essentially means reading from the persisted ZSet, and enhancing that with data that exists within the transaction. Transaction data is ephemeral, so we do not materialize this anywhere: we have a carefully crafted implementation of seek that takes care of merging weights and stitching the two sets together.	2025-09-05 07:04:33 -05:00
Pekka Enberg	44357f93a2	Merge branch 'main' into 2025-08-21-make-limit-and-offset-expr	2025-09-04 09:54:45 +03:00
Preston Thorpe	caaf60a7ea	Merge 'Unify resolution of aggregate functions' from Piotr Rżysko This PR unifies the logic for resolving aggregate functions. Previously, bare aggregates (e.g. `SELECT max(a) FROM t1`) and aggregates wrapped in expressions (e.g. `SELECT max(a) + 1 FROM t1`) were handled differently, which led to duplicated code. Now both cases are resolved consistently. The added benchmark shows a small improvement: ``` Prepare `SELECT first_name, last_name, state, city, age + 10, LENGTH(email), UPPER(first_name), LOWE... time: [59.791 µs 59.898 µs 60.006 µs] change: [-7.7090% -7.2760% -6.8242%] (p = 0.00 < 0.05) Performance has improved. Found 10 outliers among 100 measurements (10.00%) 8 (8.00%) high mild 2 (2.00%) high severe ``` For an existing benchmark, no change: ``` Prepare `SELECT first_name, count(1) FROM users GROUP BY first_name HAVING count(1) > 1 ORDER BY cou... time: [11.895 µs 11.913 µs 11.931 µs] change: [-0.2545% +0.2426% +0.6960%] (p = 0.34 > 0.05) No change in performance detected. Found 8 outliers among 100 measurements (8.00%) 1 (1.00%) low severe 2 (2.00%) high mild 5 (5.00%) high severe ``` Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #2884	2025-09-03 19:46:04 -04:00
TcMits	b0f4dd49d5	use match_ignore_ascii_case macro	2025-09-03 12:01:52 +07:00
Piotr Rzysko	517f23013a	Delay deduplication of aggregate expressions It is not necessary to iterate over existing aggregates for every traversed expression. Instead, do so only when an aggregate function is found.	2025-09-02 08:22:37 +02:00
Piotr Rzysko	569e41cb1e	Skip traversing children of aggregate functions Aggregate functions cannot be nested, and this is validated during the translation of aggregate function arguments. Therefore, traversing their child expressions is unnecessary.	2025-09-02 08:22:37 +02:00
Piotr Rzysko	9b742a64c2	Handle functions with star argument wrapped in expressions Handled in the same way as in `prepare_one_select_plan` for bare function calls.	2025-09-02 08:22:36 +02:00
Piotr Rzysko	f3cbc382ce	Support external aggregate functions wrapped in expressions Handled in the same way as in `prepare_one_select_plan` for bare function calls. In `prepare_one_select_plan`, however, resolving external scalar functions is performed unnecessarily twice.	2025-09-02 08:22:36 +02:00
Piotr Rzysko	d361734819	Remove unnecessary recursion in resolve_aggregates The walk_expr method already traverses arguments, so there is no need to do this explicitly.	2025-09-02 08:22:36 +02:00
Piotr Rzysko	3ad4016080	Fix handling of zero-argument grouped aggregations This commit consolidates the creation of the Aggregate struct, which was previously handled differently in `prepare_one_select_plan` and `resolve_aggregates`. That discrepancy caused inconsistent handling of zero-argument aggregates. The queries added in the new tests would previously trigger a panic.	2025-08-31 12:02:09 +02:00
Pere Diaz Bou	d72be206f2	core/translate: parse_table remove unnecessary clone of table name	2025-08-29 16:42:46 +02:00
themixednuts	79a9f4743e	fix: planner alias and table name	2025-08-27 18:13:03 -05:00
bit-aloo	51d40092db	add empty table references, and error out in case if the table references are present in limit/offset	2025-08-26 19:56:25 +05:30
bit-aloo	a3b87cd97f	add review comments	2025-08-26 19:56:25 +05:30
bit-aloo	9bebc9b5c7	clippy'ed	2025-08-26 19:56:25 +05:30
bit-aloo	a16bee4574	move to new parser	2025-08-26 19:56:24 +05:30
bit-aloo	ffcadd00ae	evaluate limit or offset expr	2025-08-26 19:56:12 +05:30
Pekka Enberg	26ba09c45f	Revert "Merge 'Remove double indirection in the Parser' from Pedro Muniz" This reverts commit `71c1b357e4`, reversing changes made to `6bc568ff69` because it actually makes things slower.	2025-08-26 14:58:21 +03:00
pedrocarlo	d3240844ec	refactor Core to remove the double indirection	2025-08-25 22:59:31 -03:00
Levy A.	4ba1304fb9	complete parser integration	2025-08-21 15:23:59 -03:00
Levy A.	186e2f5d8e	switch to new parser	2025-08-21 15:19:16 -03:00
Glauber Costa	5ab6f78f6b	Implement views Views (non materialized) are relatively simple, since they are just query aliases. We can expand them as if they were subqueries.	2025-08-13 14:14:03 -05:00
Glauber Costa	337f27a433	rename some structures to mention materialized views A lot of the structures we have - like the ones under Schema, are specific for materialized views. In preparation to adding normal views, rename them, so things are less confusing.	2025-08-13 14:13:16 -05:00
Jussi Saurio	f598c86fa4	Merge 'Handle single, double and unquoted strings in values clause' from Mikaël Francoeur I'm not sure how much this will clash with @TcMits's parser rewrite, hopefully not too much. If it does and we eventually have to remove it, at least we'll have two new regression tests. Closes https://github.com/tursodatabase/turso/issues/2484 Closes #2499	2025-08-11 21:08:15 +03:00
Glauber Costa	145d6eede7	Implement very basic views using DBSP This is just the bare minimum that I needed to convince myself that this approach will work. The only views that we support are slices of the main table: no aggregations, no joins, no projections. drop view is implemented. view population is implemented. deletes, inserts and updates are implemented. much like indexes before, a flag must be passed to enable views.	2025-08-10 23:34:04 -05:00
Mikaël Francoeur	2cf4e4fe96	handle single, double and unquoted strings in values clause	2025-08-08 09:03:38 -04:00
Jussi Saurio	21dc2d0161	translate: return parse errors for unsupported features instead of silently ignoring	2025-08-08 11:39:30 +03:00
Piotr Rzysko	8fb4fbf8af	Make WhereTerm::consumed a plain bool Now that virtual tables are integrated into the optimizer, this field no longer needs to be wrapped in Cell<bool>.	2025-08-05 05:48:28 +02:00

1 2 3 4

171 Commits