turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-23 03:04:19 +01:00

Author	SHA1	Message	Date
Glauber Costa	5ab6f78f6b	Implement views Views (non materialized) are relatively simple, since they are just query aliases. We can expand them as if they were subqueries.	2025-08-13 14:14:03 -05:00
Glauber Costa	337f27a433	rename some structures to mention materialized views A lot of the structures we have - like the ones under Schema, are specific for materialized views. In preparation to adding normal views, rename them, so things are less confusing.	2025-08-13 14:13:16 -05:00
Jussi Saurio	f598c86fa4	Merge 'Handle single, double and unquoted strings in values clause' from Mikaël Francoeur I'm not sure how much this will clash with @TcMits's parser rewrite, hopefully not too much. If it does and we eventually have to remove it, at least we'll have two new regression tests. Closes https://github.com/tursodatabase/turso/issues/2484 Closes #2499	2025-08-11 21:08:15 +03:00
Glauber Costa	145d6eede7	Implement very basic views using DBSP This is just the bare minimum that I needed to convince myself that this approach will work. The only views that we support are slices of the main table: no aggregations, no joins, no projections. drop view is implemented. view population is implemented. deletes, inserts and updates are implemented. much like indexes before, a flag must be passed to enable views.	2025-08-10 23:34:04 -05:00
Mikaël Francoeur	2cf4e4fe96	handle single, double and unquoted strings in values clause	2025-08-08 09:03:38 -04:00
Jussi Saurio	21dc2d0161	translate: return parse errors for unsupported features instead of silently ignoring	2025-08-08 11:39:30 +03:00
Piotr Rzysko	8fb4fbf8af	Make WhereTerm::consumed a plain bool Now that virtual tables are integrated into the optimizer, this field no longer needs to be wrapped in Cell<bool>.	2025-08-05 05:48:28 +02:00
Piotr Rzysko	99f87c07c1	Support column references in table-valued function arguments This change extends table-valued function support by allowing arguments to be column references, not only literals. Virtual tables can now reject a plan by returning an error from best_index (e.g., when a TVF argument references a table that appears later in the join order). The planner using this information excludes invalid plans during join order search.	2025-08-05 05:48:28 +02:00
Piotr Rzysko	718598eab8	Introduce scan type Different scan parameters are required for different table types. Currently, index and iteration direction are only used by B-tree tables, while the remaining table types don’t require any parameters. Planning access to virtual tables, however, will require passing additional information from the planner, such as the virtual table index (distinct from a B-tree index) and the constraints that must be forwarded to the `filter` method.	2025-08-04 20:27:22 +02:00
Iaroslav Zeigerman	6f63327320	fix overlooked tests	2025-07-26 04:51:44 -07:00
Iaroslav Zeigerman	f13b9105b9	Fix error handling when binding column references while translating the UPDATE statement	2025-07-26 04:51:42 -07:00
Glauber Costa	b5927dcfd5	support doubly qualified identifiers	2025-07-25 14:52:45 -05:00
Pekka Enberg	669b231714	Merge 'parser: Distinguish quoted identifiers and unify Id into Name enum' from bit-aloo Closes: #1947 This PR replaces the `Name(pub String)` struct with a `Name` enum that explicitly models how the name appeared in the source either as an unquoted identifier (`Ident`) or a quoted string (`Quoted`). In the process, the separate `Id` wrapper type has been coalesced into the `Name` enum, simplifying the AST and reducing duplication in identifier handling logic. While this increases the size of some AST nodes (notably `yyStackEntry`), it improves correctness and makes source structure more explicit for later phases. cc: @levydsa Reviewed-by: Levy A. (@levydsa) Reviewed-by: Preston Thorpe (@PThorpe92) Closes #2251	2025-07-25 12:08:54 +03:00
Glauber Costa	988b16f962	Support ATTACH (read only) Support for attaching databases. The main difference from SQLite is that we support an arbitrary number of attached databases, and we are not bound to just 100ish. We for now only support read-only databases. We open them as read-only, but also, to keep things simple, we don't patch any of the insert machinery to resolve foreign tables. So if an insert is tried on an attached database, it will just fail with a "no such table" error - this is perfect for now. The code in core/translate/attach.rs is written by Claude, who also played a key part in the boilerplate for stuff like the .databases command and extending the pragma database_list, and also aided me in the test cases.	2025-07-24 19:19:48 -05:00
bit-aloo	9a54ef214e	parser: Distinguish quoted identifiers and unify Id into Name enum This commit replaces the `Name(pub String)` struct with a `Name` enum that explicitly models how the name appeared in the source either as an unquoted identifier (`Ident`) or a quoted string (`Quoted`). In the process, the separate `Id` wrapper type has been coalesced into the `Name` enum, simplifying the AST and reducing duplication in identifier handling logic. While this increases the size of some AST nodes (notably `yyStackEntry`), it improves correctness and makes source structure more explicit for later phases.	2025-07-24 14:40:19 +05:30
Glauber Costa	cbdd5c5fc7	improve handling of double quotes I ended up hitting #1974 today and wanted to fix it. I worked with Claude to generate a more comprehensive set of queries that could fail aside from just the insert query described in the issue. He got most of them right - lots of cases were indeed failing. The ones that were gibberish, he told me I was absolutely right for pointing out they were bad. But alas. With the test cases generated, we can work on fixing it. The place where the assertion was hit, all we need to do there is return true (but we assert that this is indeed a string literal, it shouldn't be anything else at this point). There are then just a couple of places where we need to make sure we handle double quotes correctly. We already tested for single quotes in a couple of places, but never for double quotes. There is one funny corner case where you can just select "col" from tbl, and if there is no column "col" on the table, that is treated as a string literal. We handle that too. Fixes #1974	2025-07-18 10:39:02 -05:00
Piotr Rzysko	30ae6538ee	Treat table-valued functions as tables With this change, the following two queries are considered equivalent: ```sql SELECT value FROM generate_series(5, 50); SELECT value FROM generate_series WHERE start = 5 AND stop = 50; ``` Arguments passed in parentheses to the virtual table name are now matched to hidden columns. Column references are still not supported as table-valued function arguments. The only difference is that previously, a query like: ```sql SELECT one.value, series.value FROM (SELECT 1 AS value) one, generate_series(one.value, 3) series; ``` would cause a panic. Now, it returns a proper error message instead. Adding support for column references is more nuanced for two main reasons: - We need to ensure that in joins where a TVF depends on other tables, those other tables are processed first. For example, in: ```sql SELECT one.value, series.value FROM generate_series(one.value, 3) series, (SELECT 1 AS value) one; ``` the one table must be processed by the top-level loop, and series must be nested. - For outer joins involving TVFs, the arguments must be treated as ON predicates, not WHERE predicates.	2025-07-14 07:16:53 +02:00
Piotr Rzysko	9102f4a2f4	Extract table parsing into separate method This is in preparation for reusing the method when parsing `TableCall`s.	2025-07-14 07:16:53 +02:00
Piotr Rzysko	000d70f1f3	Propagate info about hidden columns	2025-07-14 07:16:53 +02:00
Nils Koch	828d4f5016	fix clippy errors for rust 1.88.0 (auto fix)	2025-07-12 18:58:41 +03:00
Pekka Enberg	3f10427f52	core: Fix resolve_function() error messages We need to return the original function name, not normalized one to be compatible with SQLite. Spotted by SQLite TCL tests.	2025-07-09 15:30:57 +03:00
Pekka Enberg	7f91768ff6	core/translate: Unify no such table error messages We're now mixing different error messages, which makes compatibility testing pretty hard. Unify on a single, SQLite compatible error message "no such table".	2025-07-07 11:10:46 +03:00
Pekka Enberg	725c3e4ddc	Rename `limbo_sqlite3_parser` crate to `turso_sqlite3_parser`	2025-06-29 12:34:46 +03:00
Pekka Enberg	2fc5c0ce5c	Switch to runtime flag for enabling indexes Makes it easier to test the feature: ``` $ cargo run -- --experimental-indexes Limbo v0.0.22 Enter ".help" for usage hints. Connected to a transient in-memory database. Use ".open FILENAME" to reopen on a persistent database limbo> CREATE TABLE t(x); limbo> CREATE INDEX t_idx ON t(x); limbo> DROP INDEX t_idx; ```	2025-06-26 10:07:28 +03:00
Nils Koch	2827b86917	chore: fix clippy warnings	2025-06-23 19:52:13 +01:00
Piotr Rzysko	64a0333119	Fix missing column references in non-aggregate expressions Previously, queries like: ``` SELECT CASE WHEN c0 != 'x' THEN group_concat(c1, ',') ELSE 'x' END FROM t0 GROUP BY c0; ``` would return incorrect results because c0 was not copied during the aggregation loop into a register accessible to the logic processing the grouped results (e.g., the CASE WHEN expression in this example). The same issue applied to expressions in the HAVING and ORDER BY clauses.	2025-06-20 06:19:16 +02:00
Pere Diaz Bou	dde93e8deb	disable distinct without index_experimental distinct uses indexes, therefore we need to disable them	2025-06-17 19:33:23 +02:00
Piotr Rzysko	4d35e36b77	Introduce virtual table types	2025-06-01 07:45:57 +02:00
Jussi Saurio	cc405dea7e	Use new TableReferences struct everywhere	2025-05-29 11:44:56 +03:00
Jussi Saurio	73e806ad84	Make WhereTerm::consumed a Cell<bool> Currently in the main translation logic after planning and optimization, we don't _really_ need to pass a &mut Vec<WhereTerm> around anymore, except for the fact that virtual table constraint resolution is done ad-hoc in `init_loop()`. Even there, the only thing we mutate is `WhereTerm::consumed` which is a boolean indicating that the term has been "used up" by the optimizer and shouldn't be evaluated as a normal where clause condition anymore. In the upcoming branch for WHERE clause subqueries, I want to store immutable references to WHERE clause expressions in `Resolver`, but this is unfortunately not possible if we still use the aforementioned mutable references. Hence, we can temporarily make `WhereTerm::consumed` a `Cell<bool>` which allows us to pass an immutable reference to `init_loop()`, and the `Cell` can be removed once the virtual table constraint resolution is moved to an earlier part of the query processing pipeline.	2025-05-28 11:02:39 +03:00
pedrocarlo	bb7da39c72	remove assumption that translate_select is always called from a top-level context + adjust insert to use translate_select when needed	2025-05-25 19:12:30 -03:00
Jussi Saurio	07fa3a9668	Rename SelectQueryType to QueryDestination	2025-05-25 21:23:04 +03:00
Jussi Saurio	7c07c09300	Add stable internal_id property to TableReference Currently our "table id"/"table no"/"table idx" references always use the direct index of the `TableReference` in the plan, e.g. in `SelectPlan::table_references`. For example: ```rust Expr::Column { table: 0, column: 3, .. } ``` refers to the 0'th table in the `table_references` list. This is a fragile approach because it assumes the table_references list is stable for the lifetime of the query processing. This has so far been the case, but there exist certain query transformations, e.g. subquery unnesting, that may fold new table references from a subquery (which has its own table ref list) into the table reference list of the parent. If such a transformation is made, then potentially all of the Expr::Column references to tables will become invalid. Consider this example: ```sql -- Assume tables: users(id, age), orders(user_id, amount) -- Get total amount spent per user on orders over $100 SELECT u.id, sub.total FROM users u JOIN (SELECT user_id, SUM(amount) as total FROM orders o WHERE o.amount > 100 GROUP BY o.user_id) sub WHERE u.id = sub.user_id -- Before subquery unnesting: -- Main query table_references: [users, sub] -- u.id refers to table 0, column 0 -- sub.total refers to table 1, column 1 -- -- Subquery table_references: [orders] -- o.user_id refers to table 0, column 0 -- o.amount refers to table 0, column 1 -- -- After unnesting and folding subquery tables into main query, -- the query might look like this: SELECT u.id, SUM(o.amount) as total FROM users u JOIN orders o ON u.id = o.user_id WHERE o.amount > 100 GROUP BY u.id; -- Main query table_references: [users, orders] -- u.id refers to table index 0 (correct) -- o.amount refers to table index 0 (incorrect, should be 1) -- o.user_id refers to table index 0 (incorrect, should be 1) ``` We could ofc traverse every expression in the subquery and rewrite the table indexes to be correct, but if we instead use stable identifiers for each table reference, then all the column references will continue to be correct. Hence, this PR introduces a `TableInternalId` used in `TableReference` as well as `Expr::Column` and `Expr::Rowid` so that this kind of query transformations can happen with less pain.	2025-05-25 20:26:17 +03:00
Jussi Saurio	8ed5334ca7	tests/fuzz: add compound_select_fuzz()	2025-05-24 13:12:41 +03:00
Jussi Saurio	362347c474	refactor: use walk_expr() in determine_where_to_eval_expr()	2025-05-23 16:27:28 +03:00
Jussi Saurio	9ec84e3905	refactor: use walk_expr() in table_mask_from_expr()	2025-05-23 16:27:28 +03:00
Jussi Saurio	3835a29f47	refactor: use walk_expr() in resolve_aggregates()	2025-05-23 16:27:23 +03:00
Jussi Saurio	2ab5c5f6a9	refactor: use walk_expr_mut() in bind_column_references()	2025-05-23 15:56:49 +03:00
Jussi Saurio	76227ec274	Rename to Distinctness + add distinctness information to SelectPlan	2025-05-22 16:51:03 +03:00
Pekka Enberg	e102cd0be5	Merge 'Add support for DISTINCT aggregate functions' from Jussi Saurio Reviewable commit by commit. CI failures are not related. Adds support for e.g. `select first_name, sum(distinct age), count(distinct age), avg(distinct age) from users group by 1` Implementation details: - Creates an ephemeral index per distinct aggregate, and jumps over the accumulation step if a duplicate is found Closes #1507	2025-05-20 13:58:57 +03:00
Jussi Saurio	d584a1879b	Mark WHERE terms as consumed instead of deleting them We've run into trouble in multiple places due to the fact that we delete terms from the where clause (e.g. when a constant condition is removed, or the term becomes part of an index seek key). A simpler solution is to add a flag indicating that the term is consumed (used), so that it is not translated in the main loop anymore when WHERE clause terms are evaluated.	2025-05-17 15:44:12 +03:00
Jussi Saurio	368c45e025	Add distinctness information to Aggregate struct	2025-05-17 15:33:55 +03:00
Jussi Saurio	a90358f669	TableMask: comments	2025-05-14 09:42:26 +03:00
Jussi Saurio	3b1aef4a9e	Do Less Work (tm) - everything works except ephemeral	2025-05-14 09:42:01 +03:00
Jussi Saurio	1e46f1d9de	Feature: join reordering optimizer	2025-05-14 09:40:48 +03:00
Jussi Saurio	330fedbc2f	Add notion of join ordering to plan + make determining where to eval expr dynamic always	2025-05-03 15:32:06 +03:00
Jussi Saurio	5a1cfb7d15	Add ColumnUsedMask struct to TableReference to track columns referenced in query	2025-04-15 15:13:31 +03:00
Jussi Saurio	c6bea835f9	Fix trying to use index when both sides of comparison refer to same table	2025-04-12 11:13:33 +03:00
Jussi Saurio	a706b7160a	planner: support index backwards seeks and iteration	2025-04-09 10:14:29 +03:00
PThorpe92	13e084351d	Change parse_limit function to accept reference value to ast::Limit	2025-04-04 12:38:18 -04:00
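
The stable-identifier idea described in commit 7c07c09300 can be illustrated with a small sketch. This is not the code from the repository: only the name `TableInternalId` comes from the commit message, while the struct layouts, `ColumnRef`, `resolve`, and `demo` are hypothetical and exist only for illustration. It shows that a column reference keyed by a stable id still resolves to the right table after a subquery's table list has been folded into its parent.

```rust
/// Stable identifier for a table reference. The name is taken from the
/// commit message; this layout is an assumption made for the sketch.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct TableInternalId(usize);

/// Hypothetical, simplified table reference.
struct TableReference {
    internal_id: TableInternalId,
    name: &'static str,
}

/// Hypothetical column reference that names its table by stable id
/// instead of by position in the table list.
struct ColumnRef {
    table: TableInternalId,
    column: usize,
}

/// Look the table up by id, so the position in `tables` is irrelevant.
fn resolve(tables: &[TableReference], col: &ColumnRef) -> Option<(&'static str, usize)> {
    tables
        .iter()
        .find(|t| t.internal_id == col.table)
        .map(|t| (t.name, col.column))
}

fn demo() -> ((&'static str, usize), (&'static str, usize)) {
    // Parent query tables: [users, sub]; subquery tables: [orders].
    let mut parent = vec![
        TableReference { internal_id: TableInternalId(1), name: "users" },
        TableReference { internal_id: TableInternalId(2), name: "sub" },
    ];
    let subquery = vec![TableReference { internal_id: TableInternalId(3), name: "orders" }];

    // References created before any transformation.
    let u_id = ColumnRef { table: TableInternalId(1), column: 0 };
    let o_amount = ColumnRef { table: TableInternalId(3), column: 1 };

    // "Unnesting": drop the subquery table and fold its tables into the parent.
    parent.retain(|t| t.name != "sub");
    parent.extend(subquery);

    // `orders` moved from position 0 (in the subquery) to position 1 (in the
    // parent), yet both references still resolve without being rewritten.
    (
        resolve(&parent, &u_id).unwrap(),
        resolve(&parent, &o_amount).unwrap(),
    )
}

fn main() {
    let (u, o) = demo();
    println!("{}.{} {}.{}", u.0, u.1, o.0, o.1);
}
```

With positional indexes, `o_amount` would have had to be rewritten from table 0 to table 1 after the fold; looking tables up by id avoids that rewrite at the cost of a search per lookup in this simplified version.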

128 Commits