turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-20 16:44:19 +01:00

Author	SHA1	Message	Date
Jussi Saurio	29fe3b585a	Add more tests and disable correlated IN-subqueries in HAVING position I discovered a flaw in our current translation that makes queries of type HAVING foo IN (SELECT ...) not work properly - in these cases we need to defer translation of the subquery until later. I will fix this in a future PR because I suspect it's not trivial.	2025-10-29 09:57:55 +02:00
Jussi Saurio	4e48e1ffad	Make an exception for Expr::SubqueryResult in collect_result_columns()	2025-10-28 13:11:12 +02:00
Diego Reis	da323fa0c4	Some clean ups and correctly working on WHERE clauses	2025-10-09 11:57:15 -03:00
Diego Reis	79958f468d	Add jump_target_null to ConditionMetadata It's kinda make sense, conditions can be evaluated into 3 values: false, true and null. Now we handle that.	2025-10-09 11:56:14 -03:00
Jussi Saurio	58ea9e4c3c	clippy	2025-10-02 21:49:33 +03:00
Jussi Saurio	8e2e557da4	Collate: fix Insn::Compare to use collation seq of each compared column	2025-10-02 21:49:33 +03:00
Jussi Saurio	edd4651b97	Collate: add proper collation info for GROUP BY sorter columns	2025-10-02 21:49:33 +03:00
PThorpe92	58625b1c6d	Use expr.is_constant instead of matching for literal directly	2025-09-23 23:08:04 -04:00
PThorpe92	376d2bf7b1	Add plumbing to add sequence column to stabilize tiebreakers in order+group by	2025-09-23 22:35:59 -04:00
Piotr Rzysko	110ffba2a1	Fix accumulator reset when arguments outnumber aggregates Previously, while resetting accumulator registers, we would also reset subsequent registers. This happened because the number of registers to reset was computed as the sum of arguments rather than the number of aggregate functions.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	6f1cd17fcf	Consolidate methods emitting AggStep	2025-08-31 13:29:10 +02:00
Piotr Rzysko	cdba1f1b87	Generalize GroupByAggArgumentSource This is primarily a mechanical change: the enum was moved between files, renamed, and its comments updated so it is no longer strictly tied to GROUP BY aggregations. This prepares the enum for reuse with ungrouped aggregations.	2025-08-31 13:23:12 +02:00
Piotr Rzysko	0a85883ee2	Support external aggregate functions in GROUP BY	2025-08-31 12:02:11 +02:00
Piotr Rzysko	7d179bd9fe	Fix handling of multiple arguments in aggregate functions This bug occurred when arguments were read for the GROUP BY sorter — all arguments were incorrectly resolved to the first column. Added tests confirm that aggregates now work correctly both with and without the sorter.	2025-08-31 12:02:11 +02:00
Piotr Rzysko	3ad4016080	Fix handling of zero-argument grouped aggregations This commit consolidates the creation of the Aggregate struct, which was previously handled differently in `prepare_one_select_plan` and `resolve_aggregates`. That discrepancy caused inconsistent handling of zero-argument aggregates. The queries added in the new tests would previously trigger a panic.	2025-08-31 12:02:09 +02:00
Piotr Rzysko	978a78b79a	Handle COLLATE clause in grouped aggregations Previously, it was only applied to ungrouped aggregations.	2025-08-31 06:51:26 +02:00
Pekka Enberg	26ba09c45f	Revert "Merge 'Remove double indirection in the Parser' from Pedro Muniz" This reverts commit `71c1b357e4`, reversing changes made to `6bc568ff69` because it actually makes things slower.	2025-08-26 14:58:21 +03:00
pedrocarlo	d3240844ec	refactor Core to remove the double indirection	2025-08-25 22:59:31 -03:00
Levy A.	4ba1304fb9	complete parser integration	2025-08-21 15:23:59 -03:00
Levy A.	186e2f5d8e	switch to new parser	2025-08-21 15:19:16 -03:00
Jussi Saurio	dd2e0ea596	Fix: always emit rowid when column is rowid alias SQLite does not store the rowid alias column in the record at all when it is a rowid alias, because the rowid is always stored anyway in the record header.	2025-08-21 16:40:10 +03:00
Jussi Saurio	a99c8a8ca0	Simplify ORDER BY sorter column remapping In case an ORDER BY column exactly matches a result column in the SELECT, the insertion of the result column into the ORDER BY sorter can be skipped because it's already necessarily inserted as a sorting column. For this reason we have a mapping to know what index a given result column has in the order by sorter. This commit makes that mapping much simpler.	2025-08-15 15:48:41 +03:00
Levy A.	ffd6844b5b	refactor: remove `PseudoTable` from `Table` the only reason for `PseudoTable` to exist, is to provide column information for `PseudoCursor` creation. this should not be part of the schema.	2025-06-30 14:31:58 -03:00
Levy A.	3907b387b3	cargo fix	2025-06-30 14:01:47 -03:00
Levy A.	afc55b27f0	refactor: remove unnecessary column definitions for PseudoTable the only information that matters in the amount of column	2025-06-30 13:54:29 -03:00
Pekka Enberg	725c3e4ddc	Rename `limbo_sqlite3_parser` crate to `turso_sqlite3_parser`	2025-06-29 12:34:46 +03:00
Nils Koch	2827b86917	chore: fix clippy warnings	2025-06-23 19:52:13 +01:00
Jussi Saurio	a549f2971d	Merge 'Ephemeral Table in Update' from Pedro Muniz Closes #1713. Adds ephemeral table when a rowid_alias is being updated. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #1726	2025-06-21 19:07:32 +03:00
Piotr Rzysko	64b83a45e8	Fix infinite aggregation loop when sorting is not required Previously, with the `index_experimental` feature enabled, the query in the added test would enter an infinite loop. This happened because `label_grouping_agg_step` pointed to a constant argument that was moved to the end of the program. As a result, the aggregation loop would jump to the constant, then return to the start of the main loop, rewind the index, and re-enter the aggregation loop—causing it to repeat indefinitely.	2025-06-21 10:03:10 +02:00
pedrocarlo	9ae4f6ec40	fix merge conflict problems	2025-06-20 16:38:10 -03:00
pedrocarlo	e53a290a48	move ephemeral table logic to update plan and reuse select logic for ephemeral index	2025-06-20 16:30:21 -03:00
Piotr Rzysko	64a0333119	Fix missing column references in non-aggregate expressions Previously, queries like: ``` SELECT CASE WHEN c0 != 'x' THEN group_concat(c1, ',') ELSE 'x' END FROM t0 GROUP BY c0; ``` would return incorrect results because c0 was not copied during the aggregation loop into a register accessible to the logic processing the grouped results (e.g., the CASE WHEN expression in this example). The same issue applied to expressions in the HAVING and ORDER BY clauses.	2025-06-20 06:19:16 +02:00
Piotr Rzysko	08c1767ba7	Collect non-aggregate columns in one place Previously, the logic for collecting non-aggregate columns was duplicated across multiple locations and implemented inconsistently. This caused a bug that was revealed by the refactoring in this commit (see the added test).	2025-06-20 06:17:14 +02:00
Levy A.	15e0cab8d8	refactor+fix: precompute default values from schema	2025-06-11 14:18:39 -03:00
Jussi Saurio	18dd87eff1	Fix incorrect handling of OR clauses in HAVING	2025-06-10 18:02:14 +03:00
Jussi Saurio	819a6138d0	Merge 'Fix: aggregate regs must be initialized as NULL at the start' from Jussi Saurio Again found when fuzzing nested where clause subqueries: Aggregate registers need to be NULLed at the start because the same registers might be reused on another invocation of a subquery, and if they are not NULLed, the 2nd invocation of the same subquery will have values left over from the first invocation. Reviewed-by: Preston Thorpe (@PThorpe92) Closes #1614	2025-05-30 09:39:37 +03:00
Jussi Saurio	f8257df77b	Fix: aggregate regs must be initialized as NULL at the start	2025-05-29 18:44:53 +03:00
Jussi Saurio	cc405dea7e	Use new TableReferences struct everywhere	2025-05-29 11:44:56 +03:00
Jussi Saurio	77ce4780d9	Fix ProgramBuilder::cursor_ref not having unique keys Currently we have this: program.alloc_cursor_id(Option<String>, CursorType)` where the String is the table's name or alias ('users' or 'u' in the query). This is problematic because this can happen: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` There are two cursors, both with identifier 't'. This causes a bug where the program will use the same cursor for both the main query and the subquery, since they are keyed by 't'. Instead introduce `CursorKey`, which is a combination of: 1. `TableInternalId`, and 2. index name (Option<String> -- in case of index cursors. This should provide key uniqueness for cursors: `SELECT * FROM t WHERE EXISTS (SELECT * FROM t)` here the first 't' will have a different `TableInternalId` than the second `t`, so there is no clash.	2025-05-29 00:59:24 +03:00
Jussi Saurio	4e9d9a2470	Fix LIMIT handling Currently we have some usages of LIMIT where the actual limit counter is initialized next to the DecrJumpZero instruction, and then `program.mark_last_insn_constant()` is used to hoist the counter initialization to the beginning of the program. This is very fragile, and already FROM clause subquery handling works around this with a hack (removed in this PR), and (upcoming) WHERE clause subqueries would also run into problems because of this, because the LIMIT might need to be initialized once for every iteration of the subquery. This PR removes those usages for LIMIT, and LIMIT processing is now more intuitive: - limit counter is now initialized at the start of the query processing - a function init_limit() is extracted to do this for select/update/delete	2025-05-27 21:12:22 +03:00
Jussi Saurio	70965f4b28	Insn::Return: add possibility to fallthrough on non-integer values as per sqlite spec	2025-05-27 19:09:10 +03:00
Jussi Saurio	7c07c09300	Add stable internal_id property to TableReference Currently our "table id"/"table no"/"table idx" references always use the direct index of the `TableReference` in the plan, e.g. in `SelectPlan::table_references`. For example: ```rust Expr::Column { table: 0, column: 3, .. } ``` refers to the 0'th table in the `table_references` list. This is a fragile approach because it assumes the table_references list is stable for the lifetime of the query processing. This has so far been the case, but there exist certain query transformations, e.g. subquery unnesting, that may fold new table references from a subquery (which has its own table ref list) into the table reference list of the parent. If such a transformation is made, then potentially all of the Expr::Column references to tables will become invalid. Consider this example: ```sql -- Assume tables: users(id, age), orders(user_id, amount) -- Get total amount spent per user on orders over $100 SELECT u.id, sub.total FROM users u JOIN (SELECT user_id, SUM(amount) as total FROM orders o WHERE o.amount > 100 GROUP BY o.user_id) sub WHERE u.id = sub.user_id -- Before subquery unnesting: -- Main query table_references: [users, sub] -- u.id refers to table 0, column 0 -- sub.total refers to table 1, column 1 -- -- Subquery table_references: [orders] -- o.user_id refers to table 0, column 0 -- o.amount refers to table 0, column 1 -- -- After unnesting and folding subquery tables into main query, -- the query might look like this: SELECT u.id, SUM(o.amount) as total FROM users u JOIN orders o ON u.id = o.user_id WHERE o.amount > 100 GROUP BY u.id; -- Main query table_references: [users, orders] -- u.id refers to table index 0 (correct) -- o.amount refers to table index 0 (incorrect, should be 1) -- o.user_id refers to table index 0 (incorrect, should be 1) ``` We could ofc traverse every expression in the subquery and rewrite the table indexes to be correct, but if we instead use stable identifiers for each table reference, then all the column references will continue to be correct. Hence, this PR introduces a `TableInternalId` used in `TableReference` as well as `Expr::Column` and `Expr::Rowid` so that this kind of query transformations can happen with less pain.	2025-05-25 20:26:17 +03:00
Jussi Saurio	f6443ae742	Support LIMIT with UNION ALL	2025-05-24 13:12:41 +03:00
Jussi Saurio	0c4c451d2a	rename	2025-05-22 16:51:03 +03:00
Jussi Saurio	f3ea9a603a	add support for SELECT DISTINCT	2025-05-22 16:51:03 +03:00
Jussi Saurio	76227ec274	Rename to Distinctness + add distinctness information to SelectPlan	2025-05-22 16:51:03 +03:00
Pekka Enberg	e102cd0be5	Merge 'Add support for DISTINCT aggregate functions' from Jussi Saurio Reviewable commit by commit. CI failures are not related. Adds support for e.g. `select first_name, sum(distinct age), count(distinct age), avg(distinct age) from users group by 1` Implementation details: - Creates an ephemeral index per distinct aggregate, and jumps over the accumulation step if a duplicate is found Closes #1507	2025-05-20 13:58:57 +03:00
pedrocarlo	4a3119786e	refactor BtreeCursor and Sorter to accept Vec of collations	2025-05-19 15:22:55 -03:00
pedrocarlo	bf1fe9e0b3	Actually fixed group by and order by collation	2025-05-19 15:22:15 -03:00
pedrocarlo	0df6c87f07	Fixed Group By collation	2025-05-19 15:22:14 -03:00

1 2

85 Commits