turso

mirror of https://github.com/aljazceru/turso.git synced 2026-01-06 17:54:20 +01:00

Author	SHA1	Message	Date
PThorpe92	dde8a49f4e	normalize identifier for creating autoindex to prevent e.g. sqlite_autoindex`table`_2	2025-09-17 13:25:33 -04:00
PThorpe92	4e71524e42	normalize identifier for ID::Name in upsert expr rewriting	2025-09-17 13:24:06 -04:00
Preston Thorpe	8c53d7f024	Merge 'translation: rewrite expressions and properly handle quoted identifiers in UPSERT' from Preston Thorpe This PR fixes bugs found in the [turso-go](https://github.com/tursodatabase/turso-go) driver with UPSERT clause earlier, where `Gorm` will (obviously) use Expr::Variable's as well as use quotes for `Expr::Qualified` in the tail end of an UPSERT statement. Example: ```sql INSERT INTO users (a,b,c) VALUES (?,?,?) ON CONFLICT (`users`.`a`) DO UPDATE SET b = `excluded`.`b`, a = ?; ``` and previously we were not properly calling `rewrite_expr`, which was not properly setting the anonymous `Expr::Variable` to `__param_N` named parameter, so it would ignore it completely, then return the wrong # of parameters. Also, we didn't handle quoted "`excluded`.`x`", so it would panic in the optimizer that Qualified should have been rewritten earlier. Closes #3157	2025-09-17 11:25:13 -04:00
Preston Thorpe	bcafb288ad	Merge 'Fix is_nonnull returns true on 1 / 0' from Lâm Hoàng Phúc turso: ```sh turso> CREATE TABLE t (x PRIMARY KEY, y, z); turso> INSERT INTO t VALUES (37, -70, -196792117); turso> SELECT * FROM t WHERE (1 / 0) >= -3289742039 < t.x; ┌────┬─────┬────────────┐ │ x │ y │ z │ ├────┼─────┼────────────┤ │ 37 │ -70 │ -196792117 │ └────┴─────┴────────────┘ turso> ``` sqlite: ```sh sqlite> CREATE TABLE t (x PRIMARY KEY, y, z); sqlite> INSERT INTO t VALUES (37, -70, -196792117); sqlite> SELECT * FROM t WHERE (1 / 0) >= -3289742039 < t.x; sqlite> ``` related: https://github.com/tursodatabase/turso/actions/runs/17765571409/job/50488042583?pr=3147#step:8:855 Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #3167	2025-09-17 06:55:55 -04:00
PThorpe92	97c11898fe	Minor refactor in translate/insert	2025-09-17 06:44:10 -04:00
PThorpe92	5dd466941e	Handle upsert even in inserting_multiple_rows case	2025-09-17 06:44:09 -04:00
PThorpe92	85eee42bf1	Support quoted qualified identifiers in UPSERT excluded.x clauses	2025-09-17 06:44:08 -04:00
PThorpe92	d2cd833b86	Rewrite exprs in set + where clause for UPSERT	2025-09-17 06:38:25 -04:00
TcMits	6606bf12d3	is_nonnull returns true on 1 / 0	2025-09-17 14:50:15 +07:00
Pekka Enberg	06d869ea5e	core/ext: Switch vtab_modules from Rc to Arc	2025-09-17 10:36:12 +03:00
Pekka Enberg	17e9f05ea4	core: Convert Rc<Pager> to Arc<Pager>	2025-09-17 09:32:49 +03:00
Jussi Saurio	cae234818b	Merge 'Initial support for window functions' from Piotr Rżysko This adds basic support for window functions. For now: * Only existing aggregate functions can be used as window functions. * Specialized window-specific functions (`rank`, `row_number`, etc.) are not yet supported. * Only the default frame definition is implemented: `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW EXCLUDE NO OTHERS`. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #3079	2025-09-17 08:29:16 +03:00
Pekka Enberg	ae25a0f088	Merge 'Implement Min/Max aggregators' from Glauber Costa We have not implemented them before because they require the raw elements to be kept. It is easy to see why in the following example: ``` current_min = 3; insert(2) => current_min = 2 // can be done without state delete(2) => needs to look at the state to determine new min! ``` The aggregator state was a very simple key-value structure. To accommodate min/max, we will make it into a more complex table, where we can encode a more complex structure. The key insight is that we can use a primary key composed of: ``` 1) storage_id 2) zset_id, 3) element ``` The storage_id and zset_id are our previous key, except they are now exploded to support a larger range of storage_id. With more bits available in the storage_id, we can encode information about which column we are storing. For aggregations in multiple columns, we will need to keep a different list of values for min/max! The element is just the values of the columns. Because this is a primary key, the data will be sorted in the btree. We can then just do a prefix search in the first two components of the key and easily find the min/max when needed. This new format is also adequate for joins. Joins will just have a new storage_id which encodes two "columns" (left side, right side). Closes #3143	2025-09-16 16:19:59 +03:00
Pekka Enberg	74331898a3	Merge 'Add quoted identifier test cases for `ALTER TABLE`' from Levy A. Resolves #2093 There is a small incompatibility on how we quote the added column on the final schema, but doesn't change any behavior. Closes #2943	2025-09-16 11:46:12 +03:00
Glauber Costa	3565e7978a	Add an index to the dbsp internal table And also change the schema of the main table. I have come to see the current key-value schema as inadequate for non-aggregate operators. Calculating Min/Max, for example, doesn't fit in this schema because we have to be able to track existing values and index them. Another alternative is to keep one table per operator type, but this quickly leads to an explosion of tables.	2025-09-15 22:30:48 -05:00
Jussi Saurio	396091044e	store tx_mode in conn.mv_tx otherwise op_transaction works completely wrong because each separate insert statement overrides the tx_mode to Write	2025-09-14 21:59:08 +03:00
Piotr Rzysko	1a95131c3c	Include windows in ToTokens for SelectPlan	2025-09-13 11:12:44 +02:00
Piotr Rzysko	9ff2133ff2	Rewrite window function expressions in the optimizer Currently, this is effectively a no-op because, at the optimization stage, window function expressions are in the form win_func(subquery_column1, subquery_column2, ...). Nevertheless, expressions are rewritten to maintain consistency with aggregates, which also hold cloned expressions from sources like result columns. This ensures future changes in the optimizer won’t break window function handling.	2025-09-13 11:12:44 +02:00
Piotr Rzysko	f5efcbe745	Add support for window functions Adds initial support for window functions. For now, only existing aggregate functions can be used as window functions—no specialized window-specific functions are supported yet. Currently, only the default frame definition is implemented: RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW EXCLUDE NO OTHERS.	2025-09-13 11:12:44 +02:00
Piotr Rzysko	c81cd16230	Extract QueryDestination::placeholder_for_subquery	2025-09-13 10:49:14 +02:00
Piotr Rzysko	1826023c32	Decouple AggArgumentSource::Expression from Aggregate This allows it to be reused for window function processing without relying on the Aggregate struct.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	6c3c44e204	Expose fewer details from AggArgumentSource Hides unnecessary internals to decouple the API from the Aggregate struct.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	5f2a3e1242	Handle dummy argument for count(*) and count() in translation Two main reasons for this change: * Improve readability by moving the logic for this special case closer to the code that relies on it. * Decouple AggFunc from the Aggregate struct. In the future, window function processing will use AggFunc directly, without necessarily depending on Aggregate.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	6d84cbedc2	Fix delimiter handling in group_concat and string_agg Non-literal delimiters must be translated by AggArgumentSource.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	110ffba2a1	Fix accumulator reset when arguments outnumber aggregates Previously, while resetting accumulator registers, we would also reset subsequent registers. This happened because the number of registers to reset was computed as the sum of arguments rather than the number of aggregate functions.	2025-09-13 10:49:14 +02:00
Piotr Rzysko	6224cdbbd3	Support WalkControl in walk_expr_mut Now walk_expr_mut can use WalkControl to skip parts of the expression tree. This makes it consistent with walk_expr.	2025-09-13 10:49:14 +02:00
Pekka Enberg	d8f07fe3da	core: Panic on fsync() error by default Retrying fsync() on error was historically not safe ("fsyncgate") and Postgres still defaults to panicking on fsync(). Therefore, add a "data_sync_retry" pragma (disabled by default) and use it to determine whether to panic on fsync() error or not.	2025-09-13 10:21:12 +03:00
Pekka Enberg	dcd43ab8fc	Merge 'Handle `EXPLAIN QUERY PLAN` like SQLite' from Lâm Hoàng Phúc After this PR: ``` turso> EXPLAIN QUERY PLAN SELECT 1; QUERY PLAN `--SCAN CONSTANT ROW turso> EXPLAIN QUERY PLAN SELECT 1 UNION SELECT 1; QUERY PLAN `--COMPOUND QUERY |--LEFT-MOST SUBQUERY | `--SCAN CONSTANT ROW `--UNION USING TEMP B-TREE `--SCAN CONSTANT ROW turso> CREATE TABLE x(y); turso> CREATE TABLE z(y); turso> EXPLAIN QUERY PLAN SELECT * from x,z; QUERY PLAN |--SCAN x `--SCAN z turso> EXPLAIN QUERY PLAN SELECT * from x,z ON x.y = z.y; QUERY PLAN |--SCAN x `--SEARCH z USING INDEX ephemeral_z_t2 turso> ``` Closes #3057	2025-09-12 20:41:23 +03:00
TcMits	9dac467b40	support EXPLAIN QUERY PLAN	2025-09-12 19:58:45 +07:00
Preston Thorpe	b09dcceeef	Merge 'Fixes views' from Glauber Costa This is a collection of fixes for materialized views ahead of adding support for JOINs. It is mostly issues with how we assume there is a single table, with a single delta, but we have to send more than one. Those are things that are just objectively wrong, so I am sending it separately to make the JOIN PR smaller. Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #3009	2025-09-12 07:43:32 -04:00
Preston Thorpe	f55023acc8	Merge 'Refactor UPSERT to use walk_expr_mut to walk AST.' from Preston Thorpe Working on https://github.com/tursodatabase/turso/issues/2964 I came upon `walk_expr_mut`, I don't think it existed last time I really spent much time in the translator. So quickly went back and cleaned this up. Reviewed-by: Jussi Saurio <jussi.saurio@gmail.com> Closes #3044	2025-09-12 06:45:13 -04:00
TcMits	5dddc5e00b	introduce OP_Explain	2025-09-12 17:31:50 +07:00
Pekka Enberg	06371d8894	Merge 'Add BEGIN CONCURRENT support for MVCC mode' from Pekka Enberg Currently, when MVCC is enabled, every transaction mode supports concurrent reads and writes, which makes it hard to adopt for existing applications that use `BEGIN DEFERRED` or `BEGIN IMMEDIATE`. Therefore, add support for `BEGIN CONCURRENT` transactions when MVCC is enabled. The transaction mode allows multiple concurrent read/write transactions that don't block each other, with conflicts resolved at commit time. Furthermore, implement the correct semantics for `BEGIN DEFERRED` and `BEGIN IMMEDIATE` by taking advantage of the pager level write lock when transaction upgrades to write. This means that now concurrent MVCC transactions are serialized against the legacy ones when needed. The implementation includes: - Parser support for CONCURRENT keyword in BEGIN statements - New Concurrent variant in TransactionMode to distinguish from regular read/write transactions - MVCC store tracking of exclusive transactions to support IMMEDIATE and EXCLUSIVE modes alongside CONCURRENT - Proper transaction state management for all transaction types in MVCC This enables better concurrency for applications that can handle optimistic concurrency control, while still supporting traditional SQLite transaction semantics via IMMEDIATE and EXCLUSIVE modes. Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #3021	2025-09-12 07:38:53 +03:00
PThorpe92	36425b2ada	Refactor UPSERT to use walk_expr_mut to walk AST. Working on https://github.com/tursodatabase/turso/issues/2964 I came upon `walk_expr_mut`, I don't think it existed last time I really spent much time in the translator. So quickly went back and cleaned this up.	2025-09-11 21:08:11 -04:00
Pekka Enberg	433b60555f	Add BEGIN CONCURRENT support for MVCC mode Currently, when MVCC is enabled, every transaction mode supports concurrent reads and writes, which makes it hard to adopt for existing applications that use `BEGIN DEFERRED` or `BEGIN IMMEDIATE`. Therefore, add support for `BEGIN CONCURRENT` transactions when MVCC is enabled. The transaction mode allows multiple concurrent read/write transactions that don't block each other, with conflicts resolved at commit time. Furthermore, implement the correct semantics for `BEGIN DEFERRED` and `BEGIN IMMEDIATE` by taking advantage of the pager level write lock when transaction upgrades to write. This means that now concurrent MVCC transactions are serialized against the legacy ones when needed. The implementation includes: - Parser support for CONCURRENT keyword in BEGIN statements - New Concurrent variant in TransactionMode to distinguish from regular read/write transactions - MVCC store tracking of exclusive transactions to support IMMEDIATE and EXCLUSIVE modes alongside CONCURRENT - Proper transaction state management for all transaction types in MVCC This enables better concurrency for applications that can handle optimistic concurrency control, while still supporting traditional SQLite transaction semantics via IMMEDIATE and EXCLUSIVE modes.	2025-09-11 16:05:52 +03:00
Glauber Costa	98ed6c2b0e	keep alias in logical plan We have been ignoring the alias in the logical plan, but we have to keep it. Implementing joins in particular is made hard without it, because it is common that one has the same column name in different tables, just differentiated by the alias	2025-09-11 05:30:46 -07:00
Jussi Saurio	e3bd00883b	Fix creation of automatic indexes indexes with the naming scheme "sqlite_autoindex_<tblname>_<number>" are automatically created when a table is created with UNIQUE or PRIMARY KEY definitions. these indexes must map to the table definition SQL in definition order, i.e. sqlite_autoindex_foo_1 must be the first instance of UNIQUE or PRIMARY KEY and so on. this commit fixes our autoindex creation / parsing so that this invariant is upheld.	2025-09-11 14:11:30 +03:00
Pekka Enberg	ce5e67a483	Merge 'Return parse error for unsupported exprs' from Jussi Saurio Turns a bunch of `todo!()` crashes into parse errors Reviewed-by: Preston Thorpe <preston@turso.tech> Closes #3007	2025-09-10 17:56:22 +03:00
Pekka Enberg	4efa0a57fd	Merge 'translate: return parse error for unsupported join types' from Jussi Saurio We were silently ignoring these before Closes #3006	2025-09-10 17:14:33 +03:00
Jussi Saurio	84ecef2718	Return parse error for unsupported exprs	2025-09-10 16:10:21 +03:00
Jussi Saurio	eb2710438c	translate: return parse error for unsupported join types	2025-09-10 15:46:52 +03:00
Jussi Saurio	f469113d9f	Don't crash if DELETE uses index	2025-09-10 14:54:51 +03:00
Jussi Saurio	e0ca0cf8af	Enable access path optimizer for DELETE	2025-09-10 14:54:51 +03:00
Jussi Saurio	6d43bdbf71	emit the Delete instruction for the iteration index cursor, and do it last	2025-09-10 14:54:51 +03:00
Jussi Saurio	2d1d284279	optimizer: convert outer join to inner join if possible	2025-09-08 23:21:34 +03:00
Pekka Enberg	f88f39082a	core/vdbe: Fix MakeRecord affinity handling The MakeRecord instruction now accepts an optional affinity_str parameter that applies column-specific type conversions before creating records. When provided, the affinity string is applied character-by-character to each register using the existing apply_affinity_char() function, matching SQLite's behavior. Fixes #2040 Fixes #2041	2025-09-08 18:49:13 +03:00
Pekka Enberg	01879144b6	Merge 'Evaluate left join seek key condition again after null row' from Jussi Saurio Closes #2949 This fixes a special case of the behavior described in #2501 - the special case is that WHERE conditions that were selected as seek predicates for the left join table were not properly evaluated when the right-hand-side table returned no match. The test in commit 12d72d115588a9e744bdb22382998ba1bf9031ab should demonstrate this adequately - this should return no rows, but on `main` it returns `1|NULL`. Closes #2955	2025-09-08 12:01:17 +03:00
Jussi Saurio	f6ede79de7	Do not consume non-join WHERE terms in left joins	2025-09-08 09:35:11 +03:00
Jussi Saurio	ed3c73a194	kargo klippy	2025-09-06 17:51:15 +03:00
Jussi Saurio	430101ab47	expr: use more efficient implementation for binary condition exprs currently we always evaluate the binary expression, then coerce it to zero/null with the `ZeroOrNull` instruction, and then emit a separate jump. this is fine for non-conditional expressions where we are using the value itself (e.g. in a SELECT result column), but in conditionals we don't care about that at all and just want to jump. so: try to keep the spirit of code reuse, but still have distinct implementations for conditionals and non-conditionals.	2025-09-06 17:45:54 +03:00
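
The composite-key idea described in the 'Implement Min/Max aggregators' merge (ae25a0f088) can be illustrated with a small sketch. This is not the code from that PR: the `MinMaxIndex` type, its methods, and the use of an in-memory `BTreeSet` in place of the on-disk btree are all assumptions made for illustration, and the element is simplified to a single `i64` with no Z-set weights.

```rust
use std::collections::btree_set::Range;
use std::collections::BTreeSet;

/// Hypothetical composite key: (storage_id, zset_id, element).
/// Tuples order lexicographically, so all elements that share the
/// first two components are stored contiguously and sorted by element.
type Key = (u64, u64, i64);

/// Illustrative stand-in for the sorted table; not a Turso type.
#[derive(Default)]
pub struct MinMaxIndex {
    keys: BTreeSet<Key>,
}

impl MinMaxIndex {
    /// Record one raw element for the given (storage_id, zset_id) prefix.
    pub fn insert(&mut self, storage_id: u64, zset_id: u64, element: i64) {
        self.keys.insert((storage_id, zset_id, element));
    }

    /// Remove one raw element; the remaining elements stay sorted.
    pub fn delete(&mut self, storage_id: u64, zset_id: u64, element: i64) {
        self.keys.remove(&(storage_id, zset_id, element));
    }

    /// Prefix search: every key whose first two components match.
    fn prefix(&self, storage_id: u64, zset_id: u64) -> Range<'_, Key> {
        self.keys
            .range((storage_id, zset_id, i64::MIN)..=(storage_id, zset_id, i64::MAX))
    }

    /// The smallest element is the first key in the prefix range.
    pub fn min(&self, storage_id: u64, zset_id: u64) -> Option<i64> {
        self.prefix(storage_id, zset_id).next().map(|k| k.2)
    }

    /// The largest element is the last key in the prefix range.
    pub fn max(&self, storage_id: u64, zset_id: u64) -> Option<i64> {
        self.prefix(storage_id, zset_id).next_back().map(|k| k.2)
    }
}
```

Because the raw elements are kept, deleting the current minimum needs no extra bookkeeping: the next call to `min` reads the new first key under the same prefix, which is the case the commit message says a plain key-value state could not handle.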

1508 Commits