turso

mirror of https://github.com/aljazceru/turso.git synced 2025-12-26 20:44:23 +01:00

Author	SHA1	Message	Date
l.gualtieri	3487969c40	fix floating point numbers get truncated in json #877	2025-02-10 12:23:43 +01:00
Pekka Enberg	2eda8a54d9	Merge 'Fix various bugs in B-Tree handling' from Nikita Sivukhin This PR introduce simple fuzz test for BTree insertion algorithm and fixes few bugs found by fuzzer - BTree algorithm returned early although there were overflow pages on stack and more rebalances were needed - BTree balancing algorithm worked under assumption that single page will be enough for rebalance - although this is not always true (if page were tightly packed with relatively big cells, insertion of new very big cell can require 3 split pages to distribute the content between them) - `overflow_cells` wasn't cleared properly during rebalancing - insertions of dividers to the parent node were implemented incorrectly - `defragment_page` didn't reset `PAGE_HEADER_OFFSET_FRAGMENTED_BYTES_COUNT` field which can lead to suboptimal usage of pages Closes #951	2025-02-10 07:40:27 +02:00
meteorgan	a99d9a8988	chore: remove unused dependencies	2025-02-09 21:05:06 +08:00
wyhaya	e9046fef78	cli: Improve pretty mode table	2025-02-09 00:01:07 +08:00
Nikita Sivukhin	75e2f01ec4	print btree for debugging	2025-02-08 13:03:38 +04:00
Nikita Sivukhin	55dd108878	setup simple insertion fuzz test	2025-02-08 12:38:32 +04:00
Pekka Enberg	9657920dd7	Merge 'Add java section in README.md' from Kim Seon Woo Added a section for java in README.md Closes #923	2025-02-08 09:15:04 +02:00
김선우	cc72439032	Add java section in README.md	2025-02-08 12:09:17 +09:00
alpaylan	9c339cb8e1	wip: add delete support to the simulator	2025-02-07 08:19:00 -05:00
Jussi Saurio	d5f58f5fea	Add quickcheck tests for generate_series() and refine implementation	2025-02-06 18:36:21 +02:00
PThorpe92	661c74e338	Apply new planner structure to virtual table impl	2025-02-06 09:15:28 -05:00
Jussi Saurio	f5f77c0bd1	Initial virtual table implementation	2025-02-06 07:51:50 -05:00
Pekka Enberg	238fb9c977	Merge 'Sqlean Crypto extension' from Diego Reis Introduces a new `crypto` extension, compatible with the Sqlean [crypto extension](https://github.com/nalgeon/sqlean/blob/main/docs/crypto.md). Closes #903	2025-02-06 13:46:01 +02:00
Diego Reis	05057a04ac	completes crypto extension It aims to be compatible with https://github.com/nalgeon/sqlean/blob/main/docs/crypto.md	2025-02-06 01:42:47 -03:00
Diego Reis	dd58be3b60	Add basic structure for crypto extension	2025-02-05 23:09:26 -03:00
krishvishal	a3d0e1e974	Remove vector extension from different Cargo.toml files and add `quickcheck`, `quickcheck_macros` and `rand` crates to core's Cargo.toml file	2025-02-06 06:58:41 +05:30
Pekka Enberg	6ea7fa06d2	Merge 'prepare perf: make ProgramBuilder aware of plan to count/estimate required memory' from Jussi Saurio Use knowledge of query plan to inform how much memory to initially allocate for `ProgramBuilder` vectors Some of them are exact, some are semi-random estimates ```sql Prepare `SELECT 1`/Limbo/SELECT 1 time: [756.93 ns 758.11 ns 759.59 ns] change: [-4.5974% -4.3153% -4.0393%] (p = 0.00 < 0.05) Performance has improved. Found 7 outliers among 100 measurements (7.00%) 2 (2.00%) low severe 1 (1.00%) low mild 3 (3.00%) high mild 1 (1.00%) high severe Prepare `SELECT * FROM users LIMIT 1`/Limbo/SELECT * FROM users LIMIT 1 time: [1.4739 µs 1.4769 µs 1.4800 µs] change: [-7.9364% -7.7171% -7.4979%] (p = 0.00 < 0.05) Performance has improved. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high mild Prepare `SELECT first_name, count(1) FROM users GROUP BY first_name HAVING count(1) > 1 ORDER BY cou...` time: [3.7440 µs 3.7520 µs 3.7596 µs] change: [-5.4627% -5.1578% -4.8445%] (p = 0.00 < 0.05) Performance has improved. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high severe ``` Closes #899	2025-02-05 18:24:16 +02:00
Pekka Enberg	56d401fb67	Merge 'Implement `json_set`' from Marcus Nilsson This PR adds support for `json_set`. There are three helper functions added: 1. `json_path_from_owned_value`, this function turns an `OwnedValue` into a `JsonPath`. 2. `find_or_create_target`, this function is similar to `find_target` with the added bonus of creating the target if it doesn't exist. There is a caveat with this function and that is that it will create objects/arrays as it goes, meaning if you send `{}` into it and try getting the path `$.some.nested.array[123].field`, it will return `{"some":{"nested":array:[]}}` since creation of `some`, `nested` and `array` will succeed, but accessing element `123` will fail. 3. `create_and_mutate_json_by_path`, this function is very similar to `mutate_json_by_path` but calls `find_or_create_target` instead of `find_target` Related to #127 Closes #878	2025-02-05 14:15:02 +02:00
Pekka Enberg	fad479ac59	core/mvcc: Move source code to module	2025-02-05 13:25:16 +02:00
Marcus Nilsson	01492cf46f	add support for json_set Test cases are included. Related to #127	2025-02-04 19:09:58 +01:00
Pekka Enberg	e4d7474372	core: Switch to parking_lot for RwLock We really need to make the WAL lock less expensive, but switching to `parking_lot` is anyway something we should do. Before: ``` Execute `SELECT 1`/Limbo time: [56.230 ns 56.463 ns 56.688 ns] ``` After: ``` Execute `SELECT 1`/Limbo time: [52.003 ns 52.132 ns 52.287 ns] ```	2025-02-04 18:38:33 +02:00
Pekka Enberg	733f7de688	Merge branch 'main' into feature/time-ext	2025-02-04 18:27:14 +02:00
Pekka Enberg	73a345aa4d	Limbo 0.0.14	2025-02-04 15:13:18 +02:00
Nikita Sivukhin	91fcb67b06	rewrite grammar generator and add fuzz test for arithmetic expressions	2025-02-02 18:39:24 +04:00
pedrocarlo	a4251e0448	add dynamic linking in extensions/time/Cargo.toml	2025-02-01 20:14:23 -03:00
pedrocarlo	309591044b	Merge branch 'main' into feature/time-ext	2025-01-31 22:54:14 -03:00
Glauber Costa	a7cc367c1f	implement pragma pragma_list List all available pragmas (Except pragma_list)	2025-01-31 06:44:56 -05:00
Pekka Enberg	3a4cb34606	Merge 'Fix memory leaks, make extension types more efficient' from Preston Thorpe I was baffled previously, because any time that `free` was called on a type from an extension, it would hang even when I knew it wasn't in use any longer, and hadn't been double free'd. After #737 was merged, I tried it again and noticed that it would no longer hang... but only for extensions that were staticly linked. Then I realized that we are using a global allocator, that likely wasn't getting used in the shared library that is built separately that won't inherit from our global allocator in core, causing some symbol mismatch and the subsequent hanging on calls to `free`. This PR adds the global allocator to extensions behind a feature flag in the macro that will prevent it from being used in `wasm` and staticly linked environments where it would conflict with limbos normal global allocator. This allows us to properly free the memory from returning extension functions over FFI. This PR also changes the Extension type to a union field so we can store int + float values inline without boxing them. any additional tips or thoughts anyone else has on improving this would be appreciated 👍 Closes #803	2025-01-30 13:31:17 +02:00
pedrocarlo	643ad147c0	checkpoint: implemented time_now, time_fmt_iso, time_date	2025-01-30 01:26:47 -03:00
Pekka Enberg	cfc585813b	Merge 'implement sqlite_source_id function' from Glauber Costa Closes #811	2025-01-29 09:45:00 +02:00
Glauber Costa	8f24d18ad8	implement sqlite_source_id function	2025-01-28 14:55:38 -05:00
Jussi Saurio	e01555467f	Add quickcheck property tests for vector extension	2025-01-28 15:53:11 +02:00
Pekka Enberg	ee05ad172b	core: Bundle vector extension by default	2025-01-28 14:24:09 +02:00
Pekka Enberg	9462426685	Vector extension functions This patch adds some libSQL vector extension functions such as `vector()` and `vector_distance_cos()`, which can be used for exact nearest neighbor search as follows: ``` limbo> SELECT embedding, vector_distance_cos(embedding, '[9, 9, 9]') ...> FROM movies ORDER BY vector_distance_cos(embedding, '[9, 9, 9]'); [4, 5, 6]\|0.013072490692138672 [1, 2, 3]\|0.07417994737625122 ```	2025-01-28 14:24:09 +02:00
PThorpe92	793cdf8bad	Fix memory issues, make extension types more efficient	2025-01-27 22:30:31 -05:00
Pekka Enberg	e8600fa2a1	Merge branch 'main' into static	2025-01-27 09:49:34 +02:00
Pekka Enberg	0918fc40d4	bindings/go: Rename to Limbo ...we'll likely call this Turso eventually, but right now, let's keep the code consistent.	2025-01-26 20:58:10 +02:00
Pekka Enberg	0d0906dce4	Merge 'simulator: implement `--load` and `--watch` flags' from Alperen Keleş The current status of the PR is halfway. The new framing of simulation runner where `setup_simulation` is separated from `run_simulation` allows for injecting custom plans easily. The PR is currently missing the functionality to update the `SimulatorEnv` ad hoc from the plan, as the environment tables were typically created during the planning phase. The next steps will be to implement a function `fn mk_env(InteractionPlan, SimulatorEnv) -> SimulatorEnv`, add `--load` flag to the CLI for loading a serialized plan file, making a corresponding environment and running the simulation. We can optionally combine this with a `--save` option, in which we keep a seed-vault as part of limbo simulator, corresponding each seed with its generated plan and save the time to regenerate existing seeds by just loading them into memory. I am curious to hear thoughts on this? Would the maintainers be open to adding such a seed-vault? Do you think the saved time would be worth the complexity of the approach? Reviewed-by: Pere Diaz Bou <pere-altea@homail.com> Closes #720	2025-01-26 08:52:58 +02:00
PThorpe92	4be1f9c3cc	Begin work on Go bindings (purego)	2025-01-25 11:37:12 -05:00
PThorpe92	c5e60d8e08	Enable only uuid by default, change tests back to account for this	2025-01-21 10:20:01 -05:00
PThorpe92	f13d035965	Enable wasm to static link extensions	2025-01-21 09:36:49 -05:00
PThorpe92	3d188eba0f	Enable staticly linking with builtin extensions	2025-01-21 09:32:43 -05:00
sonhmai	a090fb927a	centralize Rust integration and regression tests	2025-01-21 15:41:09 +07:00
sonhmai	cb631dafdc	feat: wire checkpoint to bytecode execution	2025-01-20 08:34:13 +07:00
sonhmai	6243ffbab4	add dev dependencies for testing wal_checkpoint	2025-01-20 08:34:13 +07:00
sonhmai	66d6291f32	add scaffolding for supporting wal checkpoint	2025-01-20 08:34:13 +07:00
Pekka Enberg	0abb917604	Limbo 0.0.13	2025-01-19 13:30:56 +02:00
alpaylan	e476b9f697	implement watch mode - add `--watch` flag - start saving seeds in persistent storage - make a separate version of execution functions that use `vector of interaction` instead of `InteractionPlan`	2025-01-18 23:54:03 +03:00
Jussi Saurio	af039ffa6e	Merge 'Initial support for aggregate functions in extensions' from Preston Thorpe #708 This PR adds basic support for the following API for defining Aggregates, and changes the existing API for scalars. ```rust register_extension! { scalars: { Double }, aggregates: { MedianState }, } #[derive(ScalarDerive)] struct Double; impl Scalar for Double { fn name(&self) -> &'static str { "double" } fn call(&self, args: &[Value]) -> Value { if let Some(arg) = args.first() { match arg.value_type() { ValueType::Float => { let val = arg.to_float().unwrap(); Value::from_float(val * 2.0) } ValueType::Integer => { let val = arg.to_integer().unwrap(); Value::from_integer(val * 2) } _ => { println!("arg: {:?}", arg); Value::null() } } } else { Value::null() } } } #[derive(AggregateDerive)] struct MedianState; impl AggFunc for MedianState { type State = Vec<f64>; fn name(&self) -> &'static str { "median" } fn args(&self) -> i32 { 1 } fn step(state: &mut Self::State, args: &[Value]) { if let Some(val) = args.first().and_then(Value::to_float) { state.push(val); } } fn finalize(state: Self::State) -> Value { if state.is_empty() { return Value::null(); } let mut sorted = state; sorted.sort_by(\|a, b\| a.partial_cmp(b).unwrap()); let len = sorted.len(); if len % 2 == 1 { Value::from_float(sorted[len / 2]) } else { let mid1 = sorted[len / 2 - 1]; let mid2 = sorted[len / 2]; Value::from_float((mid1 + mid2) / 2.0) } } } ``` I know it's a bit more verbose than the previous version, but I think in the long run this will be more robust, and it matches the aggregate API of implementing a trait on a struct that you derive the relevant trait on. Also this allows for better registration of functions, I think passing in the struct identifiers just feels much better than the `"func_name" => function_ptr` Closes #721	2025-01-18 11:07:06 +02:00
PThorpe92	fc82461eff	Complete percentile extension, enable col+delimeter args	2025-01-17 21:15:09 -05:00

1 2 3 4

152 Commits