I love SQLite, and I'm in no way trying to devalue it; the author's method is an excellent approach to getting analytical speed out of SQLite. But I've been loving DuckDB for similar analytical workloads, since it's built for exactly those tasks. Like SQLite, DuckDB reads from a single file, and it processes large data sets at extreme speed. On my MacBook M2 I've been working with about 20 million records, and it is fast, very fast.
Loading data into DuckDB is super easy; I was surprised:
SELECT
avg(sale_price),
count(DISTINCT customer_id)
FROM '/my-data-lake/sales/2024/*.json';
You can also load raw JSON into a JSON-typed column and use Postgres-style operators to pull out fields:
col->>'$.key'
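As a minimal sketch of that in DuckDB's Python API (the table, column, and key names here are made up for illustration):

import duckdb

con = duckdb.connect()  # in-memory database

# Hypothetical table with a JSON-typed column
con.execute("CREATE TABLE events (payload JSON)")
con.execute("""INSERT INTO events VALUES ('{"key": "a", "amount": 42}'), ('{"key": "b", "amount": 7}')""")

# ->> extracts as text, -> extracts as JSON; both accept '$.key'-style paths
print(con.sql("SELECT payload->>'$.key' AS key, payload->'$.amount' AS amount FROM events").fetchall())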
DuckDB is super fast for analytics tasks, especially when you use it with a visual EDA tool like pygwalker. It lets you explore and visualize millions of rows in seconds.
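As a rough sketch of that workflow (assuming pygwalker is installed in a notebook environment; the glob path is borrowed from the comment above), you can run the heavy aggregation in DuckDB and hand only the result to pygwalker:

import duckdb
import pygwalker as pyg

# Aggregate in DuckDB first, so only the summarised result goes into pandas
df = duckdb.sql(
    "SELECT customer_id, avg(sale_price) AS avg_price, count(*) AS n_sales "
    "FROM '/my-data-lake/sales/2024/*.json' "
    "GROUP BY customer_id"
).df()

pyg.walk(df)  # opens the drag-and-drop EDA UI in the notebook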
That said, comparing DuckDB and SQLite is a little unfair. I would still use SQLite to build systems in most cases and reach for DuckDB only for analytics; it's hard to get a smooth deployment across many platforms if your app bundles DuckDB.
Depending on the size and needs of the distributed system or application, I'm really excited about Postgres + pg_lake. Postgres has blown my mind with how well it handles concurrent writes, at least for the kinds of things I build and support for my org, and the pg_lake extension then adds the ability to work like a data-lake-style analytics engine: it decides whether a transaction goes down the normal query path or gets handed to DuckDB, which brings huge aggregation-style queries over massive datasets into reach.
Someone should smush SQLite + DuckDB together and do that kind of switching depending on query type.
It's not an index, it's just (probably parallel) file reads
That being said, it would be trivial to tweak the above script into two steps, one reading data into a DuckDB database table, and the second one reading from that table.
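Roughly, that two-step version could look like this (a sketch using the same hypothetical path; the table name is made up):

import duckdb

con = duckdb.connect('sales.duckdb')  # persistent single-file database

# Step 1: materialise the JSON files into a native DuckDB table once
con.execute("CREATE OR REPLACE TABLE sales AS SELECT * FROM '/my-data-lake/sales/2024/*.json'")

# Step 2: later queries read the local table instead of re-scanning the files
print(con.sql("SELECT avg(sale_price), count(DISTINCT customer_id) FROM sales").fetchone())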