12/31/2025 at 5:23:46 PM
About time for the 6th Edition, eh? What would folks include in it?- Vector databases and hybrid search?
- Object storage for all the things? Lake houses. Parquet and beyond.
- Continuously materialized views? I'm not sure this one has made the splash but I think about Naiad (Materialize) and Noria (Readyset)
- NewSQL went mostly mainstream (Spanner wasn't included in the last one, but there's been more here with things like CockroachDB, TiDB, etc)
by vvern
1/1/2026 at 2:35:17 AM
Definitely they should include D4M and GraphQL [1],[2].Not only D4M can cater for structured relational data, it's also suitable for non-structured and sparse data in spreadsheet, matrices and graph. It's essentially a generalization of SQL but for all things data.
There's also integration of D4M with SciDB [3].
[1] D4M: Dynamic Distributed Dimensional Data Model:
[2] GraphQL:
[3] D4M: Bringing associative arrays to database engines:
by teleforce
12/31/2025 at 7:03:41 PM
The object storage stuff is new, but it's mostly confirmed that the older architecture works. MPP with shared (S3) storage and everything above that on local SSD and compute delivers the best performance. Even Snowflake finally came out with "interactive" warehouses with this architecture.Parquet, Iceberg, and other open formats seem good, but they may hit a complexity wall. There's already some inconsistency between platforms, eg with delete vectors.
Incremental view maintenance interests me as well, and I would like to see it more available on different platforms. It's ironic that people use dbt etc. to test every little edit of their manually coded delta pipelines, but don't look at IVM.
by kwillets
12/31/2025 at 6:09:37 PM
LLMs as DBs (if you squint hard enough)by B1FF_PSUVM