5/3/2026 at 11:46:34 PM
S3: Simple Storage Service. It's a building block, and it's only natural other abstractions are built on top of it.by gberger
5/4/2026 at 7:36:57 AM
Agree it doesn't have to be part of S3 itself. My point is that there is a missing semantic layer.In practice, many teams use S3 directly without any layer on top. So without better organizational capabilities, they can't keep track of what they have stored where, who created it, whether it is still used, etc.
And when teams do use a catalog, it's usually detached from the storage layer itself, so you can't easily view a dataset in the catalog and know how much it costs, who accessed it, and so on.
Have you seen better places that figured out a better way to handle this? Without a ton of custom tooling?
by sagiba
5/4/2026 at 2:57:20 AM
No but why doesn’t this object-storage-primitive accommodate all my specific requirements already?They should also accommodate my need for all POSIX filesystem API’s included cheap-moves and renames!!!!!
/s
by FridgeSeal
5/4/2026 at 7:40:31 AM
POSIX isn't the ask. Datasets are. The need to keep track of what data you have stored is universal, not my specific requirement.by sagiba
5/4/2026 at 11:10:22 AM
I make the (glib) comment, because it’s a similar argument to the one that was popular a few years ago.S3 is an object store. Treat it more like a KV store. As other comments have pointed out, the solution here is pick-your-favourite-metadata-store, be it Postgres, or what iceberg does, and other data on S3.
by FridgeSeal