Yes especially data, simplicity is key as well. Where possible go with simple bu...

alsiola · on June 15, 2023

Really do not share your experience with CosmosDB, some of it's attributes that currently make me miserable include...

- Scaling is not infinite, it's up to 20gb per partition key (1), which can't be changed after document creation.

- One set of global indices, no equivalent to DynamoDB's secondary indices.

- Still can't run their docker container on mac (2) natively.

- Weird SQL-like dialect that's required for all but the simplest queries. JOINs are spectacularly awkward.

- Tooling is horrific. Based on the previous point, no existing tooling works for it (and nobody is building tooling for a DB with such minimal market share). For example, I needed to manually deleted 30 or so documents/rows yesterday - only way to achieve this is with 30 separate click-to-deletes in their UI.

- Minimises analytics options. There exist a plethora of business intelligence type tools that will happily sit on top of most common DBs. None of them like CosmosDB. So you're stuck with synapse link or whatever MS calls it now.

Overall it seems to combine the worst aspects of both RDS and document stores, with the worst aspects of both traditional and serverless infrastructure.

(1) https://learn.microsoft.com/en-us/azure/cosmos-db/partitioni... (2) https://learn.microsoft.com/en-us/azure/cosmos-db/local-emul...

drawkbox · on June 15, 2023

The partition key limitation is something you can work out with smart partitioning and horizontal scaling. We do the same already with storage/tables to prevent large data blocks at a smaller limit even for speed/lookups/map.

Tooling could be improved and will be, it is fairly new still and the Azure Cosmos DB Emulator is not bad.

There is a CosmosDB Synapse setup that allows more analytics/intel on top like you said but same with other NoSQL, takes a bit to get worked in.

I actually like the flexibility of query types and that they include SQL as it makes it a bit more standard and somewhat less vendor lockin. You can use other types as well Mongo/Cassandra/Tables syntax. For filtering the SQL side isn't bad but most of what we do is flat/associative and not heavily normalized. For most of our data we are very cache heavy as well to reduce db hits and retries.

ACID compliance is huge and there are some design considerations.

What cloud DB do you use the most DynamoDB?

alsiola · on June 15, 2023

Most experience with DynamoDB, which isn't without challenges, and has a steepish learning curve to get the most out of it.

I work in pre-market fit startup though, so hyper-scaleability is not really a consideration right now, and would much prefer to be running postgres.

mamcx · on June 15, 2023

> Where possible go with simple but abstracted cloud storage

This is the total opposite of simplicity. Simple is PG and backup setup. DONE.

Maybe you mean "comfort"?

drawkbox · on June 16, 2023

Managed and no need to backup with snapshots. PG still has manage concerns with size, logs, access, scale etc. Nothing you have to do with cloud storage. I do love me some PostgreSQL but use it less and less except for reporting or heavily filtered needs. Horizontal over vertical for comfort and simplicity.