What is a partition key in Azure Cosmos DB and why is choosing it correctly so important?

A partition key is a property you designate for a Cosmos DB container, and every item in the container carries a value for it. That value is hashed to determine which logical partition the item belongs to, so all items with the same partition key value live in the same logical partition. Cosmos DB distributes logical partitions across physical partitions (the underlying units of storage and compute), and this is how it achieves horizontal scaling: more physical partitions means more throughput and storage capacity spread across more servers.
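
The mapping described above can be sketched as a toy model. This is only an illustration: Cosmos DB's real hash function and range management are internal to the service, so `hashlib.sha256`, the 64-bit hash space, and the function names here are all assumptions made for the example.

```python
import hashlib

HASH_SPACE = 2 ** 64  # assumed size of the hash space for this toy model


def toy_hash(partition_key_value: str) -> int:
    """Stand-in hash; NOT the algorithm Cosmos DB actually uses."""
    digest = hashlib.sha256(partition_key_value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def physical_partition_for(partition_key_value: str, physical_partition_count: int) -> int:
    """Map a partition key value onto one of N equal hash ranges."""
    range_size = HASH_SPACE // physical_partition_count
    index = toy_hash(partition_key_value) // range_size
    # Clamp in case the hash falls into the remainder at the top of the space.
    return min(index, physical_partition_count - 1)


if __name__ == "__main__":
    items = [
        {"id": "1", "userId": "alice"},
        {"id": "2", "userId": "bob"},
        {"id": "3", "userId": "alice"},
    ]
    for item in items:
        # Items 1 and 3 share a partition key value, so they always land together.
        print(item["id"], item["userId"], physical_partition_for(item["userId"], 4))
```

The point of the sketch is that placement depends only on the partition key value: two items with the same value can never end up in different partitions.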

Choosing it correctly matters so much because of two hard constraints Cosmos DB enforces:

  • A single logical partition can hold at most 20 GB of data.
  • A logical partition cannot be split or rebalanced, and an item's partition key value cannot be changed in place. To move an item to a different logical partition you have to delete it and reinsert it with the new value.
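
A minimal sketch of what "delete and reinsert" means in practice, using an in-memory stand-in rather than the real SDK. `ToyContainer` and `move_item` are hypothetical names invented for this example; the only property borrowed from Cosmos DB is that an item is addressed by its partition key value together with its `id`.

```python
class ToyContainer:
    """In-memory stand-in for a container (hypothetical, not the Azure SDK)."""

    def __init__(self, partition_key_field: str):
        self.partition_key_field = partition_key_field
        self.items = {}  # (partition key value, id) -> item

    def create_item(self, item: dict) -> None:
        address = (item[self.partition_key_field], item["id"])
        if address in self.items:
            raise ValueError(f"item already exists at {address}")
        self.items[address] = dict(item)

    def delete_item(self, item_id: str, partition_key: str) -> None:
        del self.items[(partition_key, item_id)]


def move_item(container: ToyContainer, item_id: str, old_key: str, new_key: str) -> dict:
    """'Move' an item by writing a copy under the new key, then deleting the original."""
    moved = dict(container.items[(old_key, item_id)])
    moved[container.partition_key_field] = new_key
    container.create_item(moved)            # write the copy first...
    container.delete_item(item_id, old_key)  # ...then remove the original
    return moved


if __name__ == "__main__":
    container = ToyContainer("userId")
    container.create_item({"id": "order-1", "userId": "alice", "total": 30})
    move_item(container, "order-1", "alice", "bob")
    print(sorted(container.items))
```

The two steps touch two different logical partitions, so against a real container they would be two separate operations rather than one atomic update, which is part of why fixing a bad partition key later is costly.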

A poorly chosen partition key creates hot partitions: a small number of logical partitions absorb the majority of reads and writes while the others sit idle. Even if you provision 50,000 RU/s at the container level, a hot partition effectively limits you to a fraction of that, because RU/s are distributed evenly across physical partitions. Requests to the hot partition are throttled (HTTP 429) once its share is used up, no matter how much capacity the other partitions have left.
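
The arithmetic behind that ceiling, as a small sketch. The count of five physical partitions is an assumption chosen for the example; the actual number depends on the container's data size and provisioned throughput.

```python
def per_partition_throughput(provisioned_ru: int, physical_partitions: int) -> float:
    """RU/s available to each physical partition under an even split."""
    return provisioned_ru / physical_partitions


def hot_key_ceiling(provisioned_ru: int, physical_partitions: int) -> float:
    """A single logical partition lives on one physical partition,
    so it can use at most that partition's share."""
    return per_partition_throughput(provisioned_ru, physical_partitions)


if __name__ == "__main__":
    provisioned = 50_000   # RU/s provisioned on the container
    partitions = 5         # assumed number of physical partitions
    ceiling = hot_key_ceiling(provisioned, partitions)
    print(f"{ceiling:.0f} RU/s usable by one hot key "
          f"({ceiling / provisioned:.0%} of {provisioned} RU/s)")
    # prints: 10000 RU/s usable by one hot key (20% of 50000 RU/s)
```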

A good partition key has high cardinality (many distinct values), spreads reads and writes evenly across those values, and aligns with your most common query patterns so that queries stay within a single partition. Cross-partition queries fan out to all physical partitions and are more expensive. For a social app, userId is typically a better partition key than country, because a single country could easily hold millions of users and hit the 20 GB ceiling.
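
One rough way to compare candidate keys is to measure cardinality and skew on a sample of your data. The sketch below does that with a hypothetical `key_stats` helper and made-up sample items; it only counts items, so it says nothing about item size or request volume, which you would also want to check.

```python
from collections import Counter


def key_stats(items: list, key: str) -> dict:
    """Cardinality and share of the largest value for a candidate partition key."""
    counts = Counter(item[key] for item in items)
    return {
        "cardinality": len(counts),
        "largest_share": max(counts.values()) / len(items),
    }


if __name__ == "__main__":
    # Made-up sample: four users, three of them in the same country.
    sample = [
        {"userId": "u1", "country": "US"},
        {"userId": "u2", "country": "US"},
        {"userId": "u3", "country": "US"},
        {"userId": "u4", "country": "IN"},
    ]
    print("userId :", key_stats(sample, "userId"))
    print("country:", key_stats(sample, "country"))
    # userId : {'cardinality': 4, 'largest_share': 0.25}
    # country: {'cardinality': 2, 'largest_share': 0.75}
```

In this sample `country` has few distinct values and one of them holds most of the items, which is the shape that leads to hot partitions.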

Cosmos DB also supports a synthetic partition key: when no single field is sufficient on its own, you concatenate two or more properties into a higher-cardinality value, store it as its own property on the item, and use that property as the partition key.
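
A minimal sketch of building such a key before writing an item. The property names (`deviceId`, `date`, `partitionKey`), the `-` separator, and the `with_synthetic_key` helper are all assumptions for illustration; the container would need to be created with `/partitionKey` as its partition key path for this to take effect.

```python
def with_synthetic_key(item: dict, parts: list, separator: str = "-",
                       field: str = "partitionKey") -> dict:
    """Return a copy of the item with a concatenated partition key property added."""
    enriched = dict(item)  # leave the caller's item untouched
    enriched[field] = separator.join(str(item[p]) for p in parts)
    return enriched


if __name__ == "__main__":
    reading = {"id": "r-1", "deviceId": "sensor42", "date": "2024-05-01", "temp": 21.5}
    doc = with_synthetic_key(reading, ["deviceId", "date"])
    print(doc["partitionKey"])  # prints: sensor42-2024-05-01
```

Combining the device with the date gives one logical partition per device per day instead of one ever-growing partition per device, while queries for a given device and day can still target a single partition.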
