Database / CouchDB Interview Questions

What is database compaction in CouchDB and when should you run it?

Database compaction rewrites a CouchDB database file into a new file, keeping the current leaf revisions of each document (the winning revision plus any unresolved conflicts) and discarding the bodies of older revisions along with B-tree nodes that are no longer referenced. Because CouchDB's storage engine is append-only, every update grows the file, so a database with heavy update churn can be many times larger than its live data. Compaction reclaims that space.

When to run compaction:

  • After a bulk data migration or large import that produced deep revision chains.
  • When disk usage grows significantly faster than the document count (high update churn).
  • On a regular nightly schedule in write-heavy production systems.
  • CouchDB 3.x and later can trigger compaction automatically through smoosh (the built-in compaction daemon), which queues a database once its file size exceeds its live data size by a configurable ratio. CouchDB 2.x used an older compaction daemon configured in the [compactions] section.
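To make the "disk usage versus live data" criterion above concrete, here is a minimal sketch that turns two byte counts into a fragmentation percentage. The function names fragmentation_pct and should_compact and the 50% threshold in the usage note are illustrative assumptions, not part of CouchDB. The inputs are assumed to be the sizes.file and sizes.active values reported by GET /mydb on CouchDB 2.x and later.

```shell
# fragmentation_pct FILE_BYTES ACTIVE_BYTES
# Prints the share of the file (integer percent) not occupied by live data.
fragmentation_pct() {
  # An empty or brand-new database has nothing to reclaim.
  if [ "$1" -le 0 ]; then
    echo 0
    return
  fi
  echo $(( ($1 - $2) * 100 / $1 ))
}

# should_compact FILE_BYTES ACTIVE_BYTES THRESHOLD_PCT
# Succeeds (exit 0) when fragmentation is at or above the threshold.
should_compact() {
  [ "$(fragmentation_pct "$1" "$2")" -ge "$3" ]
}
```

For example, with sizes.file = 1000 and sizes.active = 250 read from the database info document, `should_compact 1000 250 50` succeeds because 75% of the file is reclaimable. The right threshold depends on the workload; 50 is only a placeholder.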
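# Manually trigger compaction on a database
# (CouchDB rejects the request unless Content-Type is application/json)
curl -X POST -H "Content-Type: application/json" http://admin:pass@localhost:5984/mydb/_compact
# {"ok":true}

# Compact view indexes of a specific design document
curl -X POST -H "Content-Type: application/json" http://admin:pass@localhost:5984/mydb/_compact/my_ddoc
# {"ok":true}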

# Monitor progress
curl http://admin:pass@localhost:5984/_active_tasks
# Shows compaction tasks with "progress" percentage
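
Besides _active_tasks, the database info document returned by GET /mydb carries a compact_running flag, which is convenient for scripts that need to block until compaction is done. The sketch below is one possible way to poll it; the function names is_compacting and wait_for_compaction and the 5-second default interval are assumptions made for illustration.

```shell
# is_compacting: reads a database info JSON document on stdin and
# succeeds (exit 0) if it reports "compact_running": true.
is_compacting() {
  grep -Eq '"compact_running"[[:space:]]*:[[:space:]]*true'
}

# wait_for_compaction DB_URL [INTERVAL_SECONDS]
# Polls GET /{db} until compact_running is no longer reported as true.
# Caveat: a failed request (server down, bad credentials) also ends the
# loop, so treat a return from this function as "not visibly compacting".
wait_for_compaction() {
  while curl -s "$1" | is_compacting; do
    sleep "${2:-5}"
  done
}
```

A call such as `wait_for_compaction http://admin:pass@localhost:5984/mydb 10` would check every 10 seconds. Because POST _compact only queues the work, a poll issued immediately afterwards may run before compaction has actually started.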

During compaction the database stays fully online: CouchDB continues serving reads and writes from the old file and atomically switches to the new file when compaction finishes. Because the compacted copy is written alongside the original, the disk needs enough free space to hold both files until the switch. View compaction is separate from document compaction; each design document's index has its own compaction command.
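
Since every design document's index is compacted separately, a script has to discover the design documents first. The following is a rough sketch of that loop, not an official tool: ddoc_names and compact_all_views are hypothetical helper names, the design documents are listed with the common _all_docs startkey/endkey range trick, and the JSON is picked apart with grep and sed rather than a real JSON parser.

```shell
# ddoc_names: reads an _all_docs style JSON response on stdin and prints
# each design document name without the "_design/" prefix, one per line.
ddoc_names() {
  grep -Eo '"id"[[:space:]]*:[[:space:]]*"_design/[^"]*"' |
    sed -e 's|.*"_design/||' -e 's|"$||'
}

# compact_all_views BASE_URL DB
# Requests view compaction for every design document in the database.
# Assumes design document names need no URL encoding.
compact_all_views() {
  curl -s "$1/$2/_all_docs?startkey=%22_design/%22&endkey=%22_design0%22" |
    ddoc_names |
    while read -r name; do
      curl -s -X POST -H "Content-Type: application/json" \
        "$1/$2/_compact/$name"
    done
}
```

Running `compact_all_views http://admin:pass@localhost:5984 mydb` would issue one _compact/{design_doc} request per design document. The document file itself is untouched by this loop and still needs the plain _compact call.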

Is the CouchDB database accessible for reads and writes while compaction is running?
What does the _compact/{design_doc} endpoint compact compared to the plain _compact endpoint?
