Integration / Apache NiFi Interview Questions
What is data provenance in Apache NiFi and how do you access it?
Data provenance in NiFi is the complete, immutable audit trail of everything that happens to every FlowFile from when it enters until it leaves or is dropped. NiFi records a provenance event automatically for every significant action — no explicit configuration is required.
Provenance event types include: RECEIVE (data enters NiFi), SEND (data sent to external destination), FETCH, CREATE, FORK (FlowFile split into multiple), JOIN (FlowFiles merged), CLONE, CONTENT_MODIFIED, ATTRIBUTES_MODIFIED, DROP, and REPLAY.
Each event records: timestamp, event type, duration, FlowFile UUID, attributes before and after, the component that processed it, SHA-256 content hashes, and transit URI (the actual endpoint URL for SEND/RECEIVE).
Access provenance via the NiFi global menu → Data Provenance. Search by FlowFile UUID, filename, content size, processor, or time range. From a FORK event you can navigate to child FlowFiles; from JOIN to parents — reconstructing complete lineage. The Replay button on RECEIVE or CONTENT_MODIFIED events re-injects that exact FlowFile state back into the flow, invaluable for debugging and reprocessing.
Invest now in Acorns!!! 🚀
Join Acorns and get your $5 bonus!
Acorns is a micro-investing app that automatically invests your "spare change" from daily purchases into diversified, expert-built portfolios of ETFs. It is designed for beginners, allowing you to start investing with as little as $5. The service automates saving and investing. Disclosure: I may receive a referral bonus.
Invest now!!! Get Free equity stock (US, UK only)!
Use Robinhood app to invest in stocks. It is safe and secure. Use the Referral link to claim your free stock when you sign up!.
The Robinhood app makes it easy to trade stocks, crypto and more.
Webull! Receive free stock by signing up using the link: Webull signup.
More Related questions...
