Excited to share the newly documented use case of CI/CD for data.
For those here who have been using lakeFS and hooks for a long time, this is probably not new. Yet many of us haven’t dwelled on the various possibilities of hooks and the importance of CI/CD and data quality gates in production.
Data quality gates let us catch mistakes before they reach our downstream data. This is especially important when our data is exposed to customers.
What is CI/CD for data, why do you need it, and how to implement it?
Thanks to @Vino, who documented it!