Vino
11/10/2022, 6:14 PMOz Katz
11/10/2022, 6:16 PMVino
11/10/2022, 6:25 PM(staging-branch/raw/dt=2022-10-11/sample-data.json)
, do data explorations, run transformations and aggregations and write aggregated data in staging branch (staging-branch/analytics/sampledata-by-country-parquet).
Now I run tests on /analytics
and need to merge only the /analytics
dir into my prod. After the merge, I'd delete the staging-branch.
In the data teams I worked with, best practice was to not work on data in ingress location directly. Always copied the data to be processed into a staging area. So I tried to simulate the same for a lakeFS demo and ran into this requirement.Oz Katz
11/10/2022, 7:05 PMAriel Shaqed (Scolnicov)
11/10/2022, 7:45 PM