Juarez Rudsatz
12/22/2022, 1:01 PMJonathan Rosenberg
12/22/2022, 1:02 PMJuarez Rudsatz
12/22/2022, 1:16 PMJonathan Rosenberg
12/22/2022, 2:14 PMSTORAGE_NAMESPACE/_lakefs/retention/gc/addresses/mark_id=<MARK_ID>/
. Then you can use your own Spark job to read the Parquet file and delete the object marked addresses specified in it (they will be relative to your storage namespace, i.e. the location that you initialized your repo at).
how much percent it will increase the size of the data lake?I’m not too sure of that as it depends on the amount of data you write and change…
Iddo Avneri
12/22/2022, 3:13 PM