Jacob
04/02/2024, 1:54 PM
Jacob
04/02/2024, 1:56 PM
This ensures that data versions persist as long as they are tied to an existing branch or tag.
Jacob
04/02/2024, 1:57 PM
Guy Hardonag
04/02/2024, 2:53 PM
Guy Hardonag
04/02/2024, 2:54 PM
Jacob
04/02/2024, 2:54 PM
Guy Hardonag
04/02/2024, 3:01 PM
Jacob
04/02/2024, 3:05 PM
Guy Hardonag
04/02/2024, 3:15 PM
Ion
04/02/2024, 3:26 PM
Guy Hardonag
04/02/2024, 3:29 PM
Ion
04/02/2024, 3:30 PM
Ion
04/02/2024, 3:31 PM
Ariel Shaqed (Scolnicov)
04/02/2024, 6:38 PM
"Then it makes sense to also have tags act in a similar way as the commits on the HEAD"
I think it's trickier than that and will depend on the intent of the user of the tag. Some tags should be kept until deleted. For instance, you might want to keep the tag version_to_reproduce_bug if you run GC to save space. But if you run GC for compliance then probably not. Meanwhile release_weekly_20230314 is a tag that you will probably want lakeFS to GC, if you're releasing every week. It's actually surprisingly hard to spec out.
The major difference between tags and branch heads is that branch heads are essentially mutable while tags are essentially immutable. So it is easier to assume intent behind branches during a GC.
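To make the difficulty concrete, here is a minimal sketch of what a per-tag rule might look like. Everything in it is hypothetical: lakeFS does not expose a `TagIntent`, a `GCPurpose`, or a `tag_protects_commit` function, and the 30-day window for periodic tags is an arbitrary assumption. It only illustrates that the answer depends on both the tag's intent and the reason GC is being run.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from enum import Enum


class GCPurpose(Enum):
    """Why GC is being run (hypothetical, not a lakeFS setting)."""
    SAVE_SPACE = "save_space"
    COMPLIANCE = "compliance"


class TagIntent(Enum):
    """What the tag's creator meant by it (hypothetical, not stored by lakeFS)."""
    PINNED = "pinned"      # keep until explicitly deleted
    PERIODIC = "periodic"  # e.g. a weekly release, fine to expire


@dataclass(frozen=True)
class Tag:
    name: str
    created: date
    intent: TagIntent


def tag_protects_commit(
    tag: Tag,
    purpose: GCPurpose,
    today: date,
    periodic_retention_days: int = 30,  # assumed window, not a lakeFS default
) -> bool:
    """Return True if this tag should keep its commit's data out of GC."""
    if purpose is GCPurpose.COMPLIANCE:
        # Assumption: a compliance run must delete data regardless of tags.
        return False
    if tag.intent is TagIntent.PINNED:
        # Pinned tags hold their data until someone deletes the tag.
        return True
    # Periodic tags protect their data only for a limited window.
    return today - tag.created <= timedelta(days=periodic_retention_days)


today = date(2024, 4, 2)
bug = Tag("version_to_reproduce_bug", date(2023, 3, 1), TagIntent.PINNED)
weekly = Tag("release_weekly_20230314", date(2023, 3, 14), TagIntent.PERIODIC)

print(tag_protects_commit(bug, GCPurpose.SAVE_SPACE, today))
print(tag_protects_commit(bug, GCPurpose.COMPLIANCE, today))
print(tag_protects_commit(weekly, GCPurpose.SAVE_SPACE, today))
```

The hard part is not this function but where `intent` would come from, since a tag today carries no such information.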
I suggest adding this to the issue @guy hardonag opened.