mpn mbn
10/31/2024, 12:05 PMlakectl fs rm -r <lakefs://repo/branch/A>
lakectl fs upload -r <lakefs://repo/branch/A> -s A
My question is:
How can I do this using Python lakefs package?Offir Cohen
10/31/2024, 12:14 PMAmit Kesarwani
10/31/2024, 7:18 PMAmit Kesarwani
10/31/2024, 7:18 PMHT
11/01/2024, 7:20 AMdatasets-versions.yaml
? In which folder is it ?\mpn mbn
11/02/2024, 10:06 AMmpn mbn
11/02/2024, 4:33 PMbranch.delete_objects([o.path for o in branch.objects(prefix="A/")])
and then parse files in local A
dataset dir and upload each file individually.HT
11/03/2024, 8:50 AMlakectl fs rm -r <lakefs://repo/branch/A>
If you can uselakectl fs upload -r <lakefs://repo/branch/A> -s A
rclone
, then you can just do rclone sync
If you have to use python: you can try fsspec/s3fs
They have a sync command, called rsync
( I wrote my own 😛 )
Not sure it's a good idea to have this datasets-versions.yaml
that hold version information :
1. you are versioning within a versioning system (lakefs). Alternative would be using lakefs metadata or tag may be ?
2. this can create conflict if you are working in the same time on 2 different branch and later merge them: you will need to manually overwrite each other as LakeFS do not do content merge.