Joe M
06/13/2024, 3:46 AMJoe M
06/13/2024, 3:46 AMJoe M
06/13/2024, 3:47 AMJoe M
06/13/2024, 3:51 AMJoe M
06/13/2024, 3:54 AMJoe M
06/13/2024, 3:55 AMItai Admi
06/13/2024, 10:39 AMfs.lakefs.delete.bulk_size
. My guess is that 50 is low enough.
3. If you can, please collect metric from lakeFS just before the bulk delete and right after it ends (successfully or with failure). That could help narrow down the root cause.Joe M
06/13/2024, 2:21 PMJoe M
06/13/2024, 2:22 PMItai Admi
06/13/2024, 2:33 PMi'm still committing every 10K uncommitted files.
Can you please explain this logic? Just listing the changes and if it reaches 10K then commit?Joe M
06/13/2024, 2:35 PMdef check_commit(branch, check_count):
diffs = branch.uncommitted(max_amount=check_count)
count = sum(1 for _ in diffs)
log_info_message(f"{count} changes found on branch")
if count >= check_count:
log_info_message(f"{check_count} changes detected, committing now")
commit_time=datetime.now().strftime('%Y%m%d_%H%M%S_%f')
diffs = branch.commit(message=f"committing changes at {commit_time}")
Joe M
06/13/2024, 2:36 PMItai Admi
06/13/2024, 2:37 PMdeleteObjects
to fail and retry in the server side. I guess that with a certain parallelization level from Spark and 1000 bulk_size, you’re bound to have deleteObjects
that are not succeeding after 3 retries.Itai Admi
06/13/2024, 2:37 PMItai Admi
06/13/2024, 2:38 PMJoe M
06/13/2024, 2:41 PMJoe M
06/13/2024, 2:41 PMJoe M
06/13/2024, 2:43 PMItai Admi
06/13/2024, 2:59 PMJoe M
06/13/2024, 3:02 PMJoe M
06/13/2024, 3:03 PMJoe M
06/13/2024, 3:06 PMJoe M
06/13/2024, 3:06 PMJoe M
06/13/2024, 3:07 PMJoe M
06/13/2024, 3:08 PMItai Admi
06/13/2024, 3:27 PMItai Admi
06/13/2024, 3:29 PMJoe M
06/13/2024, 4:22 PMJoe M
06/13/2024, 4:25 PMJoe M
06/13/2024, 4:26 PMJoe M
06/13/2024, 4:28 PMItai Admi
06/13/2024, 4:41 PMJoe M
06/13/2024, 4:43 PMItai Admi
06/13/2024, 4:44 PMJoe M
06/13/2024, 4:45 PMItai Admi
06/13/2024, 4:45 PMItai Admi
06/13/2024, 4:46 PMJoe M
06/13/2024, 4:46 PMJoe M
06/13/2024, 4:47 PMItai Admi
06/13/2024, 4:47 PMJoe M
06/13/2024, 4:49 PMJoe M
06/13/2024, 4:50 PMItai Admi
06/13/2024, 4:53 PM