I've configured lakeFS with a PostgreSQL database. lakeFS then created many tables in the database I specified (101 in total, from kv to kv_99). The first one contains all the information about my repo, branches, commits... but all the others are empty. What is their use?
I looked at this page in the docs, https://docs.lakefs.io/understand/how/kv.html, but it's not clear to me.
05/04/2023, 2:30 PM
Hi Paul, welcome 👋
What you are seeing is how Postgres implements partitioning for the KV table. It is used for improved access and better performance.
05/04/2023, 2:33 PM
Thank you for the answer.
Can you tell me more? What would happen if I kept only the first table filled?
05/04/2023, 2:46 PM
The partitioning is part of the inner workings of Postgres and the KV driver. It's not something you need to handle directly. lakeFS splits metadata across different partitions in a way that allows better performance for queries and batch operations. The main table (kv) will always contain all the metadata lakeFS holds; you can think of the other tables (kv#) as disjoint subsets of the main table.
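To make the "disjoint sets" idea more concrete, here is a minimal Python sketch of hash partitioning. It is only an illustration, under the assumption that each row is routed to one of 100 partitions by hashing a partition key. The hash function, the row contents, and the names `partition_for`, `rows`, and `partitions` are hypothetical and do not reflect the actual hashing done by Postgres or the lakeFS KV driver.

```python
import hashlib

NUM_PARTITIONS = 100  # kv_0 .. kv_99 (assumed from the table count in the question)


def partition_for(partition_key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a partition key to a partition index.

    Uses SHA-256 purely for illustration; this is NOT the hash Postgres uses.
    """
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


# Hypothetical metadata rows: (partition_key, key)
rows = [
    ("repo-a", "branches/main"),
    ("repo-a", "commits/abc123"),
    ("repo-a", "tags/v1"),
    ("repo-b", "branches/main"),
]

# Route every row to exactly one partition table, so partitions never overlap.
partitions: dict[str, list[tuple[str, str]]] = {}
for partition_key, key in rows:
    table = f"kv_{partition_for(partition_key)}"
    partitions.setdefault(table, []).append((partition_key, key))

# Reading the parent table corresponds to the union of all partitions.
all_rows = [row for part in partitions.values() for row in part]

print(f"{len(partitions)} of {NUM_PARTITIONS} partitions hold data")
```

With only a handful of distinct partition keys, at most that many partitions receive rows, which would explain why most of the kv_* tables look empty on a small installation.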
05/09/2023, 11:31 AM
Hello again. I have a question about the storage capacity of lakeFS: what is the maximum size of a repo?
05/09/2023, 11:35 AM
Hi @Paul, so that the whole community can benefit from the question, can you ask it in a separate thread?