Our engineering organization utilizes a monorepo for managing our codebase. I have been trying to sort through whether a similar approach makes sense for LakeFS, or if there are specific advantages to having a multi-repository setup within our LakeFS usage. Are there any best practices or guidances here for reference? We are new to LakeFS.
We have a number of different datasets all focused around document processing pipelines, capturing various inputs/outputs/metrics/etc. from them, which we track over time and currently keep updated with the state of our development for use in experiments, tracking system performance, etc.