# help
Hello again, I'm currently working with a setup where I use lakeFS to version control TileDB data stored in S3. Specifically, I'm working with TileDB-SOMA data arrays, which are designed for single-cell data analysis. I successfully created a main branch and loaded TileDB data into it (multiple files in a particular format) from AWS EC2. But when I try to load this "experiment" (i.e. an instance of a TileDB dataset) via lakeFS in Python, I get an error that the data does not exist. I'm still new to lakeFS, so I'm not sure whether this is something I can fix on my side or whether there is an inherent incompatibility between TileDB and lakeFS. Do you plan on supporting TileDB in the near future, or is there a way I could go about solving this? Thank you in advance!
Never mind, I had to specify the lakeFS endpoint to make it work.
tiledb_config["vfs.s3.endpoint_override"] = lakefs_endpoint
This reads the experiment as expected. Thanks anyways!
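For anyone who lands here with the same error, here is a minimal sketch of how the fix above might look end to end. Only the `vfs.s3.endpoint_override` line comes from my actual setup; the other config keys (path-style addressing, scheme, credentials), the helper names `lakefs_tiledb_config` and `open_experiment`, and the example endpoint are assumptions for illustration, so adjust them to your installation. The URI follows the lakeFS S3 gateway layout `s3://<repo>/<branch>/<path>`.

```python
from urllib.parse import urlparse


def lakefs_tiledb_config(lakefs_endpoint, access_key_id, secret_access_key):
    """Build a TileDB config dict that points the S3 backend at lakeFS.

    Hypothetical helper. Assumes TileDB wants the host and the scheme
    in separate keys, so the endpoint URL is split here.
    """
    if "://" not in lakefs_endpoint:
        # Assumption: default to https when no scheme is given.
        lakefs_endpoint = "https://" + lakefs_endpoint
    parsed = urlparse(lakefs_endpoint)
    return {
        # The setting that fixed the "data does not exist" error.
        "vfs.s3.endpoint_override": parsed.netloc,
        "vfs.s3.scheme": parsed.scheme,
        # Assumption: path-style requests (no bucket-name subdomains).
        "vfs.s3.use_virtual_addressing": "false",
        # lakeFS credentials, not AWS credentials.
        "vfs.s3.aws_access_key_id": access_key_id,
        "vfs.s3.aws_secret_access_key": secret_access_key,
    }


def open_experiment(repo, branch, path, tiledb_config):
    """Open a SOMA experiment through the lakeFS S3 gateway."""
    import tiledbsoma  # imported lazily; requires the tiledbsoma package

    uri = f"s3://{repo}/{branch}/{path}"
    ctx = tiledbsoma.SOMATileDBContext(tiledb_config=tiledb_config)
    return tiledbsoma.Experiment.open(uri, context=ctx)


# Placeholder endpoint and credentials, for illustration only.
tiledb_config = lakefs_tiledb_config(
    "https://lakefs.example.com", "my-lakefs-key-id", "my-lakefs-secret"
)
```

With that config in hand, something like `open_experiment("my-repo", "main", "path/to/experiment", tiledb_config)` should read the experiment from the `main` branch, assuming the repository and path exist.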