Ants Young
01/17/2023, 9:05 AM
Adi Polak
01/17/2023, 9:55 AM
Ants Young
01/17/2023, 12:00 PM
When users run ls /path/to/data in the container, they should see the files just as they would on their local host. That's what we want to implement.
It would be intrusive to require our users to adopt a specific data version management tool or Python SDK in their code just to download files from the repo; that choice is better left to the user. Mounting files into the container is the best approach, but having the platform download the files first and then mount them is not an elegant solution. Performance is acceptable in most cases.
dagshub mount and pachctl mount both use FUSE, and so do s3fs-fuse and JuiceFS. So FUSE may be one of the best ways to achieve this.
https://github.com/DagsHub/client/blob/master/docs/index.md#mounted-filesystem---experimental
https://docs.pachyderm.com/2.4.x/how-tos/basic-data-operations/export-data-out-pachyderm/mount-repo-to-local-computer/
By the way, Pachyderm supports mounting a dataset version in Jupyter; please see https://docs.pachyderm.com/2.4.x/how-tos/jupyterlab-extension/
Amit Kesarwani
01/17/2023, 6:52 PM
This command uses the Filesystem in Userspace (FUSE) user interface to export a Pachyderm File System (PFS) to a Unix computer system. This functionality is useful when you want to pull data locally to experiment.
So I think the data will be copied locally in this case, and it may not meet your requirements.
Ants Young
01/18/2023, 1:20 AM
Or Tzabary
01/19/2023, 7:22 AM
Amit Kesarwani
01/20/2023, 5:32 AM
# Store the lakeFS credentials in the format s3fs expects (ACCESS_KEY:SECRET_KEY)
echo "<lakefs access key>:<lakefs secret key>" > "${HOME}/.passwd-s3fs"
chmod 600 "${HOME}/.passwd-s3fs"
sudo mkdir -p /path/to/mountpoint
# If you want to mount a whole lakeFS repository (branches show up as top-level directories)
sudo s3fs <lakefs repo name> /path/to/mountpoint -o passwd_file="${HOME}/.passwd-s3fs" -o url=<lakefs endpoint url> -o use_path_request_style
sudo ls -lh /path/to/mountpoint/
sudo cat /path/to/mountpoint/<lakefs branch name>/<file name>
sudo touch /path/to/mountpoint/<lakefs branch name>/test1.txt
# If you want to mount a particular branch of a lakeFS repository
sudo s3fs <lakefs repo name>:/<lakefs branch name> /path/to/mountpoint -o passwd_file="${HOME}/.passwd-s3fs" -o url=<lakefs endpoint url> -o use_path_request_style
sudo cat /path/to/mountpoint/<file name>
sudo touch /path/to/mountpoint/test1.txt
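The two variants above differ only in the bucket argument, so they could be wrapped in a small helper. This is a minimal sketch, not something shipped with lakeFS or s3fs: the function name lakefs_s3fs_cmd and its argument order are made up for illustration, and it only prints the s3fs command so it can be reviewed before being run with sudo.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of lakeFS or s3fs): prints the s3fs command
# for mounting a whole lakeFS repository, or a single branch of it.
# Usage: lakefs_s3fs_cmd <endpoint url> <mountpoint> <repo> [branch]
lakefs_s3fs_cmd() {
  local endpoint="$1" mountpoint="$2" repo="$3" branch="${4:-}"
  local bucket="$repo"
  # s3fs accepts "bucket:/path" to mount only a sub-path of the bucket;
  # in the commands above that sub-path is the lakeFS branch name.
  if [ -n "$branch" ]; then
    bucket="${repo}:/${branch}"
  fi
  # Same options as in the commands above; the credentials file is assumed
  # to be ${HOME}/.passwd-s3fs. Values containing spaces would need quoting.
  printf 's3fs %s %s -o passwd_file=%s -o url=%s -o use_path_request_style\n' \
    "$bucket" "$mountpoint" "${HOME}/.passwd-s3fs" "$endpoint"
}
```

Calling it with three arguments prints the whole-repository command, and adding a fourth argument (the branch name) prints the single-branch one. Since both variants use the same mountpoint, unmount one before mounting the other, for example with sudo umount /path/to/mountpoint.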
Ants Young
01/21/2023, 7:24 AM