Anandkarthick Krishnakumar
09/06/2022, 4:44 AMsymlink
thanksItai Admi
09/06/2022, 5:08 AMlakectl
provides several metastore commands to allow your metadata (managed by the metastore) to have the same ‘versions’ as your data (managed by lakeFS). To create your first table, you would have to use the metastore’s CREATE TABLE
commands (e.g. Hive) - not a lakectl
command. I think that lakeFS flow with metastore is summed up nicely in this doc. Let me know if your use-case isn’t covered there..Anandkarthick Krishnakumar
09/06/2022, 5:40 AMItai Admi
09/06/2022, 5:53 AMAnandkarthick Krishnakumar
09/06/2022, 5:57 AMItai Admi
09/06/2022, 6:00 AMAnandkarthick Krishnakumar
09/06/2022, 6:05 AMItai Admi
09/06/2022, 6:24 AM<s3a://my-repo/main/path/to/data>
. Your question depends on the application you choose to process the data. For Athena, you need symlinks. For Spark, you can simply change the s3a.endpoint
.Anandkarthick Krishnakumar
09/06/2022, 6:42 AMItai Admi
09/06/2022, 6:50 AM<lakefs://repo/branch/foo/bar>
to <s3://storagenamespace/uuid1>
). For Athena to be able to query lakeFS data it needs to get the paths from lakeFS. That’s where the lakectl metastore create-symlinks
command helps.create-symlinks
command does behind the scenes.Anandkarthick Krishnakumar
09/06/2022, 7:14 AMmust specify 1-based parent number for reverting merge commit
I'm following documentation hereItai Admi
09/06/2022, 7:17 AM-m, --parent-number int the parent number (starting from 1) of the mainline. The revert will reverse the change relative to the specified parent.
Anandkarthick Krishnakumar
09/06/2022, 7:19 AM