Oz Katz
11/11/2021, 8:35 PM
Itai David
11/14/2021, 6:17 AM
Yoni Augarten
11/15/2021, 10:41 AM
`hadoop fs -cp` doesn't copy to a non-existent path (on either HDFS or S3A). Instead, use `-mkdir -p` to create the path, and then perform the copy.
Yoni Augarten
11/17/2021, 1:21 PM
Uttam
11/19/2021, 7:16 AM
`.diff` files then I landed on something called Mercurial. Is this software decent to use, or should I use something else? I'm willing to contribute to `webui`; is it mandatory to download `go`?
Tal Sofer
11/21/2021, 9:43 AM
`docker compose --profile client run --rm hive-client` to enable the hive-client.
Now, I would like to create my first table in the metastore. How do I use the hive-client to do that?
@Guy Hardonag you can probably point me in the right direction.
Ariel Shaqed (Scolnicov)
11/22/2021, 11:59 AM
mishraprafful
11/22/2021, 1:49 PM
Itai Admi
11/24/2021, 1:53 PM
Ori Adijes
11/25/2021, 12:45 PM
Yoni Augarten
11/30/2021, 10:53 AM
`examples/` directory) and my notebook is exported in HTML. Where in the project can/should I put this HTML?
Ori Adijes
12/01/2021, 9:22 AM
Ariel Shaqed (Scolnicov)
12/04/2021, 7:12 PM
Ariel Shaqed (Scolnicov)
12/10/2021, 7:36 PM
Ariel Shaqed (Scolnicov)
12/11/2021, 8:34 AM
`to-be-deleted-main` from the lakeFS repository, which used to be called "`main`". Nobody's ever used it, and it contained some filename case issues. The branch tip was `8ff9846ab6e9aacdea56606ff9dde135c42e4f83`. I am purposefully not generating a tag there.
To be clear: the lakeFS trunk is named "`master`".
WHY? This branch was an aborted attempt to rename `master` (the lakeFS repo was opened back when this horribly incorrect name was the default). This particular renaming was broken because of some uppercase/lowercase renamings. And those cause issues on a case-insensitive filesystem -- which macOS helpfully provides by default. So it used to be that after `git checkout main` on a macOS box you'd need to seek expert help to recover your local repo on that box...
WHY does trunk remain `master`? It is an incorrect name with horrible connotations (and not one used historically). However, renaming trunk requires a simultaneous (not concurrent!) change: all developers must rename `master` -> `main` on the same commit, all actions must change to accept the new name, and other unknown dependencies will break. (I would appreciate pointers to articles from any open-source projects about good practices for doing and testing this!)
Ariel Shaqed (Scolnicov)
12/19/2021, 1:51 PM
`java.lang.NoSuchMethodException: com.databricks.spark.metrics.FileSystemWithMetrics.getAmazonS3Client()` when writing using HadoopFS (`io.lakefs:hadoop-lakefs-assembly:0.1.4`). Has anyone seen this?
Oz Katz
12/21/2021, 11:13 AM
Tal Sofer
12/23/2021, 7:39 AM
Itai Admi
01/13/2022, 3:58 PM
Ariel Shaqed (Scolnicov)
01/14/2022, 3:43 PM
Ariel Shaqed (Scolnicov)
01/17/2022, 7:01 AM
`kubectl patch ingress myingress --type json --patch-file=/tmp/patch.json` where `/tmp/patch.json` is:
[
  {
    "op": "add",
    "path": "/spec/rules/-",
    "value": {
      "host": "another.example.com",
      "http": {
        "paths": [
          {
            "backend": {
              "service": {
                "name": "another-svc",
                "port": {
                  "number": 5678
                }
              }
            },
            "path": "/another",
            "pathType": "Prefix"
          }
        ]
      }
    }
  }
]
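To make the effect of the `/spec/rules/-` path concrete: in JSON Patch, a trailing `-` on an `add` means "append to this array". Below is a minimal Go sketch of just that one case, run against an in-memory document. The names `appendAt` and `rulesAfterPatch` are made up for illustration, and the sketch skips most of RFC 6902 (no `~0`/`~1` unescaping, no array indices inside the path); it is not how `kubectl` implements patching.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// appendAt handles only a JSON Patch "add" whose path ends in "/-":
// it appends value to the array found at the parent path.
// Hypothetical helper covering a tiny subset of RFC 6902.
func appendAt(doc map[string]any, path string, value any) error {
	tokens := strings.Split(strings.TrimPrefix(path, "/"), "/")
	if len(tokens) < 2 || tokens[len(tokens)-1] != "-" {
		return fmt.Errorf("unsupported path %q", path)
	}
	// Walk down through nested objects to the one holding the array.
	parent := doc
	for _, tok := range tokens[:len(tokens)-2] {
		next, ok := parent[tok].(map[string]any)
		if !ok {
			return fmt.Errorf("no object at %q", tok)
		}
		parent = next
	}
	key := tokens[len(tokens)-2]
	arr, ok := parent[key].([]any)
	if !ok {
		return fmt.Errorf("no array at %q", key)
	}
	parent[key] = append(arr, value)
	return nil
}

// rulesAfterPatch appends ruleJSON to spec.rules of ingressJSON and
// reports how many rules the document has afterwards.
func rulesAfterPatch(ingressJSON, ruleJSON string) (int, error) {
	var doc map[string]any
	if err := json.Unmarshal([]byte(ingressJSON), &doc); err != nil {
		return 0, err
	}
	var rule any
	if err := json.Unmarshal([]byte(ruleJSON), &rule); err != nil {
		return 0, err
	}
	if err := appendAt(doc, "/spec/rules/-", rule); err != nil {
		return 0, err
	}
	spec, _ := doc["spec"].(map[string]any)
	rules, _ := spec["rules"].([]any)
	return len(rules), nil
}

func main() {
	n, err := rulesAfterPatch(
		`{"spec":{"rules":[{"host":"example.com"}]}}`,
		`{"host":"another.example.com"}`,
	)
	fmt.Println(n, err) // the existing rule is kept and the new one is appended
}
```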
A (more) readable introduction to JSON Patch is http://jsonpatch.com/; there's also https://datatracker.ietf.org/doc/html/rfc6902, of course.
Guy Hardonag
01/19/2022, 8:31 AM
`time` of the commit.
git does support it and I believe we should too.
Checking here to see if anyone has any objections.
Ariel Shaqed (Scolnicov)
01/21/2022, 7:34 AM
`NextRange` (and even `SeekGE`)? Basically, have another channel from the wrapping iterator to the base iterator. Every time the wrapping iterator needs to change location, it drops the old channel (@Barak Amar do we need to drain a channel, or can it just be garbage-collected while full?), creates a new channel in its stead, and sends a command `<"NextRange", ptrToNewChannel>` to the base iterator. The base iterator now `select`s on being able to write on its output or read from its input; if it gets a command on the input channel it can implement it.
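A rough Go sketch of the scheme described above, assuming a sorted `[]int` stands in for the real ranges. All names here (`command`, `produce`, `iterator`, `newIterator`) are invented for illustration and are not lakeFS code. Only a `SeekGE`-style command is shown; `NextRange` would be one more command kind. The old output channel is dropped without draining and left for the garbage collector.

```go
package main

import (
	"fmt"
	"sort"
)

// command asks the base iterator to jump to the first value >= seekGE
// and to send everything from there on the fresh channel out.
type command struct {
	seekGE int
	out    chan int
}

// produce plays the base iterator: it prefetches into the current output
// channel while also listening for commands from the wrapping iterator.
func produce(data []int, cmds <-chan command) {
	var out chan int // nil until the first command; a send on a nil channel never proceeds
	pos := 0
	for {
		if out != nil && pos >= len(data) {
			close(out) // exhausted: tell the consumer, then wait for commands only
			out = nil
		}
		var next int
		if out != nil {
			next = data[pos]
		}
		select {
		case cmd, ok := <-cmds:
			if !ok {
				return // the wrapping iterator was closed
			}
			out = cmd.out // the old channel is abandoned, never drained
			pos = sort.SearchInts(data, cmd.seekGE)
		case out <- next:
			pos++
		}
	}
}

// iterator plays the wrapping iterator.
type iterator struct {
	cmds     chan command
	out      chan int
	prefetch int
}

func newIterator(data []int, prefetch, start int) *iterator {
	it := &iterator{cmds: make(chan command), prefetch: prefetch}
	go produce(data, it.cmds)
	it.SeekGE(start)
	return it
}

// SeekGE replaces the output channel and tells the producer about it.
// cmds is unbuffered, so once the send returns the producer has taken the command.
func (it *iterator) SeekGE(v int) {
	it.out = make(chan int, it.prefetch)
	it.cmds <- command{seekGE: v, out: it.out}
}

// Next returns the next value, or false once the data is exhausted.
func (it *iterator) Next() (int, bool) {
	v, ok := <-it.out
	return v, ok
}

func (it *iterator) Close() { close(it.cmds) }

func main() {
	it := newIterator([]int{1, 3, 5, 7, 9}, 2, 0)
	defer it.Close()
	v, _ := it.Next()
	fmt.Println(v) // 1
	it.SeekGE(6)   // values prefetched on the old channel (3, 5, ...) are discarded
	for v, ok := it.Next(); ok; v, ok = it.Next() {
		fmt.Println(v) // 7, then 9
	}
}
```

Because the producer always selects on the command channel as well as its output, it never gets stuck writing to a channel that nobody reads any more.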
Not perfect, but a cheap way to prefetch.
Ariel Shaqed (Scolnicov)
01/23/2022, 8:14 AM
https://youtu.be/ZeE-JxMZDLk
Tal Sofer
01/30/2022, 11:42 AM
mishraprafful
02/01/2022, 9:59 PM
mishraprafful
02/02/2022, 3:23 PM
`First-time contributors need a maintainer to approve running workflows. Learn more.` but I am not a first-time contributor. Does every PR need to be approved by a maintainer before its workflows run?
Ref: https://github.com/treeverse/lakeFS/pull/2902
Adi Polak
02/07/2022, 8:56 AM
`go get github.com/rakyll/statik`
returned the following message:
go get: installing executables with 'go get' in module mode is deprecated.
To adjust and download dependencies of the current module, use 'go get -d'.
To install using requirements of the current module, use 'go install'.
To install ignoring the current module, use 'go install' with a version,
like 'go install example.com/cmd@latest'.
For more information, see https://golang.org/doc/go-get-install-deprecation
or run 'go help get' or 'go help install'.
Which one should I use to install statik? `go get -d` or `go install`?
Itai David
02/07/2022, 2:40 PM
`\`) in Windows-style paths to Unix-style forward slashes (`/`) to match S3 behavior.
Our main concern is the effect it might have on users currently using Windows-style paths and the effect it might have on their data.
The original message is attached hereafter. I'm attaching all responses as comments.
All further comments are much appreciated.
Thanks
Hi.
I would love to hear some opinions and some of your concerns regarding an issue I'm working on - BUG: Upload directory path not parsed correctly #2880
TL;DR - we are not treating Windows' path delimiter - the backslash - as such.
Upon uploading a file from Windows, the original path, e.g. `C:\some\path\to\file`, is kept and used as a key.
Later, when the file is returned to the client(s) in a response to ListObjects, we do not parse the key as a path correctly.
While `lakectl` shows the key as a full path either way, the web UI fails to create the nice (and expected) tree display, as described in the bug.
On @Tal Sofer's advice, I looked at S3 behaviour and found out that AWS actually converts Windows-style paths to Unix style. Namely, all backslashes are turned into forward slashes.
That's it for the problem description, now for possible solutions:
Option 1 - change lakeFS behaviour to match S3 - this means we actively change backslashes in paths to forward slashes.
I can think of 2 problems here:
• Users who use Windows clients and have their data in place (let's say they do not use the web UI and are not bothered by the path parsing difference) will be affected, as new files will be 'named' differently.
• Not sure this is a real issue, but Unix allows backslashes as part of a valid file name. If we change these to forward slashes it will affect the file path in lakeFS.
Option 2 - Store the file path/name/key the same way we do today, but treat both backslash and forward slash as path delimiters.
This will solve the web UI problem and will not affect the way we keep data, so existing user data will not be affected.
However, this is handling the symptom rather than the problem, and I'm not sure where else this problem is going to pop up. Moreover, it does not align with our S3-like approach, so I'm not sure this is even an option.
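For comparison, the two options might look roughly like this in Go. This is only a sketch: `normalizeKey` and `splitKey` are hypothetical names, not existing lakeFS functions, and where such code would live (upload path, listing, or web UI) is left open.

```go
package main

import (
	"fmt"
	"strings"
)

// normalizeKey sketches Option 1: rewrite every backslash to a forward
// slash before the path is stored as a key.
func normalizeKey(p string) string {
	return strings.ReplaceAll(p, `\`, "/")
}

// splitKey sketches Option 2: leave the stored key untouched and treat both
// characters as delimiters only when breaking it into path segments.
// Note that FieldsFunc drops empty segments (leading or doubled delimiters).
func splitKey(key string) []string {
	return strings.FieldsFunc(key, func(r rune) bool {
		return r == '/' || r == '\\'
	})
}

func main() {
	key := `C:\some\path\to\file`
	fmt.Println(normalizeKey(key)) // C:/some/path/to/file
	fmt.Println(splitKey(key))     // [C: some path to file]
}
```

With Option 1 the stored key itself changes, which is where the concern about existing Windows users comes from; with Option 2 the key stays `C:\some\path\to\file` and only the display logic changes.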
Any advice, opinion or other considerations I failed to mention?
All comments will be greatly appreciated 🙏
Barak Amar
02/16/2022, 10:16 AM
Barak Amar
02/16/2022, 10:16 AM
Or Tzabary
02/16/2022, 10:22 AM
`buildx` is redundant
Barak Amar
02/16/2022, 10:24 AM