Tal Sofer
07/31/2022, 9:09 AMGuy Hardonag
08/03/2022, 7:13 AMAriel Shaqed (Scolnicov)
08/03/2022, 8:58 AMAriel Shaqed (Scolnicov)
08/07/2022, 11:03 AMulimit -n
. Now my hive-metastore container fails to start; AFAICT the issue is
hive | MetaException(message:Required table missing : "`DBS`" in Catalog "" Schema "". DataNucleus requires this table to perform its persistence operations. Either your MetaData is incorrect, or you need to enable "datanucleus.schema.autoCreateTables")
(full logs are longer). It looks like Hive needs some startup script (or to set that value). Has anyone run into this?Niro
08/08/2022, 2:13 PMEntry
struct per write:
ent := &Entry{
Address: entry.PhysicalAddress,
AddressType: addressTypeToProto(entry.AddressType),
Metadata: entry.Metadata,
LastModified: timestamppb.New(entry.CreationDate),
ETag: entry.Checksum,
Size: entry.Size,
ContentType: ContentTypeOrDefault(entry.ContentType),
}
Creating a gross calculation taking into account field limits:
Address - according to AWS guidelines does not exceed -- 1024 bytes
AddressType - int32 -- 4 bytes
Metadata - according to AWS limited to 2KB user data -- 2048 bytes
LastModified - int64 -- 8bytes
Etag - AWS limitation -- 1024 bytes
Size - int64 -- 8 bytes
ContentType - Lets use the worst case scenario -- 1024 bytes
Summing this up we get over 5000 bytes which is far from the given estimation, and this is without taking into consideration other data which is saved such as entry key and checksum
Am I missing something here?? (Keep in mind that these are general approximation - not trying to do exact math here but rather get a sense of the size)Ariel Shaqed (Scolnicov)
08/11/2022, 12:59 PMGuy Hardonag
08/11/2022, 1:55 PMFix DB serialization error during multiple writes to the same key (#3862)
Any objections?Niro
08/11/2022, 3:35 PMAriel Shaqed (Scolnicov)
08/24/2022, 11:40 AMโฏ mcs class-search -f org.apache.hadoop.fs.FileSystem
Searching for artifacts containing org.apache.hadoop.fs.FileSystem...
Found 743 results (showing 20)
Coordinates Last updated
=========== ============
org.scray:scray-hdfs-writer:1.1.1 13 Sep 2018 at 10:08 (IDT)
org.scray:scray-hdfs-writer:1.1.0 12 Sep 2018 at 11:45 (IDT)
org.apache.hadoop:hadoop-common-instrumented:0.22.0 10 Dec 2011 at 04:33 (IST)
org.apache.hadoop:hadoop-common:0.22.0 10 Dec 2011 at 04:33 (IST)
org.jvnet.hudson.hadoop:hadoop-core:0.19.1-hudson-3 03 Sep 2009 at 00:41 (IDT)
org.apache.mahout.hadoop:hadoop-core:0.19.1 03 Apr 2009 at 09:53 (IDT)
org.jvnet.hudson.hadoop:hadoop-core:0.19.1-hudson-2 16 Mar 2009 at 00:54 (IST)
org.jvnet.hudson.hadoop:hadoop-core:0.19.1-hudson-1 15 Mar 2009 at 22:40 (IST)
org.jvnet.hudson.hadoop:hadoop-core:0.19.1 12 Mar 2009 at 07:15 (IST)
org.jvnet.hudson.hadoop:hadoop-core:0.19.0 11 Mar 2009 at 08:02 (IST)
com.tencent.bk.base.datahub:hadoop-common:2.7.3-bkbase.1 26 Oct 2021 at 06:51 (IDT)
org.apache.hadoop:hadoop-common:0.23.11 19 Jun 2014 at 17:17 (IDT)
org.apache.hadoop:hadoop-common:0.23.10 03 Dec 2013 at 07:46 (IST)
org.apache.hadoop:hadoop-common:0.23.9 01 Jul 2013 at 17:45 (IDT)
org.apache.hadoop:hadoop-common:0.23.8 28 May 2013 at 18:27 (IDT)
org.apache.hadoop:hadoop-common:0.23.7 11 Apr 2013 at 21:26 (IDT)
com.google.code.maven-play-plugin.org.apache.hadoop:hadoop-core:0.20.2-with-200-826 11 Mar 2013 at 22:15 (IST)
org.apache.hadoop:hadoop-common:0.23.6 29 Jan 2013 at 05:53 (IST)
org.apache.servicemix.bundles:org.apache.servicemix.bundles.hadoop-core:0.20.203.0_3 21 Jan 2013 at 04:37 (IST)
org.apache.hadoop:hadoop-common:0.23.5 20 Nov 2012 at 20:29 (IST)
Ariel Shaqed (Scolnicov)
08/25/2022, 1:44 PMOz Katz
Lynn Rozen
08/29/2022, 7:51 AMBarak Amar
Oz Katz
Ariel Shaqed (Scolnicov)
08/31/2022, 7:16 AMSlackbot
08/31/2022, 11:31 AMNiro
09/04/2022, 5:58 AMNiro
09/04/2022, 6:49 AMAriel Shaqed (Scolnicov)
09/04/2022, 9:08 AMNo linked issues found. Please add the corresponding issues in the pull request description.message we get on GitHub? Just open another issue? (@Tal Sofer, can you advise?)
Or Tzabary
09/05/2022, 1:10 PMencodeURIComponent
or encodeURI
doesn't help as % is used as part of the encoded value.
Another approach is to encode using base64 which is URL-safe.
I believe the right approach would be to disallow some characters for user/group/policy names as we use them in the URL and to avoid other potential bugs.
Comparing to AWS IAM's restrictions, they allow only specific characters to be used in the (alphanumeric and '+=,.@-_' characters.)
I think that restricting to these characters is a the right approach to take.
this means it'll be a breaking change in case someone used % in previous versions, but to be honest, the UX was already broken with entries with % in its name.
Thoughts?
@Elad Lachmi thanks for the helpAriel Shaqed (Scolnicov)
09/06/2022, 10:40 AM<UUID>
we will use something like <TS_ISO_8601>-<RANDOM>
. Now the files .../retention/gc/commits/run_id.../commits.csv
will list in time-sorted order. So no need for KV, we can simply list these objects in S3.
Unfortunately, it is still hard to go backwards in such a list. And that's actually the preferred mode: "find the last run_id from at least 2 weeks ago". So... I'd really like to name them <REVERSE_8601_ISH>-<TS_ISO_8601>-<RANDOM
, where RANDOM_8601_ISH is an ISO 8601-like printable representation of timestamp that sorts in reverse order, and we include TS_ISO_8601 merely for readability. Now we just need an encoding ๐
Where I've gotten so far: use a 9s-complement ASCII representation. An even less-readable representation would use e.g. 10000000000 - EPOCH
(right now would be 8337539453
(== 10000000000-1662460547), and 5 seconds later it would be 8337539448
. A more-readable representation might use <10000-YYYY>-<100-mm>-<100-DD>N<100-HH>:<100-MM>:<100-SS>
, and let you decode in your head with a little effort.
Is there any more standard way to do this?Niro
09/07/2022, 7:54 AMAriel Shaqed (Scolnicov)
09/07/2022, 2:52 PMtaylor schneider
09/09/2022, 12:03 AMNiro
09/11/2022, 8:30 AMAriel Shaqed (Scolnicov)
09/11/2022, 3:09 PM@transient lazy val notSerializable = computeFromSerializables(param1, param2)
๐คท๐ผAriel Shaqed (Scolnicov)
09/13/2022, 8:53 AMBarak Amar
Amit Kesarwani
09/14/2022, 12:51 AMlakectl fs rm
command already has this functionality but Python API doesnโt.Vaibhav Kumar
09/14/2022, 6:56 PM