https://lakefs.io/ logo
Docs
Join the conversationJoin Slack
Channels
announcements
blockers_for_windward
career-opportunities
celebrations
cuddle-corner
data-discussion
data-events
dev
events
general
help
iceberg-integration
lakefs-for-beginners
lakefs-hubspot-cloud-registration-email-automation
lakefs-releases
lakefs-suggestions
lakefs-twitter
linen-dev
memes-and-banter
new-channel
new-channel
say-hello
stackoverflow
test
Powered by Linen
memes-and-banter
  • k

    Karen

    05/21/2021, 5:13 AM
    Hi everyone! Share your exciting plan for the upcoming weekend! I'm going to tackle a new Facebook community certification. You'll see if I'll manage it by Monday - if yes, then I'll post the proof to #achievements. Haha, now it will be difficult for me to procrastinate on this 😄 And what's your plan?
    1 reply · 1 participant
  • k

    Karen

    06/01/2021, 9:01 PM
    Hello lake people! Let's run a quick challenge! Post here any water related music theme that you like! 3-2-1-Go!
    p
    y
    +2
    7 replies · 5 participants
  • m

    Matias Rebolledo Dezerega

    06/30/2021, 2:35 PM
    Hey everyone, I know this is off topic, but i'm looking out for a book called big book of dashboards. If anyone has it, can you lend it to me? plssss. thnks and love!
    k
    2 replies · 2 participants
  • o

    Oz Katz

    07/05/2021, 12:48 PM
    https://www.theregister.com/2021/07/05/infinidash/ (Of course, versioning your AWS Infinidash workloads using lakeFS is fully supported out of the box)
    e
    a
    2 replies · 3 participants
  • p

    Paul Singman

    01/25/2022, 2:45 PM
    interesting discussion on the value of open source today vs in the past https://twitter.com/bernhardsson/status/1485793044708405251
    👀 3
    a
    1 reply · 2 participants
  • s

    Shradheya Thakre

    02/09/2022, 10:47 AM
    Wanted to share this program here which encourages OSS and something lakefs could probably benefit from by getting some strong young contributors 🙂 https://summerofcode.withgoogle.com/
    💪 1
    💪🏻 1
    a
    2 replies · 2 participants
  • o

    Oz Katz

    02/17/2022, 7:32 PM
    https://twitter.com/xkcd/status/1494179003816677381?s=21
    😂 4
    b
    a
    2 replies · 3 participants
  • b

    Barak Amar

    03/08/2022, 1:50 PM
    Part of the things I got from hacktoberfest, need something like that for data engineers :)
    👀 1
    ✅ 1
    a
    j
    6 replies · 3 participants
  • a

    Adi Polak

    03/09/2022, 9:51 AM
    Awesomeness. Go News 📰 shares lakeFS project 😒unglasses_lakefs: https://twitter.com/golang_news/status/1501469805060280325?s=20&t=4GBLpw0UGHVNAfJxnq4vQQ
    💥 3
    :jumping-lakefs: 3
    a
    1 reply · 2 participants
  • j

    jj t

    04/02/2022, 10:18 AM
    hello I had read the blog 3 Strategies For Late-Arriving Data .But I can't understand repeatable queries. What is repeatable queries?
    p
    2 replies · 2 participants
  • a

    Adi Polak

    04/02/2022, 8:02 PM
    The Axolotls and I are getting ready for QCon London 🇬🇧
    🇬🇧 6
    🙌 1
    :heart_lakefs: 4
    :jumping-lakefs: 7
    n
    1 reply · 2 participants
  • y

    Yusuf K

    04/07/2022, 5:26 PM
    I've searched Lakefs on google so many times to pull up the docs etc. and Google has corrected it to the "Lakers" so many times that now my google news feed shows sports updates and live scores for the Lakers. I don't even watch basketball haha
    😂 6
    :dancing_lakefs: 1
    :jumping-lakefs: 3
    🏀 1
    j
    i
    2 replies · 3 participants
  • o

    Oz Katz

    04/09/2022, 9:42 AM
    https://cloud.google.com/blog/products/data-analytics/unifying-data-lakes-and-data-warehouses-across-clouds-with-biglake Perhaps paving the way to a future BigQuery / lakeFS integration? 🫢
    ❤️ 1
    b
    a
    +1
    3 replies · 4 participants
  • o

    Oz Katz

    04/10/2022, 7:26 AM
    while we're over here, in the present, supporting the S3 API and storage backend, the future is already here! S4 is a *super *simple storage service 😄
    😂 6
    j
    2 replies · 2 participants
  • g

    Giorgio Zoppi

    04/12/2022, 8:45 PM
    A la dremio?
    y
    6 replies · 2 participants
  • i

    Iddo Avneri

    04/14/2022, 8:39 PM
    I experienced very interesting ad targeting today: https://www.ivysaxolotls.com/collections/buy-live-axolotls
    🤦‍♀️ 2
    😲 1
    a
    1 reply · 2 participants
  • a

    Ariel Shaqed (Scolnicov)

    05/02/2022, 7:16 AM
    @Barak Amar, I think you just won the "biggest lakeFS committer 2022" award 🥇 🏆 !
    Pull request merged by nopcoder#50 Bump nokogiri from 1.11.5 to 1.13.4 in /v0.48
    treeverse/docs-lakeFS | Today at 01:43
    However github.com will soon be complaining about DoS...
    😁 1
    b
    1 reply · 2 participants
  • i

    Iddo Avneri

    05/09/2022, 7:32 PM
    My Monitor started Flickering 😞 Any suggestions for one that is generally healthy for the eyes? :)
    r
    1 reply · 2 participants
  • m

    Michal Wosk

    05/13/2022, 11:16 AM
    Watched “Everything Everywhere All at once” yesterday and could not hold back from re-reading this great piece: https://lakefs.io/the-everything-bagel-ii-versioned-data-lake-tables-with-lakefs-and-trino/. Great movie btw 🙂

    https://www.youtube.com/watch?v=wxN1T1uxQ2g▾

    😄 2
    🥯 2
    p
    r
    4 replies · 3 participants
  • i

    Itai Admi

    05/18/2022, 6:47 AM
    I’ve managed to review a negative number of files 💪
    😂 2
    ➖ 2
    👏 2
    a
    1 reply · 2 participants
  • o

    Oz Katz

    05/18/2022, 12:17 PM
    Have you ever wondered whether git ignores
    .gitignore
    if you add it to
    .gitignore
    ? well now you know: https://rubenerd.com/git-ignores-gitignore-with-gitignore-in-gitignore/
    🤣 4
    i
    1 reply · 2 participants
  • a

    Ariel Shaqed (Scolnicov)

    05/22/2022, 1:23 PM
    For @Oz Katz and for our Windows users: there's a new version of the clone of the standard DOS text editor! Simpler than ed, loads faster that VSCode.
    👀 1
    b
    1 reply · 2 participants
  • c

    carlos osuna

    06/07/2022, 11:45 PM
    This has been bugging me: what does the FS stand for?
    e
    2 replies · 2 participants
  • a

    Ariel Shaqed (Scolnicov)

    06/19/2022, 7:15 AM
    Citus is a distributed PostgreSQL setup (and more). They're releasing the whole thing as OSS now. Apart from being good news, it's also interesting to read their words on why many data products are better when they're open source. https://www.citusdata.com/blog/2022/06/17/citus-11-goes-fully-open-source/
    👀 1
    🙌 1
    e
    1 reply · 2 participants
  • o

    Oz Katz

    07/09/2022, 5:34 AM
    Fun game for the data savvy (by the folks at Rockset): https://sqlordle.rockset.com/
    :dancing_lakefs: 4
    a
    6 replies · 2 participants
  • i

    Idan Novogroder

    07/12/2022, 1:03 PM

    https://www.youtube.com/watch?v=kOO31qFmi9A▾

    #technology
    a
    n
    2 replies · 3 participants
  • p

    Paul Singman

    07/21/2022, 1:32 PM
    No offense to Databricks, my preferred type of Lakehouse. In Brookfield, NH:
    📸 2
    🤩 5
    :jumping-lakefs: 1
    ⛵ 4
    a
    1 reply · 2 participants
  • a

    Adi Polak

    07/22/2022, 3:00 PM
    please no https://twitter.com/ismonkeyuser/status/1550361319513243648
    🤣 1
  • a

    Adi Polak

    07/28/2022, 7:07 AM
    what is your favorite movie?
  • a

    Adi Polak

    08/04/2022, 7:21 AM
    Interesting article that discusses the data lifecycle management challenges in BioPharma. here are some quotes:
    " Recent years have seen the rise of “483s,” FDA regulatory warning letters, with data integrity violations accounting for most of the notices. In 2019, almost half (47%) of all warning letters issued by FDA concerned data integrity. By the end of 2021, that number had increased to 65% (1). "
    Larger organizations are also seeing a resurge in investments in building centralized data repositories, such as data lakes, to help drive their digitization initiatives. A core business objective of many of these data lakes is to break down data silos to create centralized repository of data for end users that is easily accessible, coherent, and complete. However, automating the integration of such varied systems, whilst ensuring data integrity and regulatory compliance, remains a significant industry challenge (2).
    Another one:
    A less appreciated and more nuanced data integrity issue is data contextualization. Even if an operator can extract data from a specific system (e.g., chromatography), the data may be of little use without combining it with data stored in other systems, such as the experimental conditions under which the sample was generated.
    The world of Parma has been working with spreadsheets (and spreadsheet likes tools) for many, many years; It's interesting to see the author speaks of embracing new technology and building a data lake!
    👀 3
Powered by Linen
Title
a

Adi Polak

08/04/2022, 7:21 AM
Interesting article that discusses the data lifecycle management challenges in BioPharma. here are some quotes:
" Recent years have seen the rise of “483s,” FDA regulatory warning letters, with data integrity violations accounting for most of the notices. In 2019, almost half (47%) of all warning letters issued by FDA concerned data integrity. By the end of 2021, that number had increased to 65% (1). "
Larger organizations are also seeing a resurge in investments in building centralized data repositories, such as data lakes, to help drive their digitization initiatives. A core business objective of many of these data lakes is to break down data silos to create centralized repository of data for end users that is easily accessible, coherent, and complete. However, automating the integration of such varied systems, whilst ensuring data integrity and regulatory compliance, remains a significant industry challenge (2).
Another one:
A less appreciated and more nuanced data integrity issue is data contextualization. Even if an operator can extract data from a specific system (e.g., chromatography), the data may be of little use without combining it with data stored in other systems, such as the experimental conditions under which the sample was generated.
The world of Parma has been working with spreadsheets (and spreadsheet likes tools) for many, many years; It's interesting to see the author speaks of embracing new technology and building a data lake!
👀 3
View count: 2