• Oz Katz

    4 months ago
    Have you ever wondered whether git ignores .gitignore if you add it to .gitignore? Well, now you know: https://rubenerd.com/git-ignores-gitignore-with-gitignore-in-gitignore/
  • Ariel Shaqed (Scolnicov)

    4 months ago
    For @Oz Katz and for our Windows users: there's a new version of the clone of the standard DOS text editor! Simpler than ed, loads faster than VSCode.
  • carlos osuna

    3 months ago
    This has been bugging me: what does the FS stand for?
  • Ariel Shaqed (Scolnicov)

    3 months ago
    Citus is a distributed PostgreSQL setup (and more). They're releasing the whole thing as OSS now. Apart from being good news, it's also interesting to read their words on why many data products are better when they're open source. https://www.citusdata.com/blog/2022/06/17/citus-11-goes-fully-open-source/
  • Oz Katz

    2 months ago
    Fun game for the data savvy (by the folks at Rockset): https://sqlordle.rockset.com/
  • Idan Novogroder

    2 months ago

    https://www.youtube.com/watch?v=kOO31qFmi9A

    #technology
  • Paul Singman

    2 months ago
    No offense to Databricks, my preferred type of Lakehouse. In Brookfield, NH:
  • Adi Polak

    2 months ago
    what is your favorite movie?
  • Adi Polak

    1 month ago
    Interesting article that discusses the data lifecycle management challenges in BioPharma. Here are some quotes:
    "Recent years have seen the rise of “483s,” FDA regulatory warning letters, with data integrity violations accounting for most of the notices. In 2019, almost half (47%) of all warning letters issued by FDA concerned data integrity. By the end of 2021, that number had increased to 65% (1)."
    "Larger organizations are also seeing a resurge in investments in building centralized data repositories, such as data lakes, to help drive their digitization initiatives. A core business objective of many of these data lakes is to break down data silos to create centralized repository of data for end users that is easily accessible, coherent, and complete. However, automating the integration of such varied systems, whilst ensuring data integrity and regulatory compliance, remains a significant industry challenge (2)."
    Another one:
    "A less appreciated and more nuanced data integrity issue is data contextualization. Even if an operator can extract data from a specific system (e.g., chromatography), the data may be of little use without combining it with data stored in other systems, such as the experimental conditions under which the sample was generated."
    The world of Pharma has been working with spreadsheets (and spreadsheet-like tools) for many, many years; it's interesting to see the author speak of embracing new technology and building a data lake!