I wrote about Data Lakes and Lakehouse, how to build a Data Lake on top of 1. Storage Layer (AWS S3, Azure Blob Storage, Google Cloud Storage) 2. File Format (Apache Parquet, Avro, ORC) and 3. Table Format (Delta Lake, Apache Hudi, and Iceberg). The next thing would be to add LakeFS for combining Time Travel with additional git-like features. I thought maybe it would be interesting for this audience. If you have any questions, please let me know; happy to chat🙂.
08/27/2022, 6:51 AM
It's a great introduction! Thanks for sharing @sspaeti
08/27/2022, 9:31 AM
Thank you Einat. I also linked to your awesome blog post 🙂 .