Jennifer Cristina Evangelista Da Silva
05/03/2022, 7:22 PMimport io.treeverse.clients.LakeFSContext
val commitID = "a1b2c3d4"
val df = LakeFSContext.newDF(spark, "example-repo", commitID)
Or Tzabary
05/03/2022, 7:29 PMdf.createOrReplaceTempView("files")
spark.sql("SELECT DATE(last_modified), COUNT(*) FROM files GROUP BY 1 ORDER BY 1")
/* output example:
+----------+--------+
| dt|count(1)|
+----------+--------+
|2021-03-05| 2|
|2021-03-07| 2|
+----------+--------+
*/
mind sharing what you're trying to do?