Working with the Apache Parquet file format
Introducing Daft: A High-Performance Distributed Dataframe Library for Multimodal Data