Daft Engineering Blog

Adversarial file reading: from 10,000 small CSVs to massive Parquet files
Kevin Wang
Mar 6, 2024
How Daft optimizes reading real-world data, which is often a mix of "many small files" and "few large files"

© 2025 Sammy Sidhu