Production Data Workflows
Examples for file-parallel ETL, image processing, external service limits, and operational backfills.

ETL 10K S3 files to Postgres
Transform gzipped JSON files with idempotent inserts while protecting Postgres with max_parallelism.

Resize an image corpus
Chunk image keys, resize with Pillow, write outputs to S3, and stream a manifest.

Run a 2M-user API backfill
Backfill user profiles while keeping provider limits explicit in chunk size, sleeps, retries, and max_parallelism.
Last updated

