Hi, I'm Simone —

I build systems
that hold under
pressure.

Data engineer by day — I design pipelines, ML infrastructure, and distributed systems that run quietly in production. On the side, I build products out of curiosity (and occasionally ship them).

See my work More about me

Pipeline Throughput — 24hLIVE

15M10M5M0

00:0006:0012:0018:00now

12.1^Mevents / day↑ 4.2%

90^savg latency↓ 98%

99.9^%SLA uptime30d avg

00:00:00[kafka]batch 14.2k msgs/s processed

00:00:00[flink]checkpoint #4821 complete

00:00:00[feature]model_v14 features synced

00:00:00[airflow]dag etl_daily finished OK

00:00:00[kafka]consumer lag: 0ms

00:00:00[dbt]mart_users materialized

00:00:00[spark]job finished 3.2TB read

00:00:00[redis]cache hit ratio 94.1%

00:00:00[snowflake]query p99 2.1s

00:00:00[flink]watermark advanced +30s

00:00:00[kafka]batch 13.9k msgs/s processed

00:00:00[feature]online serving 1.2ms p50

00:00:00[airflow]dag ml_pipeline started

00:00:00[dbt]fact_events OK 8.4M rows

00:00:00[redis]eviction rate 0.01%

00:00:00[kafka]batch 14.2k msgs/s processed

00:00:00[flink]checkpoint #4821 complete

00:00:00[feature]model_v14 features synced

00:00:00[airflow]dag etl_daily finished OK

00:00:00[kafka]consumer lag: 0ms

00:00:00[dbt]mart_users materialized

00:00:00[spark]job finished 3.2TB read

00:00:00[redis]cache hit ratio 94.1%

00:00:00[snowflake]query p99 2.1s

00:00:00[flink]watermark advanced +30s

00:00:00[kafka]batch 13.9k msgs/s processed

00:00:00[feature]online serving 1.2ms p50

00:00:00[airflow]dag ml_pipeline started

00:00:00[dbt]fact_events OK 8.4M rows

00:00:00[redis]eviction rate 0.01%

Pipeline Stats — LiveLIVE

12.1^M

events/day

↑ 4.2%

90^s

latency

↓ 98%

99.9^%

uptime

30d avg

Data EngineeringStreaming PipelinesML InfrastructureCloud (AWS/GCP)Distributed SystemsPython · Spark · Airflowdbt · Kafka · Terraform

Selected work

Featured Projects

All projects

DataBackendDevOps

Real-Time Event Pipeline

Legacy batch pipeline couldn't handle peak traffic spikes, causing 4-6 hour data delays in downstream ML models.

→ Handled 12M+ events/day

→ Reduced end-to-end latency from 4h to 90s

Apache KafkaApache FlinkPython+2

View on GitHub

MLDataBackend

ML Feature Store

Data science teams were duplicating feature engineering logic across 8+ models, causing inconsistencies and wasted compute.

→ Reduced feature computation time by 63%

→ Unified 40+ features across 8 models

PythonFeastRedis+3

View on GitHub

DataDevOps

Data Warehouse Migration

Migrating a 10TB legacy Redshift warehouse to Snowflake with zero downtime and full historical parity.

→ Migrated 10TB+ data with zero data loss

→ Reduced query costs by 41%

dbtPythonSnowflake+3

View on GitHub

See all projects

Building in public

Apps & MVPs

Alongside the day job, I build micro-products — mostly data tooling and developer utilities. Some stay experiments; some ship.

Explore Apps

Thinking out loud

Writing

All posts

System design breakdowns, data modeling decisions, performance tuning, and lessons learned building infrastructure at scale. Coming soon.

Subscribe for updates

I build systemsthat hold underpressure.

Featured Projects

Real-Time Event Pipeline

ML Feature Store

Data Warehouse Migration

Apps & MVPs

Writing

I build systems
that hold under
pressure.