Helios
Real-time observability platform ingesting 2.1M events/sec.
- Role
- Tech lead, end-to-end
- Year
- 2025
- Stack
- Rust / ClickHouse / React / WebGL
- Read
- Case study →

[ 01 — The Manifesto ]
I build tools for people who build tools. My work sits at the seam between systems that need to be fast and interfaces that need to be legible — places where the abstraction either holds or it doesn't, where the difference between a good primitive and a mediocre one compounds across every engineer who touches it.
The interesting problems are rarely about which framework. They're about where the boundaries go: what belongs in the kernel of a system and what belongs at its surface, what to make explicit and what to hide, when to pay for a cache and when accuracy matters more. I spend most of my time on those questions and the long, unhurried conversations they require.
After a decade shipping infrastructure that hundreds of engineers depend on daily, I've come to believe the best developer experience is indistinguishable from good taste — opinion, honesty, and a refusal to paper over the parts that should still feel hard.

Aimen Habis · Algiers — Spring 2026
[ 02 — Selected Work ]
Real-time observability platform ingesting 2.1M events/sec.

Distributed task scheduler in Rust with exactly-once semantics.

Internal developer platform for 200+ engineers across 12 teams.

Edge runtime with deterministic cold starts under 5ms.

[ 03 — Case Study / Helios ]
Helios replaces a legacy Prometheus + Grafana stack for a fintech processing 40 billion daily transactions. The legacy stack had p99 dashboard-load latency of 4.8 seconds. We brought it to 1.3s without reducing data fidelity, and the system now costs 60% less to run.
−0%
Dashboard p99 latency
0.0M ev/s
Ingestion throughput
−0%
Infra cost / month
−0%
On-call pages
Problem
The legacy observability stack had grown organically across three acquisitions. Metric cardinality crossed 18 million active series. Queries that should have taken milliseconds were timing out mid-incident, and the people on-call were getting paged because the tool they used to diagnose pages was itself failing.
“The fastest way to reduce incidents was to make the diagnostic tool trustworthy again.”
Architecture
We replaced the TSDB with ClickHouse fronted by a Rust ingest layer that performs online sketch aggregation (HLL, t-digest) and materializes the forty most-queried roll-ups at write time. Query cost became predictable. The frontend switched to a canvas-based renderer so 50k-series overlays remained interactive at 60fps.
pub async fn aggregate(batch: RecordBatch) -> Result<Materialized> {
let mut sketch = TDigest::with_compression(200);
let mut roll_ups: HashMap<DimKey, Counter> =
HashMap::with_capacity(batch.num_rows());
for row in batch.iter() {
sketch.add(row.latency_ms);
roll_ups
.entry(row.dimension_key())
.or_default()
.observe(row);
}
Ok(Materialized { sketch, roll_ups })
}Results
By week three of the partial migration, on-call engineers were self-routing investigations to Helios even for services not yet migrated. That informal shift was the real signal. The explicit metrics followed.
[ 04 — Experience ]
2022 — Present
Stripe
Led the rewrite of the internal telemetry pipeline serving 4,000 engineers.
2020 — 2022
Linear
Designed the offline sync protocol that kept p95 interaction latency under 16ms.
2018 — 2020
Independent
Shipped developer tooling for four YC-backed infrastructure startups.
2015 — 2018
Palantir
Built the query planner for a data platform handling petabyte-scale deployments.
[ 05 — Writing & Lab ]
Now Building
A streaming query planner for time-series joins in ClickHouse.
Now Reading
"A Philosophy of Software Design" — John Ousterhout.
Most observability tools optimize for the wrong viewer. A note on designing for the person being paged at 3am.
Why the best APIs feel inevitable, and how to recognize the feeling before you ship the wrong one.
Talk: how Verge achieves sub-5ms cold starts without sacrificing correctness guarantees.
A small experiment: visualizing every round-trip between a keystroke and a paint.
[ 06 — Contact ]
hello@aimencodexai.tech