PagerDuty is screaming. Your phone won't stop. It's 3 AM.

Can You Fix It Before
Everything Burns?

Master production incidents through realistic simulations. Debug databases, wrestle Kubernetes, and save the day - without the 3 AM wake-up calls.

youbrokeprod.com/play/db-connection-pool-001
YouBrokeProd game interface showing a database connection pool incident with terminal, logs, and metrics
Try It Now - Free
1,247
🔥 incidents resolved today
98%
⚡ say it's addictive
2,400+
👥 engineers training
What engineers are saying

Engineers Can't Stop Playing

From junior devs to SRE directors - here's what the community is saying.

null_ptr_exception
@null_ptr_exception

just spent 2 hours on YouBrokeProd instead of sleeping. diagnosed a connection pool leak in 47 seconds. my team lead is going to love this

23441
Dataflow

"We replaced our entire incident response training program with YouBrokeProd. Our MTTR dropped 40% in 3 months."

Sarah Chen
VP Engineering, Dataflow
kubewhisperer
@kubewhisperer

the kubernetes crashloop scenario is TOO real. i got vietnam flashbacks from my last on-call shift. 10/10

891187
devops_sarah
@devops_sarah

finally a way to practice incident response without the 3am adrenaline. my juniors went from deer-in-headlights to confident in 2 weeks

45689
ScaleStack

"Our team treats the daily challenge like Wordle. The Slack channel is chaos every morning comparing times."

James Park
Director of SRE, ScaleStack
sre_marcus
@sre_marcus

showed this to my CTO and now the whole engineering org has accounts. the leaderboard is getting competitive

31264

Real Incidents. Real Skills. Zero Downtime.

🗄️

Database Nightmares

Connection pool exhaustion, replication lag, deadlocks, and that one query that's doing a full table scan.

☸️

Kubernetes Chaos

Pods crashlooping, OOMKilled, PVCs stuck in Pending, and networking that makes no sense.

🔒

Security Incidents

Credential leaks, suspicious traffic, rate limiting gone wrong, and that JWT that expired 6 months ago.

From Panic to Pro in 3 Steps

1

Get Paged

Pick an incident type and difficulty. You'll get realistic symptoms and access to logs, metrics, and debugging tools.

2

Debug & Diagnose

Use the terminal to run commands, check logs, and analyze metrics. Find the root cause before time runs out.

3

Fix & Flex

Apply the fix, earn points, and climb the leaderboard. Share your victory and challenge your team!

The Next Incident Won't Wait.
Will You Be Ready?

Join thousands of engineers who are leveling up their incident response skills - one simulated outage at a time.

Start Training Free →