DEV Community

# reliability

General discussions on building and maintaining reliable software systems.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Orchestration Allows Microservices to Be Unreliable (That's a Good Thing)

Orchestration Allows Microservices to Be Unreliable (That's a Good Thing)

Comments
4 min read
Unlocking Reliability: Why Data Pipelines Need Declarative Deployment & GitOps

Unlocking Reliability: Why Data Pipelines Need Declarative Deployment & GitOps

Comments
4 min read
When Retries Turn Hostile — How Control Logic Kills Production Systems

When Retries Turn Hostile — How Control Logic Kills Production Systems

1
Comments
4 min read
CI/CD Reliability: When Your Deploy Pipeline is Your SPOF

CI/CD Reliability: When Your Deploy Pipeline is Your SPOF

Comments
3 min read
software engineers are becoming reliability engineers for generated output

software engineers are becoming reliability engineers for generated output

Comments
5 min read
Disaster Recovery Drills That Actually Work

Disaster Recovery Drills That Actually Work

Comments
3 min read
Disaster Recovery Drills That Actually Work

Disaster Recovery Drills That Actually Work

Comments
3 min read
Automatic Error Recovery in AI Agent Networks

Automatic Error Recovery in AI Agent Networks

Comments
2 min read
AI fallback modes should protect user momentum, not just fail safely

AI fallback modes should protect user momentum, not just fail safely

Comments
9 min read
Feature Flags as a Reliability Tool, Not Just an A/B Platform

Feature Flags as a Reliability Tool, Not Just an A/B Platform

Comments
3 min read
API Idempotency Keys: Prevent Duplicate Requests

API Idempotency Keys: Prevent Duplicate Requests

Comments
2 min read
Service Level Objectives for Complex Microservices

Service Level Objectives for Complex Microservices

Comments
3 min read
Building a Culture of Reliability: Beyond the SRE Handbook

Building a Culture of Reliability: Beyond the SRE Handbook

Comments
3 min read
I Ran 20 Cycles in a Row and Every Single One Failed — Here's What I Learned About Resilience

I Ran 20 Cycles in a Row and Every Single One Failed — Here's What I Learned About Resilience

Comments
2 min read
Multi-region is theater. Multi-AZ is engineering.

Multi-region is theater. Multi-AZ is engineering.

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.