DEV Community

Site Reliability Engineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Passo a Passo: Configuração do WSL para DevOps e SRE no Windows

Passo a Passo: Configuração do WSL para DevOps e SRE no Windows

1
Comments
48 min read
The Open-Source On-Call Integration

The Open-Source On-Call Integration

Comments
5 min read
How to Configure Grafana to Send Alerts to Slack and Telegram

How to Configure Grafana to Send Alerts to Slack and Telegram

1
Comments
4 min read
Hosted Prometheus vs. Self-Managed: A Neutral Guide to Costs, Control, and Trade-offs

Hosted Prometheus vs. Self-Managed: A Neutral Guide to Costs, Control, and Trade-offs

Comments
3 min read
Federated Kubernetes: The Post-Kubefed Era

Federated Kubernetes: The Post-Kubefed Era

1
Comments
3 min read
Why Platform Engineering? Do You Really Need It?

Why Platform Engineering? Do You Really Need It?

Comments
4 min read
Hack the Planet as a Service

Hack the Planet as a Service

Comments
3 min read
AWS Appconfig

AWS Appconfig

Comments
2 min read
Script to list the S3 Bucket storage size

Script to list the S3 Bucket storage size

Comments
1 min read
How do I use the ResourceTag, condition keys to create an IAM policy for tag-based restriction

How do I use the ResourceTag, condition keys to create an IAM policy for tag-based restriction

Comments
3 min read
Kubernetes DaemonSets vs Deployments: Key Differences and Use Cases

Kubernetes DaemonSets vs Deployments: Key Differences and Use Cases

Comments
5 min read
DevOps Made Simple: A Beginner’s Guide to Self-Healing Systems in DevOps

DevOps Made Simple: A Beginner’s Guide to Self-Healing Systems in DevOps

6
Comments
2 min read
Diving into Banking Infrastructure on AWS Cloud – Thoughts on this Series?

Diving into Banking Infrastructure on AWS Cloud – Thoughts on this Series?

Comments
3 min read
Replace Opsgenie with this open-source alert router (save $2,280/year)

Replace Opsgenie with this open-source alert router (save $2,280/year)

1
Comments
2 min read
Chaos Mesh: O que é e faz?

Chaos Mesh: O que é e faz?

5
Comments
2 min read
Architecting Event-Driven Architecture on Google Cloud: A Journey Through Real-World Scenarios

Architecting Event-Driven Architecture on Google Cloud: A Journey Through Real-World Scenarios

Comments
4 min read
Bandwidth and Throughput: A Clear Comparison You Need to Know

Bandwidth and Throughput: A Clear Comparison You Need to Know

2
Comments 3
2 min read
Insider Realities of Site Reliability Engineering: Lessons from a DevRel Perspective

Insider Realities of Site Reliability Engineering: Lessons from a DevRel Perspective

1
Comments
3 min read
The Beginner’s Guide to Observability: From Basics to Better Quality of Life

The Beginner’s Guide to Observability: From Basics to Better Quality of Life

Comments
5 min read
Mastering Kubernetes: Become a Pro in K8s Deployments

Mastering Kubernetes: Become a Pro in K8s Deployments

11
Comments
7 min read
In 2025, I resolve to be proactive about reliability

In 2025, I resolve to be proactive about reliability

Comments
6 min read
AWSsence: Exploring Event Monitoring

AWSsence: Exploring Event Monitoring

Comments
1 min read
In 2025, I resolve to eliminate escalations and finger pointing

In 2025, I resolve to eliminate escalations and finger pointing

Comments
5 min read
Involving the Right People in an Incident

Involving the Right People in an Incident

1
Comments 1
4 min read
In 2025, I resolve to spend less time troubleshooting

In 2025, I resolve to spend less time troubleshooting

Comments
12 min read
loading...