A curated list of Site Reliability and Production Engineering resources.
Updated Jun 10, 2024
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
🏓 The open-source synthetic monitoring platform 🏓
OneUptime is the complete open-source observability platform.
📟✨ PagerDuty on-call widget for monitoring dashboards. Compatible with Datadog and Grafana.
A curated list of awesome Site Reliability and Production Engineering resources.
A collection of awesome tools, software, libraries, learning tutorials & videos, frameworks, best practices and technical resources about Incident Response & Management in Cybersecurity
💯 A curated list of Site Reliability and Production Engineering resources (in Chinese)
Simple custom UI for searching PagerDuty incidents
opsway mono backend for the API, probes, etc.
The backend powerhouse for FlowInquiry, designed to handle requests, manage workflows, and ensure smooth operations for internal and external request management with SLA compliance.
Streamlined solutions for managing requests and workflows within SLA timelines. This repository provides deployment scripts and configurations for Docker, Kubernetes, and more to ensure efficient, scalable setups.
A simple notification architecture that reminds employees of their on-call shifts by email or SMS. No need to keep a document and keep forgetting your on-call shift schedule! 🤓
iLert docs ☀️
An HTTP API service that simplifies daily operations and on-call duty by letting you run repeated and cumbersome tasks in no time!
A CLI tool for oopsiee-server. Simplifies daily operations and on-call duty by letting you run repeated and cumbersome tasks with one-liners.