A fun week of stories with an emphasis on logging, incident response, and alerting. Some interesting tools to play with this weekend too… enjoy! ☕🍂🧠
This issue is sponsored by:
Are you looking to modernize Log Analytics while controlling the cost?
DataSet is the cloud-native event data platform that enables teams to achieve petabytes of effortless scalability and real-time performance at a fraction of the cost. See DataSet in action at KubeCon or SREcon Europe and get a personalized demo, collect awesome swag, and win exciting prizes.
Articles & News on monitoring.love
Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.
From The Community
A look at how Razorpay improved their platform resilience by introducing self-service alerting and automated incident management.
A postmortem of an incident affecting Honeycomb’s ingest system last month. It’s always interesting to discover unexpected failure patterns for systems that you swear you know completely; props to their engineers for sharing lessons learned from this outage.
A very handy guide for anyone new to Prometheus monitoring and alerting. This is a good one to share with any developers new to our domain.
Examples for alerting on your service levels with Prometheus and PromQL.
A solid primer on Kubernetes logging and interacting with the logs using
A reminder that not everything with “Observability” in the name really is.
Show notes and videos from Amazon’s recent “Build On Observability Day” event.
There are different approaches for handling distributed tracing context propagation. In this article, ContainIQ explains their use of metadata based correlation with eBPF.
New Relic has published their annual “State of Logs” report with some interesting takeways and developing trends.
Groundcover has released a new OSS tool that looks a lot like “top”, but for Kubernetes clusters.
How to host your own Pingdom-like website monitoring with the open-source HotHost project.
“Lightweight and minimalistic free and opensource Servers and HTTP monitor.”
“Murre is an on-demand, scaleable source of container resource metrics for K8s.”
Monitorama is returning to Portland, OR next summer. The 2022 conference was a fantastic event and I look forward to seeing you all again in 2023.
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor