I hope you had a great week and are ready for an equally great newsletter. This week’s theme is all about resiliency in the face of complex systems. Grab your favorite drink and cuddle up to this week’s collection of articles from across the globe. Enjoy! 💔💾🔥
This issue is sponsored by:
There are a LOT of questions surrounding observability. Heck, is observability even a noun or a verb? To answer some of those burning questions, Chronosphere teamed up with Forrester to discuss what cloud native observability really is and why observability can lead to better business outcomes. Read our latest blog summarizing the discussion.
Articles & News on monitoring.love
Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.
From The Community
It’s come to my attention that Dr. Richard Cook recently passed away. His research influences the way we think about complex system failures and how to engineer resilience into these systems. Dr. Cook will be missed but our industry is better for having intersected with his life’s work.
It’s not impossible to ensure a highly resiliant observability stack, but it takes careful planning and mindful execution. Glad to see someone covering this topic, I’d like to see more about it.
Unusual title, but this looks like a handy introduction to Prometheus concepts and setup. Probably a good one to share with colleagues who are less familiar to the space.
This might be one of the best guides I’ve seen for setting up distributed tracing in your application, from instrumentation through visualization. If you’re a Rust programmer (or a wanna-be Rustacean like me), you should absolutely check this one out.
If you’re using kOps to provision your Kubernetes control plane, you might have already hit the proverbial wall with Datadog’s native integrations. This guide should help you get things hooked up properly.
An overview of monitoring considerations and tooling suggestions for Kubernetes clusters.
Some of the more important metrics respresenting failure modes across a variety of AWS services.
If you maintain your own Grafana plugin, there’s now a tool to help ensure compatibility with new Grafana versions.
“A tool for helping to understand APIs exported and consumed by NPM packages (or any TypeScript code).”
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor