I hope everyone enjoyed the quarterly Best Of issue last week. This week brings us an interesting mix of Prometheus and OTel topics, network monitoring, and a big announcement from Grafana. Enjoy!
This issue is sponsored by:
See how companies like DoorDash are “no longer flying blind” with increased visibility and reliability from Chronosphere’s end-to-end solution. Chronosphere is the only observability platform that puts you back in control by taming rampant data growth and cloud-native complexity, delivering increased business confidence. Learn more here.
Articles & News on monitoring.love
It’s been amazing to see the community continue to grow. We’d love to have you join us and share what you’ve been working on.
From The Community
How (and why) Razorpay engineers switched from Jaeger to Hypertrace for their distributed tracing needs.
Pretty big announcement from Grafana. Interesting to read that they plan to support more than just Prometheus metrics.
Great read on some of the challenges they faced scaling Mimir to its current capacity. Personally, I’m less interested in their write path improvements and more curious how it compares to similar (complex) queries in something like ClickHouse or VictoriaMetrics.
As someone who’s had to do this in production, I wish this article existed three years ago. I really appreciate that the author covers both the how-to as well as any potential gotchas.
This is more about scaling than monitoring, but I always enjoy a good story with network graphs and bandwidth challenges.
TIL that you couldn’t previously alert on GCP logs directly (rather, you’d have to transform them into log-based metrics first). Still, nice to see this feature finally land, and this article covers the feature set and use cases.
Mizu looks like a pretty handy tool for visualizing network traffic and service activity on your Kubernetes clusters.
One part of a larger series on AWS products, this post is particularly interesting for its broad coverage of the monitoring and observability services in AWS.
A straightforward example for setting up the OpenTelemetry Collector with Prometheus on your Kubernetes cluster.
Speaking of OpenTelemetry, here’s a handy guide for updating your Python apps to begin sending spans and collecting traces.
It’s natural to want to consolidate tooling, especially when it comes to certain cloud-provider services. This author cautions that you might be getting less (or more) than you bargained for.
Frankly I’m only including this one because I found it such a fascinating contrast (to the previous article) in the same week. 😂
“A simple-yet-powerful API traffic viewer for Kubernetes enabling you to view all API communication between microservices to help your debug and troubleshoot regressions… [t]hink TCPDump and Wireshark re-invented for Kubernetes.”
Monitorama is returning to Portland, OR this summer. It looks like a return to form for one of our favorite events (ok, we might be biased). Hope to see you there!
Negotiating your AWS contract? Let us help. At The Duckbill Group, we’re on your side and we see dozens of these a year–more than most AWS account managers! We’ve helped negotiate everything from $3mm contracts to $650mm contracts and a whole slew in between. Check out our AWS contract negotiation services. (SPONSORED)
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor