A really fun week of articles, with an emphasis on load testing, Kubernetes observability, and network performance monitoring. Great stuff all around. Enjoy! 🚢💾🐰
This issue is sponsored by:
Are you looking to modernize Log Analytics while controlling the cost?
DataSet is the cloud-native event data platform that enables teams to achieve petabytes of effortless scalability and real-time performance at a fraction of the cost. Get complete visibility into your entire stack and experience the DataSet difference for free.
Articles & News on monitoring.love
Observability & Monitoring Community Slack
Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.
From The Community
Evaluating and improving the performance of your ingress controller on Oracle Kubernetes Engine with Locust
An overlooked aspect of monitoring is understanding the baseline and theoretical ceiling of your architecture. Here’s a great example for testing the throughput of your Kubernetes ingress controller using some open source tools.
Optimizing TCP for high WAN throughput while preserving low latency
If you enjoy TCP network monitoring and performance debugging conversations, you’re going to love this one. Props to Cloudflare for sharing their lessons learned in performance tuning the Linux kernel.
Getting Started on Kubernetes observability with eBPF
An excellent example for capturing network flow telemetry within your Kubernetes cluster using eBPF. You may not need this, but it’s a very approachable tutorial that should be helpful elsewhere.
I love this post from Slack engineers on their continuous load testing platform and how it’s elevated their resilience by helping to identify performance regressions before they make it to production.
Kubernetes CrashLoopBackOff: What it is, and how to fix it?
If you already administer Kubernetes clusters, chances are you’re familiar with CrashLoopBackOff. On the other hand, if you’re a developer using K8s… maybe not so much. This post is a good overview of the concept along with some examples for debugging and monitoring this state in your Pods.
Sample vs Metrics vs Cardinality
A visual explanation of some of the attributes and constraints that affect how we store and query time-series data.
Chronosphere is the only observability platform that puts you back in control by taming rampant data growth and cloud-native complexity, delivering increased business confidence. See how companies like Abnormal Security cut observability costs and boosted platform reliability with Chronosphere’s end-to-end solution. Read the story here. (SPONSORED)
How to Configure Grafana to Use Remote Database for HA
A handy guide for configuring Grafana to use a shared remote database for highly available setups.
OpenTelementry: Modern Day Solution For Observability
This author makes a strong case for adopting OpenTelemetry for modern systems and applications. Probably a good resource to share with your friends who are still relatively new to the space.
Observability vs Monitoring: What’s the Difference?
I’ve read so many articles like this, but it’s always interesting to read others’ take on what distinguishes these two capabilities.
A look at the newest release for Grafana Tempo with a look forward at their plans for the new Parquet backend and TraceQL query language planned for Tempo 2.0.
“Locust is an easy to use, scriptable and scalable performance testing tool.”
“Zero instrumentation observability on Kubernetes with eBPF”
Senior DevOps Engineer at Hive Collective (Remote)
Sr. Platform Engineer at SeatGeek (US Remote)
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor