Issue 179

So much great stuff this week, with a particular emphasis on Prometheus, debugging, and database monitoring. Enjoy! 💾🪓🍹

This issue is sponsored by:

Chronosphere logo

Chronosphere is the only observability platform that puts you back in control by taming rampant data growth and cloud-native complexity, delivering increased business confidence. Find out how top performing organizations are adopting observability to achieve exceptional digital experiences. Read the report here.

Articles & News on monitoring.love

Observability & Monitoring Community Slack

Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.

From The Community

External Debugging Tools 1: dtrace and strace

An introduction to dtrace and strace, along with some useful examples for using them effectively.

SRE Bytes: The Four Golden Signals of Monitoring

It feels like a great day to revisit the golden signals of monitoring. I love how the author adds context around each of these signal types with some useful examples and perspective.

PostgreSQL Logs Explained: Logging Configuration Tutorial

An excellent primer on PostgreSQL logging and how to manage them effectively.

Multi-site monitoring with HA and dynamic scale using VictoriaMetrics

I’m starting to hear more frustration from the community around scaling Prometheus. This article makes the case for using VictoriaMetrics as an alternate backend, but are we trading one complex stack for another? Regardless, if you’d like to try it out, this looks like a solid guide for getting started.

Monitoring MySQL using Prometheus, Grafana and mysqld_exporter in Kubernetes

A very thorough look at monitoring your MySQL containers in Kubernetes with Prometheus. I appreciate that the author went to the trouble of showing how to actually generate some sample load to help visualize the results.

Effective SRE: SLO Engineering and Error Budget

If you can get past the triangle of SRE doom (my description, not theirs), this is a good deep-dive on SLOs & error budgets and how they impact SRE.

Adevinta logo

Adevinta’s 40+ online marketplaces receive more than 3 billion monthly visits across 14 countries. Follow our Global Product and Tech teams on Medium and learn how to develop scalable products, platforms and services. Our main technologies include Python, Java, Scala, Kotlin, Go, AWS, Kafka, Kubernetes and Spark. Visit the blog. (SPONSORED)

Stateless, Secretless Multi-cluster Monitoring in Azure Kubernetes Service with Thanos, Prometheus and Azure Managed Grafana

This feels like one of those guides that you’re going to want to read once, bookmark, and hopefully remember later on when you finally encounter this particular set of constraints. I don’t personally run AKS but I still found some useful bits in this one.

Reflections on the OTel Community Day 2022

I completely missed that there was an OTel Community Day in Austin this summer. This attendee shares their experience and some of the highlights from the event.

Outer Joins in Flux

I’m a little sad that outer joins are (still) actually a thing in any time-series query language, but if you’re an Influx user this might be a good resource for you.

How to use WebSockets to visualize real-time IoT data in Grafana

If you woke up today wondering how to visualize real-time data in your Grafana, I have you covered. You’re welcome. 🤗

Job Opportunities

Senior DevOps Engineer at ChiroTouch (US Remote)

See you next week!

– Jason (@obfuscurity) Monitoring Weekly Editor