Issue 170
I don’t know about you all, but I love that there’s something different and unique to share with you every week. This issue has a great collection of practical advice and aspirational projects from the community, plus an explanation of the logging changes included with Kubernetes 1.24. Enjoy! 😸💾🔎
This issue is sponsored by:
Observability Powered by SQL: OpenTelemetry Traces + SQL
Promscale is an observability backend built on the rock-solid foundation of PostgreSQL and TimescaleDB. With the new support for OpenTelemetry traces, you now have full SQL superpowers to solve even the most complex issues in your distributed systems. Read more about it here.
Articles & News on monitoring.love
Observability & Monitoring Community Slack
Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.
From The Community
OpenTelemetry in Action: Optimizing Database Operations
An excellent follow-up to the author’s previous post, this article looks at how to leverage OpenTelemetry to detect and diagnose database performance issues.
Closing observability gaps with custom metrics
A guide for determining the appropriate golden signals for your respective service and workload types.
Leveraging Consul for Thanos Query Discovery
How to build your own service discovery mechanism for Thanos with Consul and Prometheus’ built-in primitives.
Contextual Logging in Kubernetes 1.24
Kubernetes 1.24 includes a number of enhancements to structured and contextual logging. Great to see the scoping improvements and the benefits they will bring in scenarios like unit tests.
Learning from an incident, hitting the 2,147,483,265 limit at Shipup!
Some very pragmatic lessons learned by a small team from a customer-impacting outage. It’s easy to pick at their obvious mistakes, but who among us hasn’t gone through some of these in production?
Continuous Monitoring & KPI benchmarking
Although this article is focused on continuous delivery, it’s a topic that’s very relevant to my current interests. It has me thinking there should be more collaboration between product, platform, and observability teams when it comes to plugging into our existing monitoring infrastructure.
The Future of Cloud-Native Observability and 5 Open Source Tools to Help You…
A concise primer on observability, how to approach it for different types of systems, and a list of the most useful open source tools in the space.
Get your incidents under control and focus on fixing 🔥
Being on-call at 2am when everything is on fire can be stressful! incident.io makes responding to incidents quick and easy by automating your entire incident management process in Slack. Sign up for free. (SPONSORED)
How to Better Understand your Applications Real-Time Value
How the Domain-Oriented Observability pattern can result in more efficient code and better accountability among software teams.
A GitOps Way To Manage Grafana Data Sources At Scale
The title really says it all. Check out the GitHub project linked in the Tools section below for the related source code.
NGINX Access & Error Logs Configuration: Logging Explained
A deep dive on NGINX logs and how to configure them effectively.
How We Built Alert Rules, Runbooks, and Dashboards to Observe Our Observability Tool
Timescale explains how they monitor their own services (“who watches the watcher”) and how you can apply these techniques to your own Promscale installation.
Tools
“A GitOps Way To Manage Grafana Data Sources At Scale”
Events
Monitorama PDX 2022 - June 27-29 (Portland, OR)
Monitorama is returning to Portland, OR this summer. It looks like a return to form for one of our favorite events (ok, we might be biased). Hope to see you there!
Job Opportunities
DevOps Engineer at Lively (US Remote)
Senior DevOps Engineer at Forbes (US Remote)
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor