Issue 150
I hope everyone enjoyed their holidays (or at least had a quiet on-call rotation) as we tread gently into 2022. Lots of fun articles this week, especially if you geek out over the pillars of observability. Enjoy! šš®
Articles & News on monitoring.love
Observability & Monitoring Community Slack
Come hang out with all your fellow Monitoring Weekly readers. I mean, Iām also there, but Iām sure everyone else is way cooler.
From The Community
High Availability & Fault Tolerance for Monitoring Stack
A good primer for thinking about highly available Prometheus, Alertmanager, and Grafana. TBQH thereās a lot more to plan for when scaling a truly HA Prometheus environment, but this will get you started in the right direction.
Fluent Bit ā Write custom output plugins using Golang
A nifty example for writing your own custom output plugins for the Fluent Bit log processor.
One authorās take on why we donāt see more consolidation among open source tools. Personally, I think they might be conflating monitoring tools with the rest of the observability space, e.g. traces (which do a better job normalizing highly cardinal systems). Still an interesting read with some fresh takes on the space.
Why you should use CloudWatch Embedded Metric Format
An excellent case for using CloudWatch EMF for custom metrics rather than publishing them yourself to PutMetricData
.
We seemingly talk about reliable systems every day, but investing in reliability often competes with enhancements for engineering resource and priority. Nice to hear from a company that treats reliability as an important āfeatureā.
Great to see a community article on Vector. It started as a high-performance log router (written in Rust), but it appears to be evolving into something akin to an āobservability pipelineā. Good to see Datadogās acquisition of the Vector team hasnāt slowed them down.
As a time-series nerd, I canāt help but geek out when engineers share their notes (and math) when scaling TSDBs. Even though Goku isnāt open source, it makes for a great read if you work with these types of systems.
Monitor CSGO - Counter Strike: Global Offensive with Prometheus
I donāt play CSGO, but Iāve been looking for better ways of tracking performance on my personal desktop. Fortunately, thereās always the Minecraft exporter for us casual gamers.
Spans - a key concept of distributed tracing
A helpful overview of spans and their role in distributed tracing systems.
The journey of one developerās experience contributing to the Thanos project through the Linux Foundationās mentorship program.
Tools
āA reliable, high-performance tool for building observability data pipelines.ā
āPrometheus exporter for SRCDS Gameserver using Source RCON.ā
sladkoff/minecraft-prometheus-exporter
āA Bukkit plugin which exports minecraft server stats to Prometheus.ā
Job Opportunities
DevOps Engineer at Munibilling (Remote)
Cloud Operations Engineer at HomeValet (Remote)
Senior Site Reliability Engineer - Monitoring/Observabity at Axon (Seattle, WA)
Ready to lower your AWS bill? Now might be the perfect time for an AWS Cost Optimization project with The Duckbill Group. The Duckbill Group aims for a 15-20% cost reduction in identified savings opportunities through tweaks to your architectureāor your money back. (SPONSORED)
See you next week!
ā Jason (@obfuscurity) Monitoring Weekly Editor