Issue 196

A wonderful variety of stories this week, with tons of logging coverage, considerations for the cost-conscious Observability engineer, and a really great write-up (and video) from eBay on their transition to OpenTelemetry. Enjoy! ☕🍩😍

This issue is sponsored by:

Imagine this: zero catastrophic service disruption.

Check out Forrester’s Total Economic Impact™ (TEI) study to learn the cost savings and business benefits (165% ROI, to be exact) enabled by Chronosphere's observability platform. Download your free copy here.

Articles & News on monitoring.love

Observability & Monitoring Community Slack

Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.

From The Community

Why and How eBay Pivoted to OpenTelemetry

Props to eBay engineering for sharing the story of their transition from Elastic Beats to OpenTelemetry for telemetry collection. Note that the author gave a corresponding talk at Open Observability Day back in October.

Tips for analyzing logs

I was excited to learn that Julia Evans, everyone’s favorite tech author and illustrator, is working on a new zine about debugging. In the meantime, she accumulated a massive number of log analysis tips and ended up sharing them in one super compressed (pun intended) blog post for us all.

The ROI of monitoring data usage

Although this post was intended for data teams, it includes some valuable considerations for anyone dealing with the never-ending growth of metrics, logs, and event storage.

The intersections among FinOps, Sustainable IT, and Observability

Speaking of ROI, this story takes a broader look at how we might want to think about cloud costs and their relationship with IT and Observability needs.

Observability with Prometheus and Grafana

We see a lot of guides for collecting metrics and monitoring them with Prometheus and Grafana, respectively, but this might be the most concise and comprehensive (is that possible?) article I’ve seen for anyone new to these tools and concepts. A++ would recommend to junior engineers and SRE managers.

Grafana as code: A complete guide to tools, tips, and tricks

A solid collection of approaches for managing your Grafana instance(s) in code. Super helpful comparison guide if you’re looking to level up your Grafana automation.

Having trouble responding quickly to network outages, troubleshooting, and minimizing network downtime?

SolarWinds® Network Performance Monitor is a powerful, affordable network monitoring software enabling you to quickly detect, diagnose, and resolve network performance problems and outages. Download a fully-functional 30-day free trial and get ultimate end-to-end visibility for your network. (SPONSORED)

Logging wide events with F#

I love seeing the variety of Advent Calendar articles this time of year. This post demonstrates how to add structured logging to your own F# services.

Testing Production-Ready Applications on the Local Kubernetes Environment

I’m always interested in the intersection between CI/CD and Observability, and this post demonstrates the value gained by having these systems working well together. Although you might not be using the same tools as the author here, there are still some valuable takeaways.

The Five Myths of Observability

Not to be confused with some number of pillars of Observability. All kidding aside, this sums up many (hint: there are more) of the misconceptions that we’ve worked through and put to rest over the last decade in tech.

HAProxy Logging Configuration: How to Enable & View Logs

A thorough review of HAProxy’s logging capabilities and configuration options.

Events

Monitorama PDX 2023 - June 26-28 (Portland, OR)

Monitorama is returning to Portland, OR next summer. The 2022 conference was a fantastic event and I look forward to seeing you all again in 2023.

Job Opportunities

Staff Site Reliability Engineer at Home Depot (US Remote)

See you next week!

– Jason (@obfuscurity) Monitoring Weekly Editor