Hope you’re all doing well and staying cool in the face of some unusual weather. Sit back and enjoy another collection of stories from the past week!

This issue is sponsored by:

Get visibility into your complex applications, no matter where they’re deployed. Consolidate tools and eliminate blind spots by combining infrastructure and application monitoring, logging, RUM, and more. Get alerts in real time, based on all your data without sampling. Check out a free trial of Splunk Observability Cloud today.

Articles & News on monitoring.love

Observability & Monitoring Community Slack

Come hang out with all your fellow Monitoring Weekly readers. This past week there were some fun threads around tracing and tools, with a long list of excellent reading recommendations. Join our Slack so you don’t miss out! 😃

From The Community

Observing containers with the three pillars of observability

A wide-angle look at observability considerations for containers with an emphasis on meaningful signals.

Grafana dashboard showcase

Some of these dashboards are truly gorgeous. I’m not sure how effective they are for daily use, but they’re solid inspiration for future projects.

What we learned from an iOS app OOMs incident

I’ll never get tired of reading how other engineers think about debugging and interpreting signals. A great diagnostic story.

How Netflix uses eBPF flow logs at scale for network insight

The Flow Exporter is a sidecar that uses eBPF tracepoints to capture TCP flows at near real time on instances that power the Netflix microservices architecture.

Genuinely curious to see what this looks like at scale. Drool.

Configuring fluentd for logging in Kubernetes in under 15 minutes

How and why you might want to use fluentd for Kubernetes logging.

My Top 5 Grafana sessions from GrafanaCONline 2021

Still sitting here tapping my foot, waiting for the GrafanaCONline 2021 videos to be released. In the meantime, here’s a quick recap of some highlights from the event.

Leveraging OpenTelemetry For Custom Context Propagation

Is there anything that OpenTelemetry can’t do? All kidding aside, custom context propagation sounds like one of those things that solve problems that I’m really glad I don’t have to deal with.

Trusting Metrics at Pinterest

I’ve never heard of metrics certification before; this sounds wild. Feels a little domain-specific (i.e. social media), but interesting nonetheless. Where else might this be useful?

How To Monitor Your Services Hosted On AWS EC2 Instances

This feels like a lot of complexity to avoid using CloudWatch metrics, but it’s an interesting use of Lambda functions and AWS Secrets Manager.

Monitoring Kafka streams applications

If you rely on Kafka Streams, you should check this out. Personally, I got lost in a rabbit hole reading about Ziggurat, their “stream processing framework to build stateless applications on Kafka”.


Monitorama PDX 2021 - September 13-15 (Portland, OR)

One of the first technical conferences to resume in-person events, Monitorama is returning to Portland, OR this fall. It looks like a return to form for one of our favorite events (OK, we might be biased). Hope to see you there!

Job Opportunities

Cloud Engineer at Redox (Remote)

SRE - Supercomputing at Tesla (Fremont, CA)

Cloud SRE - Reliability at Elastic (Remote)

Ready to lower your AWS bill? Now might be the perfect time for an AWS Cost Optimization project with The Duckbill Group. The Duckbill Group aims for a 15-20% cost reduction in identified savings opportunities through tweaks to your architecture, or your money back. (SPONSORED)

See you next week!

– Jason (@obfuscurity) Monitoring Weekly Editor