Issue 218

So much great stuff this week, with an emphasis on Lambda logging, Prometheus, and distributed tracing. On a personal level, I’d like to take a moment to thank any service members in our audience. Your dedication and sacrifice is truly appreciated. Thank you! 🎖️💜📈

This issue is sponsored by:

Armory logo

Can you rely on your deployments?

In a recent Armory and Gartner report, 35% of respondents’ top pain point with app deployment is reliability and consistency. If you need help with consistent, reliable deployments, try Armory Continuous Deployment-as-a-Service. Check out more in the report here.

Articles & News on monitoring.love

Observability & Monitoring Community Slack

Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.

From The Community

OTEL: Plug-and-Play APM

A reminder that OpenTelemetry gives us a level of portability (and resistance to vendor lock-in) that we’ve never really seen before.

Loki: collecting logs from CloudWatch Logs using Lambda Promtail

You’ve deployed Loki and it’s working great for your Kubernetes logs, but now you’re tasked with onboarding CloudWatch and Lambda logs. What to do?

Blackbox Exporter: To Probe or Not to Probe

Blackbox monitoring is a common pattern but it always seems to take longer than it should to set up properly. This guide walks you through all the steps to start probing and capturing your blackbox metrics in Prometheus.

Sending AWS Lambda Logs to an Observability Service Like Datadog or New Relic

Similar to the earlier Loki article, but this time for anyone needing to consolidate their CloudWatch and Lambda logs to a third party observability provider.

The Ticking Time Bomb of Observability Expectations

This post reminds me how much has changed over the last 10 years, and yet so much remains the same. The author isn’t wrong, but there will always be compromises when you’re designing and supporting complex systems at scale.

Distributed Tracing: A Guide for 2023

An awesome getting started document for anyone curious about tracing and wanting to absorb as much context as possible.

Chronosphere logo

The complexity of microservices means that traditional monitoring solutions are not effective or efficient enough for new dynamic observability needs. In this article, explore three traps that organizations should avoid as organizations approach visibility into their cloud native environments.

Decoding eBPF Observability: How eBPF transforms Observability as we know it

A comprehensive argument for the benefits of eBPF and what it’s such an important piece of any modern observability stack.

Optimizing Resource Allocation with Prometheus Monitoring and Spot Instances in Kubernetes

It’s my belief that any service that isn’t falling over is underutilized. This post provides some high-level tips for leveraging your monitoring service to optimize your flexible infrastructure.

2023-03-08 Incident: Infrastructure Connectivity Issue Affecting Multiple Regions

An interesting postmortem from Alexis himself looking back at a Datadog outage from March 2023.

Introducing logging & tracing in Snowflake via Event Tables

If you or anyone else on your team has to support Snowflake in your company, you’ll be happy to hear they’ve added support for logging and tracing. Sounds like there are some limitations still but it’s a start.

Events

Monitorama 2023 PDX

Just one month left until everyone’s favorite monitoring conference of the year. I’m super excited to see the new speakers and to hear what everyone has been up to since the conference returned to Portland in 2022. Hope to see you there!

See you next week!

– Jason (@obfuscurity) Monitoring Weekly Editor