Issue 152

Hope you’re all doing well and staying warm today. A fun collection of articles this week with a little bit for everybody. Sit down, grab your favorite hot beverage, and enjoy! ☕

Chronosphere is the only observability platform that puts you back in control by taming rampant data growth and cloud-native complexity, delivering increased business confidence. Ready to stop managing your own Prometheus? Check out your buyer’s guide for Prometheus-Native Monitoring SaaS Solutions here.

Articles & News on monitoring.love

Observability & Monitoring Community Slack

It’s been amazing to see the community grow throughout 2021 and into 2022. We’d love to have you join us and share what you’ve been working on.

From The Community

5 Design Patterns for Building Observable Services

An excellent article from Salesforce engineering, covering their more popular design choices for building observable services.

Live Utility Meter Monitoring With Grafana & Software Defined Radio

Neat little weekend project for reading RF-transmitting utility meters and tracking the data in familiar observability tools.

SRE Principles Part 1

A look at some of the differences between SRE and DevOps principles, with a particular emphasis on service levels and monitoring signals.

Monitoring errors in your A/B tests

A/B testing is a great way to evaluate changes in your application, but it can become a chore to maintain the constantly evolving tests. Preply shares how they’ve automated their stack to enable development teams to more easily monitor and alert on their own test failures.

Auto-Diagnosis and Remediation in Netflix Data Platform

I never get tired of reading about Netflix teams and their custom data pipelines. An interesting read about how they perform auto-diagnosis and remediation across their infrastructure.

The New Version of Orbit (v1.1) is Released: The Improvements, Design Changes, and Exciting Collaborations

Last year, Uber released Orbit, an open source Python package for Bayesian time-series analysis and forecasting. This article covers all the updates and changes in their recent v1.1 release.

Quantifying Latency at Scale

Fantastic article from Fred Moyer, one of our friends in the Monitoring Weekly community, on the importance of accuracy in our data with some tooling recommendations.

Raygun + Flutter: Build more resilient mobile applications ⚡

Raygun is expanding its powerful Error Monitoring and Crash Reporting solution to Flutter. Now you can get complete visibility into the health of your Flutter applications, with rich diagnostics that take you to the root cause of errors and crashes. Read the blog today. (SPONSORED)

Detect crashes in your Kubernetes cluster using kwatch and Slack

This looks like a genuinely interesting tool for monitoring crashes in your Kubernetes cluster, with native hooks into Slack. Very cool.

Key Metrics for Monitoring Azure SQL Databases

Datadog always does a great job with these “which metrics to monitor” posts, and this one is no different. If you’re using Azure SQL, this is a great starting point for learning which metrics you should be keeping an eye on.

Configuring Grafana Tempo and Linkerd for distributed tracing

A guest post on the Grafana blog demonstrating how to use Grafana Tempo and Linkerd with the NGINX Ingress Controller to trace requests through the stack.

A curated list of “Top” based monitoring tools for use in Linux and Unix terminals.

A surprisingly comprehensive compilation of just about every Top-like utility in the known universe. There were quite a few I’d never heard of here. o_O

Deploy Grafana Loki and Promtail using ArgoCD

A sample GitOps workflow for standing up Grafana Loki with Promtail in your Kubernetes cluster.

Tools

uber/orbit

“Orbit is a Python package for Bayesian time series forecasting and inference. It provides a familiar and intuitive initialize-fit-predict interface for time series tasks, while utilizing probabilistic programming languages under the hood.”

abahmed/kwatch

“kwatch helps you monitor all changes in your Kubernetes(K8s) cluster, detects crashes in your running apps in realtime, and publishes notifications to your channels (Slack, Discord, etc.) instantly”

Negotiating your AWS contract? Let us help. At The Duckbill Group, we’re on your side and we see dozens of these a year–more than most AWS account managers! We’ve helped negotiate everything from $3mm contracts to $650mm contracts and a whole slew in between. Check out our AWS contract negotiation services. (SPONSORED)

See you next week!

– Jason (@obfuscurity) Monitoring Weekly Editor