Data Science Briefing #316

Issue #316

April 30, 2026

Announcements

 

✈️ Mapping the skies: How do we visualize airline traffic between states?

We often think of air travel in terms of airports, but viewing it as a network of state-to-state connections reveals fascinating patterns in how our country moves.

Our latest substack uses data visualization to turn raw statistics into a clear story about infrastructure and mobility.

👉 Airline Traffic Between States

Check it out and Subscribe so you don't miss another post.


Book of the Week

"Designing Data-Intensive Applications" by M. Kleppmann and C. Riccomini is the kind of book that quietly raises the level of everyone who reads it. In this new edition, the authors do an outstanding job of explaining the core ideas behind modern data systems, like replication, consistency, storage, streaming, fault tolerance, and scalability, without reducing them to buzzwords or vendor-specific recipes. That makes the book especially valuable for data scientists and machine learning engineers as it bridges the gap between building models and understanding the data infrastructure those models depend on in production.

What makes the book so compelling is its focus on first principles. Rather than teaching a single stack or a fleeting set of tools, it gives readers a durable framework for thinking about trade-offs in real systems. That is incredibly useful for ML engineers working on pipelines, model serving, retrieval systems, or any workflow where reliability and performance matter as much as model quality. The downside is that it is more conceptual than hands-on, and readers looking for quick code examples or direct coverage of topics like feature stores, vector databases, or modern LLM infrastructure may wish it connected the dots more explicitly.

Still, that broader systems lens is exactly why the book stands out. It is thoughtful, clear, and deeply practical in the ways that matter over the long run. For anyone in data science or machine learning who wants to understand not just how to build models, but how to build the systems that let those models survive contact with reality, this is an easy book to recommend.

Designing Data-Intensive Applications

Designing Data-Intensive Applications


Links of the Week
  1. 1. How France’s Mistral Built A $14 Billion AI Empire By Not Being American [forbes.com]
  2. 2. AI doom warnings are getting louder. Are they realistic? [www.nature.com]
  3. 3. Designing synthetic datasets for the real world: Mechanism design and reasoning from first principles [research.google]
  4. 4. The Ultimate Guide to Claude Opus 4.7 [productcompass.pm]
  5. 5. Building agents that reach production systems with MCP [claude.com]
  6. 6. Agents CLI in Agent Platform: create to production in one CLI [developers.googleblog.com]
  7. 7. Google WeatherNext 2 — Our most accurate AI weather forecasting technology [deepmind.google]
  8. 8. Scientists stunned by ‘fundamentally new way’ life produces DNA [www.science.org]

Papers of the Week
Video of the Week

Building Large Language Models (LLMs)

Building Large Language Models (LLMs)

All our videos are also available in our YouTube playlist.


Enjoy the newsletter?

Forward it to a friend, or subscribe to get it straight to your inbox.

Subscribe Free
← Back to Newsletter

Subscribe to get our latest content by email.
    We won't send you spam. Unsubscribe at any time.