📰 Towards Data Science

28 articles · Updated every 3 hours · View all reads

Getting reliable, readable responses out of your LLM, and knowing which tool to reach for The post Structured Outputs with LLMs: JSON Mode, Function Calling, an

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 6d ago

You Probably Don’t Need an Agent Framework

Most LLM applications need a clear workflow, not an autonomous agent. Here's how to build one in plain Python. The post You Probably Don’t Need an Agent Framewo

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 6d ago

What the Question Parser Extracts from a User String: Keywords, Scope, Shape, Decomposition, Clarification

Enterprise Document Intelligence [Vol.1 #6b] - The five field families the parser reads straight from the user’s question, with the code that fills each one The

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago

How to Effectively Align with Claude Code

Increase productivity with your LLMs The post How to Effectively Align with Claude Code appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago

4 Lines You Should Include in Your Claude Skill

Without these, Claude will be confidently wrong. The post 4 Lines You Should Include in Your Claude Skill appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago

Vision LLMs are PDF Parsers Too: Reading Charts and Diagrams for RAG

Enterprise Document Intelligence [Vol.1 #5quater] - The other parsers read the words on a page. A vision model also reads the pictures The post Vision LLMs are

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago

GPU Time-Slicing for Concurrent LLM Agents on Kubernetes

A systems-level deep dive into the hidden microarchitectural costs of Kubernetes GPU time-slicing, and what it actually costs to co-locate Agentic AI workloads.

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago

Is Language Visual? An Experiment with Chinese Characters

A story about a broken printer, visual inductive bias, and why the race endedin a tie. The post Is Language Visual? An Experiment with Chinese Characters appear

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 2w ago

Increase Recommendation Systems’ Precision with LLMs, Using Python

This is how LLMs are used today to increase precision in recommendation systems The post Increase Recommendation Systems’ Precision with LLMs, Using Python appe

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 2w ago

Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model

In Part 1 of this series, we introduced Chronos-2, a time-series foundation model. We got our hands dirty by walking through a real case study and saw what Chro

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 2w ago

Is an Online Master’s Degree in AI a Good Idea?

A look at the real-world value of online graduate AI programs, combining hard data with firsthand experience of a big tech machine learning engineer The post Is

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 3w ago

Baseline Enterprise RAG, From PDF to Highlighted Answer

Enterprise Document Intelligence [Vol. 1 #1] The smallest version of RAG that actually works, on a real PDF, with grounded answers and the source lines highligh

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 3w ago

EmoNet: Speaker-Aware Transformers for Emotion Recognition — and What I’d Build Differently in 2026

A retrospective on my MS thesis, the leaderboard it placed on, and the LLM shift that has reshaped the field since. The post EmoNet: Speaker-Aware Transformers

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

LLM Themes Are Not Observations

A practitioner's warning about generated variables in causal analysis The post LLM Themes Are Not Observations appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Prompt Engineering Isn’t Enough — I Built a Control Layer That Works in Production

Most LLM failures in production aren’t random — they’re predictable. I kept hitting broken JSON, silent failures, and outages that froze my entire app. Prompt e

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Can LLMs Replace Survey Respondents?

How unlearning fixes mode collapse in synthetic survey replies The post Can LLMs Replace Survey Respondents? appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Grounding LLMs with Fresh Web Data to Reduce Hallucinations

Why production LLM systems need live web search to overcome knowledge cutoffs and stale training data The post Grounding LLMs with Fresh Web Data to Reduce Hall

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

Most LLM evaluation systems rely on vague scoring and human judgment disguised as metrics. I built a lightweight evaluation layer in pure Python that turns LLM

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

How I Continually Improve My Claude Code

Learn how to make your Claude Code improve over time The post How I Continually Improve My Claude Code appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Why My Coding Assistant Started Replying in Korean When I Typed Chinese

From a Chinese prompt to a Korean response: an embedding-space investigation into how code vocabulary reshapes language The post Why My Coding Assistant Started

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Stop Evaluating LLMs with “Vibe Checks”

How to build a decision-grade scorecard for AI agents The post Stop Evaluating LLMs with “Vibe Checks” appeared first on Towards Data Science .

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

I Built the Same B2B Document Extractor Twice: Rules vs. LLM

A practical comparison between rule-based PDF extraction using “pytesseract” and an LLM-based approach with “Ollama” and “LLaMA 3”, based on a realistic B2B ord

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning

Why learn 8 scripts when you can learn 256 bytes? The post Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning appeared first on Tow

Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 2mo ago

From Ad Hoc Prompting to Repeatable AI Workflows with Claude Code Skills

How I turned LLM persona interviews into a repeatable customer research workflow The post From Ad Hoc Prompting to Repeatable AI Workflows with Claude Code Skil