📰 Dev.to · pueding
18 articles · Updated every 3 hours · View all reads
All
Articles 92,463Blog Posts 110,488Tech Tutorials 23,238Research Papers 19,242News 14,919
⚡ AI Lessons

Dev.to · pueding
🧠 Large Language Models
⚡ AI Lesson
18h ago
AMD ATOM + ATOMesh: Prefill/decode Disaggregation on ROCm
What: AMD shipped ATOM + ATOMesh, a ROCm-native LLM serving stack whose headline trick is...

Dev.to · pueding
🧠 Large Language Models
⚡ AI Lesson
1d ago
NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling
What: On June 16, 2026, NVIDIA's Blackwell platform posted the fastest time on all seven...

Dev.to · pueding
6d ago
NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark: Trajectory-Replay Benchmarking
What: The AgentPerf benchmark from Artificial Analysis is the first test built for agentic-AI...

Dev.to · pueding
1w ago
NVIDIA RTX Spark Superchip: Unified CPU–GPU Memory
What: NVIDIA's RTX Spark "superchip" (unveiled around Computex / Build 2026) pairs a 20-core Grace...

Dev.to · pueding
1w ago
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
What: Google shipped quantization-aware-trained (QAT) checkpoints for the Gemma 4 family —...

Dev.to · pueding
1w ago
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
What: The MiniMax M3 release — an open-weight model with a 1M-token context and 59% on...

Dev.to · pueding
1w ago
Google Releases DiffusionGemma: Parallel Block Decoding
What: Google released DiffusionGemma, an open-weight model whose headline trick is parallel...

Dev.to · pueding
1w ago
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
What: The AutoLab benchmark scores agents with iterative experiment-loop evaluation — 36 realistic...

Dev.to · pueding
🤖 AI Agents & Automation
⚡ AI Lesson
2w ago
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
What: AgentDoG 1.5, an arXiv preprint posted in May 2026, is a family of small inline guard models...

Dev.to · pueding
🤖 AI Agents & Automation
⚡ AI Lesson
3w ago
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
What: The OmniRetrieval paper introduces source-native query dispatch: a router sends a...

Dev.to · pueding
3w ago
OmniRetrieval: Source-Native Query Dispatch
What: The OmniRetrieval paper introduces source-native query dispatch: a router sends a...

Dev.to · pueding
3w ago
Gemini 3.5 Flash: Agent-First Model Design
What: Gemini 3.5 Flash, announced by Google DeepMind on May 25, 2026, is positioned as an...

Dev.to · pueding
3w ago
Cursor Composer 2.5: Targeted Textual Feedback RL
What: The Cursor Composer 2.5 release blog introduces targeted textual feedback RL — a constructed...

Dev.to · pueding
3w ago
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
What: The Boiling the Frog benchmark is a stateful multi-turn safety eval for tool-using AI agents...

Dev.to · pueding
💻 AI-Assisted Coding
⚡ AI Lesson
4w ago
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
What: The OpenSCAD Pantheon benchmark grades six agentic coding tools — including Antigravity 2.0,...

Dev.to · pueding
4w ago
Camouflage Injection Paper: Camouflage Detection Gap
What: The Domain-Camouflaged Injection paper shows that prompt-injection detectors collapse on...

Dev.to · pueding
1mo ago
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
What: MCP SEP-2468 aligns the MCP authorization flow with RFC 9207: authorization servers can...

Dev.to · pueding
🤖 AI Agents & Automation
⚡ AI Lesson
1mo ago
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
What: The "Is Grep All You Need?" study wires both a literal grep tool and a vector-search tool...
DeepCamp AI