✕ Clear all filters
18 articles

📰 Dev.to · pueding

18 articles · Updated every 3 hours · View all reads

All Articles 92,463Blog Posts 110,488Tech Tutorials 23,238Research Papers 19,242News 14,919 ⚡ AI Lessons
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
Dev.to · pueding 1w ago
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
What: Google shipped quantization-aware-trained (QAT) checkpoints for the Gemma 4 family —...
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
Dev.to · pueding 1w ago
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
What: The MiniMax M3 release — an open-weight model with a 1M-token context and 59% on...
Google Releases DiffusionGemma: Parallel Block Decoding
Dev.to · pueding 1w ago
Google Releases DiffusionGemma: Parallel Block Decoding
What: Google released DiffusionGemma, an open-weight model whose headline trick is parallel...
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
Dev.to · pueding 1w ago
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
What: The AutoLab benchmark scores agents with iterative experiment-loop evaluation — 36 realistic...
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
Dev.to · pueding 🤖 AI Agents & Automation ⚡ AI Lesson 2w ago
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
What: AgentDoG 1.5, an arXiv preprint posted in May 2026, is a family of small inline guard models...
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
Dev.to · pueding 🤖 AI Agents & Automation ⚡ AI Lesson 3w ago
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
What: The OmniRetrieval paper introduces source-native query dispatch: a router sends a...
OmniRetrieval: Source-Native Query Dispatch
Dev.to · pueding 3w ago
OmniRetrieval: Source-Native Query Dispatch
What: The OmniRetrieval paper introduces source-native query dispatch: a router sends a...
Gemini 3.5 Flash: Agent-First Model Design
Dev.to · pueding 3w ago
Gemini 3.5 Flash: Agent-First Model Design
What: Gemini 3.5 Flash, announced by Google DeepMind on May 25, 2026, is positioned as an...
Cursor Composer 2.5: Targeted Textual Feedback RL
Dev.to · pueding 3w ago
Cursor Composer 2.5: Targeted Textual Feedback RL
What: The Cursor Composer 2.5 release blog introduces targeted textual feedback RL — a constructed...
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
Dev.to · pueding 3w ago
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
What: The Boiling the Frog benchmark is a stateful multi-turn safety eval for tool-using AI agents...
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
Dev.to · pueding 💻 AI-Assisted Coding ⚡ AI Lesson 4w ago
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
What: The OpenSCAD Pantheon benchmark grades six agentic coding tools — including Antigravity 2.0,...
Camouflage Injection Paper: Camouflage Detection Gap
Dev.to · pueding 4w ago
Camouflage Injection Paper: Camouflage Detection Gap
What: The Domain-Camouflaged Injection paper shows that prompt-injection detectors collapse on...
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
Dev.to · pueding 1mo ago
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
What: MCP SEP-2468 aligns the MCP authorization flow with RFC 9207: authorization servers can...
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
Dev.to · pueding 🤖 AI Agents & Automation ⚡ AI Lesson 1mo ago
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
What: The "Is Grep All You Need?" study wires both a literal grep tool and a vector-search tool...