Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,695
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,253 reads from curated sources

AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Evaluating AI agents for production: A practical guide to Strands Evals
In this post, we show how to evaluate AI agents systematically using Strands Evals. We walk through the core concepts, built-in evaluators, multi-turn simulatio
Polly is generally available everywhere you work in LangSmith
LangChain Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Polly is generally available everywhere you work in LangSmith
Debugging agents is different from debugging anything else you've built. Traces run hundreds of steps deep, prompts span thousands of lines, and when something
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Build an AI-Powered A/B testing engine using Amazon Bedrock
This post shows you how to build an AI-powered A/B testing engine using Amazon Bedrock, Amazon Elastic Container Service, Amazon DynamoDB, and th
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Migrate from Amazon Nova 1 to Amazon Nova 2 on Amazon Bedrock
In this post, you will learn how to migrate from Nova 1 to Nova 2 on Amazon Bedrock. We cover model mapping, API changes, code examples using the Converse API,
InfoQ AI/ML 🧠 Large Language Models ⚡ AI Lesson 1mo ago
QCon London 2026: Rewriting All of Spotify's Code Base, All the Time
At QCon London 2026, Spotify's Jo Kelly-Fenton and Aleksandar Mitic discussed Honk, an AI-powered coding agent that enables code migrations across Spotify's cod
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The PhD students who became the judges of the AI industry
Artificial intelligence models are multiplying fast, and competition is stiff. With so many players crowding the space, which one will be the best — and who dec
Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The New Experience of Coding with AI
The seduction of AI code assistants The post The New Experience of Coding with AI appeared first on Towards Data Science .
InfoQ AI/ML 🧠 Large Language Models ⚡ AI Lesson 1mo ago
HubSpot’s Sidekick: Multi-Model AI Code Review with 90% Faster Feedback and 80% Engineer Approval
HubSpot engineers introduced Sidekick, an internal AI powered code review system that analyzes pull requests using large language models and filters feedback th
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
DOD says Anthropic’s ‘red lines’ make it an ‘unacceptable risk to national security’
The Defense Department said concerns that Anthropic might "attempt to disable its technology" during "warfighting operations" validate its decision to label the
GPT 5.4 is a big step for Codex
Interconnects 🧠 Large Language Models ⚡ AI Lesson 1mo ago
GPT 5.4 is a big step for Codex
On evaluating and understanding the frontier of agents, and why I still turn to Claude.
From Simulation to Production: How to Build Robots With AI
NVIDIA AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
From Simulation to Production: How to Build Robots With AI
The latest open models and frameworks from NVIDIA bring together simulation, robot learning and embedded compute to accelerate cloud-to-robot workflows.
The Download: The Pentagon’s new AI plans, and next-gen nuclear reactors
MIT Technology Review 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Download: The Pentagon’s new AI plans, and next-gen nuclear reactors
This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. The Pentagon is planni
7 Ways to Reduce Hallucinations in Production LLMs
KDnuggets 🧠 Large Language Models ⚡ AI Lesson 1mo ago
7 Ways to Reduce Hallucinations in Production LLMs
Most LLM hallucination fixes fail. Here is what actually works in production.
NVIDIA AI Open-Sources ‘OpenShell’: A Secure Runtime Environment for Autonomous AI Agents
MarkTechPost 🧠 Large Language Models ⚡ AI Lesson 1mo ago
NVIDIA AI Open-Sources ‘OpenShell’: A Secure Runtime Environment for Autonomous AI Agents
The deployment of autonomous AI agents—systems capable of using tools and executing code—presents a unique security challenge. While standard LLM applications a
ServiceNow Research Introduces EnterpriseOps-Gym: A High-Fidelity Benchmark Designed to Evaluate Agentic Planning in Realistic Enterprise Settings
MarkTechPost 🧠 Large Language Models ⚡ AI Lesson 1mo ago
ServiceNow Research Introduces EnterpriseOps-Gym: A High-Fidelity Benchmark Designed to Evaluate Agentic Planning in Realistic Enterprise Settings
Large language models (LLMs) are transitioning from conversational to autonomous agents capable of executing complex professional workflows. However, their depl
Justice Department Says Anthropic Can’t Be Trusted With Warfighting Systems
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Justice Department Says Anthropic Can’t Be Trusted With Warfighting Systems
In response to Anthropic’s lawsuit, the government said it lawfully penalized the company for trying to limit how its Claude AI models could be used by the mili
Unsloth AI Releases Unsloth Studio: A Local No-Code Interface For High-Performance LLM Fine-Tuning With 70% Less VRAM Usage
MarkTechPost 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Unsloth AI Releases Unsloth Studio: A Local No-Code Interface For High-Performance LLM Fine-Tuning With 70% Less VRAM Usage
The transition from a raw dataset to a fine-tuned Large Language Model (LLM) traditionally involves significant infrastructure overhead, including CUDA environm
MIT Technology Review 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Pentagon is planning for AI companies to train on classified data, defense official says
The Pentagon is discussing plans to set up secure environments for generative AI companies to train military-specific versions of their models on classified dat
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise
Mistral Forge lets enterprises train custom AI models from scratch on their own data, challenging rivals that rely on fine-tuning and retrieval-based approaches
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Why Garry Tan’s Claude Code setup has gotten so much love, and hate
Thousands of people are trying Garry Tan's Claude Code setup, which was shared on GitHub. And everyone has an opinion: even Claude, ChatGPT, and Gemini.
Ranking Engineer Agent (REA): The Autonomous AI Agent Accelerating Meta’s Ads Ranking Innovation
Engineering at Meta 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Ranking Engineer Agent (REA): The Autonomous AI Agent Accelerating Meta’s Ads Ranking Innovation
Meta’s Ranking Engineer Agent (REA) autonomously executes key steps across the end-to-end machine learning (ML) lifecycle for ads ranking models. This post cove
Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago
How to Effectively Review Claude Code Output
Get more out of your coding agents by making reviewing more efficient The post How to Effectively Review Claude Code Output appeared first on Towards Data Scien
GEO Best Practices: Prompt Volume Shouldn’t Drive Your Strategy
Neil Patel Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
GEO Best Practices: Prompt Volume Shouldn’t Drive Your Strategy
Most advice on generative engine optimization best practices starts in the same place: find the prompts people are using with AI tools, track which ones give yo
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Pentagon is developing alternatives to Anthropic, report says
After their dramatic falling-out, it doesn't seem as though Anthropic and the Pentagon are getting back together.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
BuzzFeed debuts AI slop apps in bid for new revenue
BuzzFeed unveiled new AI-powered social apps at SXSW, but its demos drew muted reactions.
NVIDIA, Telecom Leaders Build AI Grids to Optimize Inference on Distributed Networks
NVIDIA AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
NVIDIA, Telecom Leaders Build AI Grids to Optimize Inference on Distributed Networks
As AI‑native applications scale to more users, agents and devices, the telecommunications network is becoming the next frontier for distributing AI. At NVIDIA G
Measuring progress toward AGI: A cognitive framework
DeepMind Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Measuring progress toward AGI: A cognitive framework
We’re introducing a framework to measure progress toward AGI, and launching a Kaggle hackathon to build the relevant evaluations.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Google’s Personal Intelligence feature is expanding to all US users
Personal Intelligence allows Google's AI assistant to tap into your Google ecosystem, such as Gmail and Google Photos, to provide more tailored responses.
Bringing the power of Personal Intelligence to more people
Google AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Bringing the power of Personal Intelligence to more people
We're expanding Personal Intelligence across AI Mode in Search, the Gemini app and Gemini in Chrome.
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
AWS AI League: Atos fine-tunes approach to AI education
In this post, we’ll explore how Atos used the AWS AI League to help accelerate AI education across 400+ participants, highlight the tangible benefits of gamifie
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI expands government footprint with AWS deal, report says
OpenAI has reportedly signed a partnership with AWS to sell its AI systems to the U.S. government for classified and unclassified work, marking an expansion bey
Open SWE: An Open-Source Framework for Internal Coding Agents
LangChain Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Open SWE: An Open-Source Framework for Internal Coding Agents
Built on Deep Agents and LangGraph, Open SWE provides the core architectural components for internal coding agents.
GTC Spotlights NVIDIA RTX PCs and DGX Sparks Running Latest Open Models and AI Agents Locally
NVIDIA AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
GTC Spotlights NVIDIA RTX PCs and DGX Sparks Running Latest Open Models and AI Agents Locally
The paradigm of consumer computing has revolved around the concept of a personal device — from PCs to smartphones and tablets. Now, generative AI — particularly
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Niv-AI exits stealth to wring more power performance out of GPUs
The company raised $12 million in seed funding to measure and manage GPU power surges.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Holotron-12B - High Throughput Computer Use Agent
The Download: OpenAI’s US military deal, and Grok’s CSAM lawsuit
MIT Technology Review 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Download: OpenAI’s US military deal, and Grok’s CSAM lawsuit
This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. Where OpenAI’s technol
The Evolution From Prompt Engineering to Concept Engineering
KDnuggets 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Evolution From Prompt Engineering to Concept Engineering
Learn about a future-proof shift from fragile prompt strings to reusable, testable building blocks.
Sears Exposed AI Chatbot Phone Calls and Text Chats to Anyone on the Web
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Sears Exposed AI Chatbot Phone Calls and Text Chats to Anyone on the Web
Customer conversations with chatbots can include contact information and personal details that make it easier for scammers to launch phishing attacks and commit
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Introducing GPT-5.4 mini and nano
GPT-5.4 mini and nano are smaller, faster versions of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent workloads.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first
OpenAI Japan announces the Japan Teen Safety Blueprint, introducing stronger age protections, parental controls, and well-being safeguards for teens using gener
Stratechery 🧠 Large Language Models ⚡ AI Lesson 1mo ago
An Interview with Nvidia CEO Jensen Huang About Accelerated Computing
An interview with Nvidia CEO Jensen Huang about his GTC 2026 keynote, navigating China and DC, and remembering Nvidia's true nature.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Nvidia’s version of OpenClaw could solve its biggest problem: security
Nvidia announced an open enterprise AI agent platform, called NemoClaw, that is built off of viral OpenClaw.
LangChain Announces Enterprise Agentic AI Platform Built with NVIDIA
LangChain Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
LangChain Announces Enterprise Agentic AI Platform Built with NVIDIA
Comprehensive agent engineering platform combined with NVIDIA AI enables enterprises to build, deploy, and monitor production-grade AI agents at scale Press Rel
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Warren presses Pentagon over decision to grant xAI access to classified networks
Sen. Elizabeth Warren noted that Grok, xAI's controversial chatbot, has created harmful outputs for users and poses a potential national security risk.
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
AWS and NVIDIA deepen strategic collaboration to accelerate AI from pilot to production
Today at NVIDIA GTC 2026, AWS and NVIDIA announced an expanded collaboration with new technology integrations to support growing AI compute demand and help you
Roche Scales NVIDIA AI Factories Globally to Accelerate Drug Discovery, Diagnostic Solutions and Manufacturing Breakthroughs
NVIDIA AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Roche Scales NVIDIA AI Factories Globally to Accelerate Drug Discovery, Diagnostic Solutions and Manufacturing Breakthroughs
Roche's new deployment spans more than 3,500 NVIDIA Blackwell GPUs across its worldwide operations and embedded across the entire value chain, massively scaling
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Memories AI is building the visual memory layer for wearables and robotics
Memories.ai is building a large visual memory model that can index and retrieve video-recorded memories for physical AI.
NVIDIA DSX Air Boosts Time to Token With Accelerated Simulation for AI Factories
NVIDIA AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
NVIDIA DSX Air Boosts Time to Token With Accelerated Simulation for AI Factories
Setting up AI factories in simulation — decreasing deployment time from months to days — is accelerating the next industrial revolution. Nowhere was that more a