Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,209 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation
arXiv:2601.11702v2 Announce Type: replace-cross Abstract: AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation
arXiv:2601.19072v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown strong capabilities in code review automation, such as review
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
arXiv:2602.02378v2 Announce Type: replace-cross Abstract: As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement with
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
SPARE: Self-distillation for PARameter-Efficient Removal
arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
On Randomness in Agentic Evals
arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Smooth Gate Functions for Soft Advantage Policy Optimization
arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies
arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings
arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Exploring Collatz Dynamics with Human-LLM Collaboration
arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies
arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents
arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Geometry-Guided Camera Motion Understanding in VideoLLMs
arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
arXiv:2603.14867v2 Announce Type: replace-cross Abstract: Many strategic decision-making problems, such as environment design for warehouse robots, can be natur
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models
arXiv:2603.15970v3 Announce Type: replace-cross Abstract: Several data warehouse and database providers have recently introduced extensions to SQL called AI Que
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
arXiv:2603.17872v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Evolutionarily Stable Stackelberg Equilibrium
arXiv:2603.18385v2 Announce Type: replace-cross Abstract: We present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We stud
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Alignment Whack-a-Mole: Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models
arXiv:2603.20957v2 Announce Type: replace-cross Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection
arXiv:2603.21576v2 Announce Type: replace-cross Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of sca
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection
arXiv:2603.21853v2 Announce Type: replace-cross Abstract: This paper proposes a novel alternative to existing sim-to-real methods for training control policies

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model: A Beginner's Guide to Get You Started
Qwen3.5-9B-Uncensored-HauhauCS-Aggressive is an uncensored variant of the base model created by Hauhau CS. This 9-billion parameter model removes safety filters
AWS Machine Learning
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Unlocking video insights at scale with Amazon Bedrock multimodal models
In this post, we explore how the multimodal foundation models (FMs) of Amazon Bedrock enable scalable video understanding through three distinct architectural a
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Melania Trump wants a robot to homeschool your child
The first lady sees AI and robotics playing a prominent role in the future of American education.
AWS Machine Learning
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Deploy voice agents with Pipecat and Amazon Bedrock AgentCore Runtime – Part 1
In this series of posts, you will learn how streaming architectures help address these challenges using Pipecat voice agents on Amazon Bedrock AgentCore Runtime

Wired AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
There’s Something Very Dark About a Lot of Those Viral AI Fruit Videos
With female AI fruit being fart-shamed and even sexually assaulted, there’s a misogynistic undercurrent to the fruit slop microdramas, even as they appear to be

Wired AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage
In a controlled experiment, OpenClaw agents proved prone to panic and vulnerable to manipulation. They even disabled their own functionality when gaslit by huma
The Verge
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Spotify is letting artists manually approve releases to combat AI fakes
Spotify is beta-testing a new feature called Artist Profile Protection that lets artists review releases before they go live. Sometimes songs end up on the wron
DeepMind Blog
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Protecting people from harmful manipulation
Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.

The Next Web AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
London’s Granola raises $125M to turn meeting recordings into enterprise AI infrastructure
Granola, the London-based AI meeting app that records conversations without dropping a bot into the call, has raised $125 million in a Series C round led by Dan

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Why China Is Winning The Open Source AI Race
China begins to take over the open source AI race as models like DeepSeek and Qwen see greater adoption among developers.

KDnuggets
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Vibe Coding a Private AI Financial Analyst with Python and Local LLMs
Learn to build an AI data analyst with Python: analyzes data, detects anomalies, and generates predictions using local LLMs.

LangChain Blog
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Skills in LangSmith Fleet
Fleet now supports shareable skills, so you equip agents across your team with knowledge for specialized tasks.
ZDNet
🧠 Large Language Models
⚡ AI Lesson
1mo ago
5 ways to use AI when your budget is tight
Yes, you can use AI cost-effectively. Here's how professionals are doing it.
The Verge
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Disney’s big bets on the metaverse and AI slop aren’t going so well
Less than a week into his tenure as Disney's newly-appointed CEO, Josh D'Amaro is already dealing with two separate crises that have cast a shadow over the comp
DeepMind Blog
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Lyria 3 Pro: Create longer tracks in more
Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.

Google AI Blog
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Build with Lyria 3, our newest music generation model
Lyria 3 is now available in paid preview through the Gemini API and for testing in Google AI Studio.
Google AI Blog
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Lyria 3 Pro: Create longer tracks in more Google products
We are bringing Lyria 3 to the tools where professionals work and create every day.

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
1mo ago
The Billion-Dollar Robot Race Is Moving Faster Than The Robots
Humanoid robots are attracting capital at a pace the underlying technology cannot yet justify. Between viral dance performances, stratospheric valuations, and f
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Harvey confirms $11B valuation: Sequoia triples down
Investors like Sequoia, Andreessen Horowitz, Kleiner Perkins, and Elad Gil can't get enough of AI legal tech startup Harvey.

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
1mo ago
This Skill Makes AI Coding Work: Navigating Context Engineering in 2026
Context engineering is about designing the information ecosystem that the model has access to when it processes your request. Faros AI identified eight layers o

Wired AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
OpenAI Enters Its Focus Era by Killing Sora
As the ChatGPT-maker eyes an IPO, it's ditching Sora in favor of a unified AI assistant and enterprise coding tools.
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Granola raises $125M, hits $1.5B valuation as it expands from meeting notetaker to enterprise AI app
Granola's valuation jumped from $250 million to $1.5 billion with this round, and it has added more support for AI agents after users previously complained.
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Meta turns to AI to make shopping easier on Instagram and Facebook
Meta is using generative AI to provide more product and brand information to consumers when they're shopping in its apps.

The Next Web AI
🧠 Large Language Models
⚡ AI Lesson
1mo ago
OpenAI Sora is gone. The artists are still working.
Last September, when OpenAI quietly released the Sora 2 app to the public, the discourse around it was not quiet at all. Commentators who had spent months watch

Machine Learning Mastery
🧠 Large Language Models
⚡ AI Lesson
1mo ago
5 Practical Techniques to Detect and Mitigate LLM Hallucinations Beyond Prompt Engineering
My friend who is a developer once asked an LLM to generate documentation for a payment API.
DeepCamp AI