Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,647

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,438 Reads 5,209

Showing 5,209 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection

arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation

arXiv:2601.11702v2 Announce Type: replace-cross Abstract: AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation

arXiv:2601.19072v2 Announce Type: replace-cross Abstract: Large Language models (LLMs) have shown strong capabilities in code review automation, such as review

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making

arXiv:2602.02378v2 Announce Type: replace-cross Abstract: As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

SPARE: Self-distillation for PARameter-Efficient Removal

arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

On Randomness in Agentic Evals

arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Smooth Gate Functions for Soft Advantage Policy Optimization

arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies

arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Exploring Collatz Dynamics with Human-LLM Collaboration

arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies

arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents

arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Geometry-Guided Camera Motion Understanding in VideoLLMs

arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning

arXiv:2603.14867v2 Announce Type: replace-cross Abstract: Many strategic decision-making problems, such as environment design for warehouse robots, can be natur

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

arXiv:2603.15970v3 Announce Type: replace-cross Abstract: Several data warehouse and database providers have recently introduced extensions to SQL called AI Que

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval

arXiv:2603.17872v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evolutionarily Stable Stackelberg Equilibrium

arXiv:2603.18385v2 Announce Type: replace-cross Abstract: We present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We stud

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

arXiv:2603.20957v2 Announce Type: replace-cross Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection

arXiv:2603.21576v2 Announce Type: replace-cross Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of sca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection

arXiv:2603.21853v2 Announce Type: replace-cross Abstract: This paper proposes a novel alternative to existing sim-to-real methods for training control policies

Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model: A Beginner's Guide to Get You Started

Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model: A Beginner's Guide to Get You Started

Qwen3.5-9B-Uncensored-HauhauCS-Aggressive is an uncensored variant of the base model created by Hauhau CS. This 9-billion parameter model removes safety filters

AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Unlocking video insights at scale with Amazon Bedrock multimodal models

In this post, we explore how the multimodal foundation models (FMs) of Amazon Bedrock enable scalable video understanding through three distinct architectural a

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Melania Trump wants a robot to homeschool your child

The first lady sees AI and robotics playing a prominent role in the future of American education.

AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Deploy voice agents with Pipecat and Amazon Bedrock AgentCore Runtime – Part 1

In this series of posts, you will learn how streaming architectures help address these challenges using Pipecat voice agents on Amazon Bedrock AgentCore Runtime

There’s Something Very Dark About a Lot of Those Viral AI Fruit Videos

Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

There’s Something Very Dark About a Lot of Those Viral AI Fruit Videos

With female AI fruit being fart-shamed and even sexually assaulted, there’s a misogynistic undercurrent to the fruit slop microdramas, even as they appear to be

OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage

Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage

In a controlled experiment, OpenClaw agents proved prone to panic and vulnerable to manipulation. They even disabled their own functionality when gaslit by huma

The Verge 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Spotify is letting artists manually approve releases to combat AI fakes

Spotify is beta-testing a new feature called Artist Profile Protection that lets artists review releases before they go live. Sometimes songs end up on the wron

Protecting people from harmful manipulation

DeepMind Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Protecting people from harmful manipulation

Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.

London’s Granola raises $125M to turn meeting recordings into enterprise AI infrastructure

The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

London’s Granola raises $125M to turn meeting recordings into enterprise AI infrastructure

Granola, the London-based AI meeting app that records conversations without dropping a bot into the call, has raised $125 million in a Series C round led by Dan

Why China Is Winning The Open Source AI Race

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Why China Is Winning The Open Source AI Race

China begins to take over the open source AI race as models like DeepSeek and Qwen see greater adoption among developers.

Vibe Coding a Private AI Financial Analyst with Python and Local LLMs

KDnuggets 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Vibe Coding a Private AI Financial Analyst with Python and Local LLMs

Learn to build an AI data analyst with Python: analyzes data, detects anomalies, and generates predictions using local LLMs.

Skills in LangSmith Fleet

LangChain Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Skills in LangSmith Fleet

Fleet now supports shareable skills, so you equip agents across your team with knowledge for specialized tasks.

ZDNet 🧠 Large Language Models ⚡ AI Lesson 1mo ago

5 ways to use AI when your budget is tight

Yes, you can use AI cost-effectively. Here's how professionals are doing it.

The Verge 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Disney’s big bets on the metaverse and AI slop aren’t going so well

Less than a week into his tenure as Disney's newly-appointed CEO, Josh D'Amaro is already dealing with two separate crises that have cast a shadow over the comp

Lyria 3 Pro: Create longer tracks in more

DeepMind Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Lyria 3 Pro: Create longer tracks in more

Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.

Build with Lyria 3, our newest music generation model

Google AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Build with Lyria 3, our newest music generation model

Lyria 3 is now available in paid preview through the Gemini API and for testing in Google AI Studio.

Lyria 3 Pro: Create longer tracks in more Google products

Google AI Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Lyria 3 Pro: Create longer tracks in more Google products

We are bringing Lyria 3 to the tools where professionals work and create every day.

The Billion-Dollar Robot Race Is Moving Faster Than The Robots

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago

The Billion-Dollar Robot Race Is Moving Faster Than The Robots

Humanoid robots are attracting capital at a pace the underlying technology cannot yet justify. Between viral dance performances, stratospheric valuations, and f

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Harvey confirms $11B valuation: Sequoia triples down

Investors like Sequoia, Andreessen Horowitz, Kleiner Perkins, and Elad Gil can't get enough of AI legal tech startup Harvey.

This Skill Makes AI Coding Work: Navigating Context Engineering in 2026

Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago

This Skill Makes AI Coding Work: Navigating Context Engineering in 2026

context engineering is about designing the information ecosystem that the model has access to when it processes your request. Faros AI identified eight layers o

OpenAI Enters Its Focus Era by Killing Sora

Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

OpenAI Enters Its Focus Era by Killing Sora

As the ChatGPT-maker eyes an IPO, it's ditching Sora in favor of a unified AI assistant and enterprise coding tools.

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Granola raises $125M, hits $1.5B valuation as it expands from meeting notetaker to enterprise AI app

Granola's valuation jumped from $250 million to $1.5 billion with this round, and it has added more support for AI agents after users previously complained.

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Meta turns to AI to make shopping easier on Instagram and Facebook

Meta is using generative AI to provide more product and brand information to consumers when they're shopping in its apps.

OpenAI Sora is gone. The artists are still working.

The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

OpenAI Sora is gone. The artists are still working.

Last September, when OpenAI quietly released the Sora 2 app to the public, the discourse around it was not quiet at all. Commentators who had spent months watch

5 Practical Techniques to Detect and Mitigate LLM Hallucinations Beyond Prompt Engineering

Machine Learning Mastery 🧠 Large Language Models ⚡ AI Lesson 1mo ago

5 Practical Techniques to Detect and Mitigate LLM Hallucinations Beyond Prompt Engineering

My friend who is a developer once asked an LLM to generate documentation for a payment API.