Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,541
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,133 reads from curated sources

Apple Just Released iOS 26.5 For Developers, But 1 Major iPhone Feature Is Missing
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago
Apple Just Released iOS 26.5 For Developers, But 1 Major iPhone Feature Is Missing
Another iPhone update has just reached its first developer beta. There was a chance it would include the first glimpse of the brand-new Siri, but so far there’s
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
Five Hundred Copies of the Same Message in Your Agent's Brain
You send your AI agent a message. The upstream model returns a 429 — rate limited, try again later. Your agent framework dutifully retries. And retries. And ret
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
How to Get Cited within AI Searches
4 core pillars to get cited within AI searches You must shift your strategy from traditional SEO to Generative Engine Optimization (GEO). AI engines do not read
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
How We Built an AI Layer That Understands an Entire Agency Workspace (Not Just One Module)
We shipped the AI layer for Kobin today — an agency operating system that replaces Slack, Notion, HubSpot, Linear, and Buffer. This is the technical story of ho
How AI’s capital explosion signals opportunity but also reveals a critical need for measurable ROI and meaningful impact
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
How AI’s capital explosion signals opportunity but also reveals a critical need for measurable ROI and meaningful impact
The current wave of investment in artificial intelligence reflects one of the largest capital shifts in modern technology, yet questions around financial return
I Gave 5 Frontier Models the Same Email Thread. Here's What They Missed.
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 3w ago
I Gave 5 Frontier Models the Same Email Thread. Here's What They Missed.
Five frontier models were given a 31-message email thread. They were asked to tell us what was decided, who owns what, and what changed. None of them got all of
Lightview Earns a 49 Proof of Usefulness Score by Building an AI-Safe UI Toolkit for LLM and Human Collaboration
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 3w ago
Lightview Earns a 49 Proof of Usefulness Score by Building an AI-Safe UI Toolkit for LLM and Human Collaboration
Lightview is an open-source UI toolkit designed to enable safe collaboration between large language models and developers. By introducing a sandboxed computatio
From Pipelines to AI Platforms: How Agentic AI Is Redefining the Role of Data Engineers
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 3w ago
From Pipelines to AI Platforms: How Agentic AI Is Redefining the Role of Data Engineers
This article explains how agentic AI is transforming data engineering by shifting systems from batch-based analytics to real-time, context-driven architectures.
Latest open artifacts (#20): New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
Interconnects 🧠 Large Language Models ⚡ AI Lesson 3w ago
Latest open artifacts (#20): New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
AI chip startup Rebellions raises $400 million at $2.3B valuation in pre-IPO round
The startup, which is planning to go public later this year, designs chips specifically for AI inference, another challenger to Nvidia's dominance.
Macy's 4.75X Shopping Jump Proves AI Can Move The Top Line
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago
Macy's 4.75X Shopping Jump Proves AI Can Move The Top Line
OpenAI abandoned Instant Checkout the same week with conversions at 1/3 retailer site rates. Same AI generation, opposite results: the gap is not about the mode
Import AI 451: Political superintelligence; Google's society of minds, and a robot drummer
Import AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
Import AI 451: Political superintelligence; Google's society of minds, and a robot drummer
Are there any genies that can be put back in the bottle?
Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 3w ago
Why Data Scientists Should Care About Quantum Computing
Sara A. Metwalli on the rise of a promising new technology, the effects of LLM on her work, and more. The post Why Data Scientists Should Care About Quantum Com
Search Engine Journal 🧠 Large Language Models ⚡ AI Lesson 3w ago
Why New Google-Agent May Be A Pivot Related To OpenClaw Trend via @sejournal, @martinibuster
Why Google's new AI user agent may be tied to shift of resources from Project Mariner To Gemini Agent The post Why New Google-Agent May Be A Pivot Related To Op
Textbooks, Not the Internet, Trained This Powerful AI
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 3w ago
Textbooks, Not the Internet, Trained This Powerful AI
phi-1.5 is a 1.3B-parameter Transformer trained mainly on synthetic, textbook-quality data. Despite its small size, it matches or beats much larger models on co
Bluesky’s new Attie app uses AI to give you full control over your social feed
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
Bluesky’s new Attie app uses AI to give you full control over your social feed
The standalone app, built on the AT Protocol and powered by Anthropic’s Claude, was unveiled at the ATmosphere conference by Jay Graber, who stepped back from B
The AI Factory: What It Is And Why Every CEO Should Care
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago
The AI Factory: What It Is And Why Every CEO Should Care
AI factories are emerging as the model for building, deploying and improving AI at scale, and they could become a major source of competitive advantage for comp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments
arXiv:2603.25747v1 Announce Type: new Abstract: The rapid evolution of Large Multimodal Models (LMMs) has enabled agents to perform complex digital and physical
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AutoB2G: A Large Language Model-Driven Agentic Framework For Automated Building-Grid Co-Simulation
arXiv:2603.26005v1 Announce Type: new Abstract: The growing availability of building operational data motivates the use of reinforcement learning (RL), which ca
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play Annotation
arXiv:2603.26266v1 Announce Type: new Abstract: Large vision-language models have endowed GUI agents with strong general capabilities for interface understandin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AIRA_2: Overcoming Bottlenecks in AI Research Agents
arXiv:2603.26499v1 Announce Type: new Abstract: Existing research has identified three structural performance bottlenecks in AI research agents: (1) synchronous
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CADSmith: Multi-Agent CAD Generation with Programmatic Geometric Validation
arXiv:2603.26512v1 Announce Type: new Abstract: Existing methods for text-to-CAD generation either operate in a single pass with no geometric verification or re
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Stabilizing Rubric Integration Training via Decoupled Advantage Normalization
arXiv:2603.26535v1 Announce Type: new Abstract: We propose Process-Aware Policy Optimization (PAPO), a method that integrates process-level evaluation into Grou
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models
arXiv:2603.25750v1 Announce Type: cross Abstract: As the paradigm of AI shifts from text-based LLMs to Speech Language Models (SLMs), there is a growing demand
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy
arXiv:2603.25764v1 Announce Type: cross Abstract: As LLM-based agents are deployed in production systems, understanding their behavioral consistency (whether th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models
arXiv:2603.25766v1 Announce Type: cross Abstract: The integration of Vision-Language-Action (VLA) models into autonomous driving systems offers a unified framew
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
UCAgent: An End-to-End Agent for Block-Level Functional Verification
arXiv:2603.25768v1 Announce Type: cross Abstract: Functional verification remains a critical bottleneck in modern IC development cycles, accounting for approxim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
IncreRTL: Traceability-Guided Incremental RTL Generation under Requirement Evolution
arXiv:2603.25769v1 Announce Type: cross Abstract: Large language models (LLMs) have shown promise in generating RTL code from natural-language descriptions, but
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ReCUBE: Evaluating Repository-Level Context Utilization in Code Generation
arXiv:2603.25770v1 Announce Type: cross Abstract: Large Language Models (LLMs) have recently emerged as capable coding assistants that operate over large codeba
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Empowering Epidemic Response: The Role of Reinforcement Learning in Infectious Disease Control
arXiv:2603.25771v1 Announce Type: cross Abstract: Reinforcement learning (RL), owing to its adaptability to various dynamic systems in many real-world scenarios
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond identifiability: Learning causal representations with few environments and finite samples
arXiv:2603.25796v1 Announce Type: cross Abstract: We provide explicit, finite-sample guarantees for learning causal representations from data with a sublinear n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training
arXiv:2603.25813v1 Announce Type: cross Abstract: We present MAGNET (Model Autonomously Growing Network), a decentralized system for autonomous generation, trai
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
arXiv:2603.25823v1 Announce Type: cross Abstract: Beneath the stunning visual fidelity of modern AIGC models lies a "logical desert", where systems fail tasks t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
A Compression Perspective on Simplicity Bias
arXiv:2603.25839v1 Announce Type: cross Abstract: Deep neural networks exhibit a simplicity bias, a well-documented tendency to favor simple functions over comp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding
arXiv:2603.25841v1 Announce Type: cross Abstract: Current multimodal large language models (MLLMs) cannot effectively utilize eye-gaze information for video und
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Why Safety Probes Catch Liars But Miss Fanatics
arXiv:2603.25861v1 Announce Type: cross Abstract: Activation-based probes have emerged as a promising approach for detecting deceptively aligned AI systems by i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
arXiv:2603.25864v1 Announce Type: cross Abstract: Graphical User Interface (GUI) agents have the potential to assist users in interacting with complex software
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins
arXiv:2603.25898v1 Announce Type: cross Abstract: LLM-assisted modeling holds the potential to rapidly build executable Digital Twins of complex systems from on
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Good Scores, Bad Data: A Metric for Multimodal Coherence
arXiv:2603.25924v1 Announce Type: cross Abstract: Multimodal AI systems are evaluated by downstream task accuracy, but high accuracy does not mean the underlyin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation
arXiv:2603.25931v1 Announce Type: cross Abstract: Flow-matching video generators produce temporally coherent, high-fidelity outputs yet routinely violate elemen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Reinforcing Structured Chain-of-Thought for Video Understanding
arXiv:2603.25942v1 Announce Type: cross Abstract: Multi-modal Large Language Models (MLLMs) show promise in video understanding. However, their reasoning often
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt fo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Policy-Guided World Model Planning for Language-Conditioned Visual Navigation
arXiv:2603.25981v1 Announce Type: cross Abstract: Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants
arXiv:2603.26008v1 Announce Type: cross Abstract: While powerful in image-conditioned generation, multimodal large language models (MLLMs) can display uneven pe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
H-Node Attack and Defense in Large Language Models
arXiv:2603.26045v1 Announce Type: cross Abstract: We present H-Node Adversarial Noise Cancellation (H-Node ANC), a mechanistic framework that identifies, exploi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MuDD: A Multimodal Deception Detection Dataset and GSR-Guided Progressive Distillation for Non-Contact Deception Detection
arXiv:2603.26064v1 Announce Type: cross Abstract: Non-contact automatic deception detection remains challenging because visual and auditory deception cues often
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization
arXiv:2603.26078v1 Announce Type: cross Abstract: Subject-driven text-to-image diffusion models have achieved remarkable success in preserving single identities