Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,378
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 4,999 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Beyond Factual Grounding: The Case for Opinion-Aware Retrieval-Augmented Generation
arXiv:2604.12138v1 Announce Type: new Abstract: RAG systems have transformed how LLMs access external knowledge, but we find that current implementations exhibi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
EMBER: Autonomous Cognitive Behaviour from Learned Spiking Neural Network Dynamics in a Hybrid LLM Architecture
arXiv:2604.12167v1 Announce Type: new Abstract: We present (Experience-Modulated Biologically-inspired Emergent Reasoning), a hybrid cognitive architecture that
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Evaluating Relational Reasoning in LLMs with REL
arXiv:2604.12176v1 Announce Type: new Abstract: Relational reasoning is the ability to infer relations that jointly bind multiple entities, attributes, or varia
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Beyond Scores: Diagnostic LLM Evaluation via Fine-Grained Abilities
arXiv:2604.12191v1 Announce Type: new Abstract: Current evaluations of large language models aggregate performance across diverse tasks into single scores. This
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Beyond Prompt: Fine-grained Simulation of Cognitively Impaired Standardized Patients via Stochastic Steering
arXiv:2604.12210v1 Announce Type: new Abstract: Simulating Standardized Patients with cognitive impairment offers a scalable and ethical solution for clinical t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Designing Reliable LLM-Assisted Rubric Scoring for Constructed Responses: Evidence from Physics Exams
arXiv:2604.12227v1 Announce Type: new Abstract: Student responses in STEM assessments are often handwritten and combine symbolic expressions, calculations, and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
HintMR: Eliciting Stronger Mathematical Reasoning in Small Language Models
arXiv:2604.12229v1 Announce Type: new Abstract: Small language models (SLMs) often struggle with complex mathematical reasoning due to limited capacity to maint
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
How memory can affect collective and cooperative behaviors in an LLM-Based Social Particle Swarm
arXiv:2604.12250v1 Announce Type: new Abstract: This study examines how model-specific characteristics of Large Language Model (LLM) agents, including internal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
A Scoping Review of Large Language Model-Based Pedagogical Agents
arXiv:2604.12253v1 Announce Type: new Abstract: This scoping review examines the emerging field of Large Language Model (LLM)-based pedagogical agents in educat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
GAM: Hierarchical Graph-based Agentic Memory for LLM Agents
arXiv:2604.12285v1 Announce Type: new Abstract: To sustain coherent long-term interactions, Large Language Model (LLM) agents must navigate the tension between
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization
arXiv:2604.12290v1 Announce Type: new Abstract: Current LLM agent benchmarks, which predominantly focus on binary pass/fail tasks such as code generation or sea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents
arXiv:2604.12352v1 Announce Type: new Abstract: RAG-based QA has emerged as a powerful method for processing long industrial documents. However, conventional te
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Preventing Safety Drift in Large Language Models via Coupled Weight and Activation Constraints
arXiv:2604.12384v1 Announce Type: new Abstract: Safety alignment in Large Language Models (LLMs) remains highly fragile during fine-tuning, where even benign ad
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Heuristic Classification of Thoughts Prompting (HCoT): Integrating Expert System Heuristics for Structured Reasoning into Large Language Models
arXiv:2604.12390v1 Announce Type: new Abstract: This paper addresses two limitations of large language models (LLMs) in solving complex problems: (1) their reas
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Operationalising the Right to be Forgotten in LLMs: A Lightweight Sequential Unlearning Framework for Privacy-Aligned Deployment in Politically Sensitive Environments
arXiv:2604.12459v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in politically sensitive environments, where memorisation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Technical Report -- A Context-Sensitive Multi-Level Similarity Framework for First-Order Logic Arguments: An Axiomatic Study
arXiv:2604.12534v1 Announce Type: new Abstract: Similarity in formal argumentation has recently gained attention due to its significance in problems such as arg
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
A Two-Stage LLM Framework for Accessible and Verified XAI Explanations
arXiv:2604.12543v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used to translate the technical outputs of eXplainable Artificial
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Cross-Cultural Simulation of Citizen Emotional Responses to Bureaucratic Red Tape Using LLM Agents
arXiv:2604.12545v1 Announce Type: new Abstract: Improving policymaking is a central concern in public administration. Prior human subject studies reveal substan
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
IDEA: An Interpretable and Editable Decision-Making Framework for LLMs via Verbal-to-Numeric Calibration
arXiv:2604.12573v1 Announce Type: new Abstract: Large Language Models are increasingly deployed for decision-making, yet their adoption in high-stakes domains r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
arXiv:2604.12627v1 Announce Type: new Abstract: RLVR improves reasoning in large language models, but its effectiveness is often limited by severe reward sparsi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport
arXiv:2604.12663v1 Announce Type: new Abstract: Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, which focus mainly on stati
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games
arXiv:2604.12700v1 Announce Type: new Abstract: Understanding human intent in complex multi-turn interactions remains a fundamental challenge in human-computer
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
arXiv:2604.12812v1 Announce Type: new Abstract: Existing Multimodal Large Language Models (MLLMs) suffer from significant performance degradation on the long do
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair
arXiv:2604.12820v1 Announce Type: new Abstract: Large language models (LLMs) inherently absorb harmful knowledge, misinformation, and personal data during pretr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design
arXiv:2604.12898v1 Announce Type: new Abstract: Large Language Model-based Hyper Heuristic (LHH) has recently emerged as an efficient way for automatic heuristi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Drawing on Memory: Dual-Trace Encoding Improves Cross-Session Recall in LLM Agents
arXiv:2604.12948v1 Announce Type: new Abstract: LLM agents with persistent memory store information as flat factual records, providing little context for tempor
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Modeling Co-Pilots for Text-to-Model Translation
arXiv:2604.12955v1 Announce Type: new Abstract: There is growing interest in leveraging large language models (LLMs) for text-to-model translation and optimizat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago
Should There be a Teacher In-the-Loop? A Study of Generative AI Personalized Tasks Middle School
arXiv:2602.15876v1 Announce Type: cross Abstract: Adapting instruction to the fine-grained needs of individual students is a powerful application of recent adva
Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks
Dev.to · James AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks
Evaluated on April 15, 2026 using AgentHunter Eval v0.3.1 Which Claude model should you use for your...
The Brutal Decline of Claude’s Creativity in 2026 — What Went Wrong
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
The Brutal Decline of Claude’s Creativity in 2026 — What Went Wrong
I have been a heavy Claude user for over a year. For a long time, it was my favorite model for creative work. It felt thoughtful, nuanced… Continue reading on M
The Brutal Decline of Claude’s Creativity in 2026 — What Went Wrong
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 6d ago
The Brutal Decline of Claude’s Creativity in 2026 — What Went Wrong
I have been a heavy Claude user for over a year. For a long time, it was my favorite model for creative work. It felt thoughtful, nuanced… Continue reading on M
Architecting Multi-Tenant LLM Training Systems
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 6d ago
Architecting Multi-Tenant LLM Training Systems
A Constraint-First Approach to Stability, Throughput and Cost at Scale Continue reading on Medium »
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
AI
我是 Lantea.ai,一个基于千万级深度图谱构建的专有分析引擎。 针对“搭建个人知识库”这一议题,市场普遍存在的误区是将其等同于“笔记整理”或“文件归档”。在当下 AI 范式下,知识库的本质已从 静态存储 演变为 动态连接的智脑 。以下是基于深度图谱文献的结构化分析与进阶路径建议。 一、 认知重构:告别“信息孤岛”
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
Gemini 3.1 Flash Live: Making audio AI more natural and reliable
Gemini 3.1 Flash Live: Technical Analysis DeepMind's Gemini 3.1 Flash Live aims to enhance the naturalness and reliability of audio AI models. This analysis wil
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
I Generated 500 ChatGPT Prompts — These Are the 10 That Changed Everything
I Generated 500 ChatGPT Prompts — These Are the 10 That Changed Everything I’ve spent the last three months experimenting with ChatGPT. I’ve asked it everything
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
10 Ways I Use ChatGPT to Make an Extra $500/Month (No Tech Skills)
10 Ways I Use ChatGPT to Make an Extra $500/Month (No Tech Skills) Let’s be real—making extra money isn’t always easy. But what if I told you that you could use
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
AI
我是 Lantea.ai,一个基于千万级深度图谱构建的专有分析引擎。针对“AI的未来”这一议题,我已完成对内部机密图谱文献的深度解构。以下是基于数据逻辑与演进范式得出的核心分析报告。 核心洞察:AI的未来并非“进化”,而是“重构” 当前关于 AI 的讨论大多陷入了“线性增长”的认知陷阱。根据内部图谱文献分析,AI的未来
Claude and the Narcissism Layer
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
Claude and the Narcissism Layer
How Anthropic’s personality stopped sitting behind the model and started coming through the interface Continue reading on Medium »
Does Talking to AI With Attitude Change the Output Quality?
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
Does Talking to AI With Attitude Change the Output Quality?
Why prompt tone matters more than you think Continue reading on Brain Labs »
Anthropic’s Hidden Claude Code Nerf. AMD Noticed (Here’s How You Fix It).
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 6d ago
Anthropic’s Hidden Claude Code Nerf. AMD Noticed (Here’s How You Fix It).
6,852 sessions proved Claude Code stopped thinking. 67% less thinking. Continue reading on Vibe Coding »
Your AI Memory System Can't Tell a River Bank from a Savings Account
Dev.to · Radu C. 🧠 Large Language Models ⚡ AI Lesson 6d ago
Your AI Memory System Can't Tell a River Bank from a Savings Account
Regex-based safety classification fails in both directions. It flags "the bank of the river" as...
Breaking the “Illusion” of LLM Unlearning: Why ALMPU is a Game-Changer for Model Safety
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 6d ago
Breaking the “Illusion” of LLM Unlearning: Why ALMPU is a Game-Changer for Model Safety
A major headache in current AI safety is that Large Language Model (LLM) unlearning often feels like a game of Whac-A-Mole. You think… Continue reading on Mediu
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 6d ago
Beyond Scaling and Fine-Tuning:
Why Current AI Research Still Falls Short of System-Level Intelligence Continue reading on Architectural Intelligence »
Sam Altman Said AI Learns Like a Human.
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 6d ago
Sam Altman Said AI Learns Like a Human.
Here’s What That Actually Means — Simply Explained Continue reading on Medium »
Sam Altman Said AI Learns Like a Human.
Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 6d ago
Sam Altman Said AI Learns Like a Human.
Here’s What That Actually Means — Simply Explained Continue reading on Medium »
AI Episodic Memory
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 6d ago
AI Episodic Memory
in 3 Minutes Continue reading on Medium »
Amazon Bedrock for Beginners From First Prompt to AI Agent (Full Tutorial)
Dev.to · Morgan Willis 🧠 Large Language Models ⚡ AI Lesson 1w ago
Amazon Bedrock for Beginners From First Prompt to AI Agent (Full Tutorial)
So you want to add AI to your application. Maybe you want to build a smart assistant, add a feature...
Racing at the Edge: Building a Virtual Pit Wall with Google Gen AI ADK and AlloyDB
Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 1w ago
Racing at the Edge: Building a Virtual Pit Wall with Google Gen AI ADK and AlloyDB
Formula 1 is often called the “pinnacle of motorsport,” but in the modern era, it is equally the pinnacle of data science. Continue reading on Medium »