📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AI Forbes Innovation OpenAI News Dev.to AI Hugging Face Blog Hackernoon

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Eyla: Toward an Identity-Anchored LLM Architecture with Integrated Biological Priors -- Vision, Implementation Attempt, and Lessons from AI-Assisted Development

arXiv:2604.00009v1 Announce Type: cross Abstract: We present the design rationale, implementation attempt, and failure analysis of Eyla, a proposed identity-anc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Can LLMs Perceive Time? An Empirical Investigation

arXiv:2604.00010v1 Announce Type: cross Abstract: Large language models cannot estimate how long their own tasks take. We investigate this limitation through fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Quantifying Gender Bias in Large Language Models: When ChatGPT Becomes a Hiring Manager

arXiv:2604.00011v1 Announce Type: cross Abstract: The growing prominence of large language models (LLMs) in daily life has heightened concerns that LLMs exhibit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms

arXiv:2604.00012v1 Announce Type: cross Abstract: Despite the impressive performance of general-purpose large language models (LLMs), they often require fine-tu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MSA-Thinker: Discrimination-Calibration Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis

arXiv:2604.00013v1 Announce Type: cross Abstract: Multimodal sentiment analysis aims to understand human emotions by integrating textual, auditory, and visual m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Are they human? Detecting large language models by probing human memory constraints

arXiv:2604.00016v1 Announce Type: cross Abstract: The validity of online behavioral research relies on study participants being human rather than machine. In th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Think Twice Before You Write -- an Entropy-based Decoding Strategy to Enhance LLM Reasoning

arXiv:2604.00018v1 Announce Type: cross Abstract: Decoding strategies play a central role in shaping the reasoning ability of large language models (LLMs). Trad

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Chronicles of RiDiC: Generating Datasets with Controlled Popularity Distribution for Long-form Factuality Evaluation

arXiv:2604.00019v1 Announce Type: cross Abstract: We present a configurable pipeline for generating multilingual sets of entities with specified characteristics

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

How Do Language Models Process Ethical Instructions? Deliberation, Consistency, and Other-Recognition Across Four Models

arXiv:2604.00021v1 Announce Type: cross Abstract: Alignment safety research assumes that ethical instructions improve model behavior, but how language models in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Criterion Validity of LLM-as-Judge for Business Outcomes in Conversational Commerce

arXiv:2604.00022v1 Announce Type: cross Abstract: Multi-dimensional rubric-based dialogue evaluation is widely used to assess conversational AI, yet its criteri

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics

arXiv:2604.00024v1 Announce Type: cross Abstract: Large language models are increasingly used for medical guidance, but women's health remains under-evaluated i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Brevity Constraints Reverse Performance Hierarchies in Language Models

arXiv:2604.00025v1 Announce Type: cross Abstract: Standard evaluation protocols reveal a counterintuitive phenomenon: on 7.7% of benchmark problems spanning fiv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

"Who Am I, and Who Else Is Here?" Behavioral Differentiation Without Role Assignment in Multi-Agent LLM Systems

arXiv:2604.00026v1 Announce Type: cross Abstract: When multiple large language models interact in a shared conversation, do they develop differentiated social r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

When and Where: A Model Hippocampal Network Unifies Formation of Time Cells and Place Cells

arXiv:2604.00036v1 Announce Type: cross Abstract: Hippocampal place and time cells encode spatial and temporal aspects of experience. Both have the same neural

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

DriftScript: A Domain-Specific Language for Programming Non-Axiomatic Reasoning Agents

arXiv:2604.00043v1 Announce Type: cross Abstract: Non-Axiomatic Reasoning Systems (NARS) provide a framework for building adaptive agents that operate under ins

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago

Whittaker-Henderson smoother for long satellite image time series interpolation

arXiv:2604.00048v1 Announce Type: cross Abstract: Whittaker smoother is a widely adopted solution to pre-process satellite image time series. Yet, two key limit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Task-Centric Personalized Federated Fine-Tuning of Language Models

arXiv:2604.00050v1 Announce Type: cross Abstract: Federated Learning (FL) has emerged as a promising technique for training language models on distributed and p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Energy Footprint of LLM-Based Environmental Analysis: LLMs and Domain Products

arXiv:2604.00053v1 Announce Type: cross Abstract: As large language models (LLMs) are increasingly used in domain-specific applications, including climate chang

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning

arXiv:2604.00057v1 Announce Type: cross Abstract: Soccer commentary plays a crucial role in enhancing the soccer game viewing experience for audiences. Previous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

GenoBERT: A Language Model for Accurate Genotype Imputation

arXiv:2604.00058v1 Announce Type: cross Abstract: Genotype imputation enables dense variant coverage for genome-wide association and risk-prediction studies, ye

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Temporal Memory for Resource-Constrained Agents: Continual Learning via Stochastic Compress-Add-Smooth

arXiv:2604.00067v1 Announce Type: cross Abstract: An agent that operates sequentially must incorporate new experience without forgetting old experience, under a

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago

Perspective: Towards sustainable exploration of chemical spaces with machine learning

arXiv:2604.00069v1 Announce Type: cross Abstract: Artificial intelligence is transforming molecular and materials science, but its growing computational and dat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Brain MR Image Synthesis with Multi-contrast Self-attention GAN

arXiv:2604.00070v1 Announce Type: cross Abstract: Accurate and complete multi-modal Magnetic Resonance Imaging (MRI) is essential for neuro-oncological assessme

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 1w ago

Empirical Validation of the Classification-Verification Dichotomy for AI Safety Gates

arXiv:2604.00072v1 Announce Type: cross Abstract: Can classifier-based safety gates maintain reliable oversight as AI systems improve over hundreds of iteration