KLong: Training LLM Agent for Extremely Long-horizon Tasks

📰 ArXiv cs.AI

KLong is an open-source LLM agent trained for extremely long-horizon tasks using trajectory-splitting SFT and progressive RL training

advanced Published 7 Apr 2026

Action Steps

Activate basic agentic abilities of a base model using a comprehensive SFT recipe
Introduce Research-Factory, an automated pipeline for generating high-quality training data
Utilize trajectory-splitting SFT to cold-start the model
Scale the model via progressive RL training

Who Needs to Know This

AI researchers and engineers working on LLM agents can benefit from KLong's ability to solve complex tasks, and developers can utilize the open-source nature of the project to integrate it into their own applications

Key Insight

💡 KLong's training method enables LLM agents to solve complex tasks with long horizons