ReDef: Do Code Language Models Truly Understand Code Changes for Just-in-Time Software Defect Prediction?

📰 ArXiv cs.AI

ReDef is a benchmark dataset for just-in-time software defect prediction that evaluates how well code language models understand code changes.

Published 6 Apr 2026
Action Steps
  1. Curate a high-confidence dataset of function-level modifications from large-scale projects
  2. Evaluate existing code language models on the ReDef dataset to assess their understanding of code changes
  3. Analyze the results to identify areas where models excel or struggle in predicting defects
  4. Use the insights to fine-tune models and improve their performance in just-in-time software defect prediction
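The evaluation loop in steps 2–3 can be sketched in a few lines. This is a minimal, hypothetical illustration, not ReDef's actual harness: the `CommitSample` type, the `naive_predict` baseline, and the sample diffs are all invented here; a real run would swap in the ReDef samples and a code language model's predictions.

```python
from dataclasses import dataclass

@dataclass
class CommitSample:
    diff: str        # function-level code change (unified-diff text)
    defective: bool  # ground-truth label: did this change introduce a defect?

def evaluate(predict, samples):
    """Accuracy of a defect predictor over labeled commit samples."""
    correct = sum(predict(s.diff) == s.defective for s in samples)
    return correct / len(samples)

# Toy heuristic baseline for illustration only:
# flag any diff that removes an `if` guard as risky.
def naive_predict(diff: str) -> bool:
    return "- if" in diff

samples = [
    CommitSample("- if x is None:\n+ pass", defective=True),
    CommitSample("+ logging.info('ok')", defective=False),
]

print(evaluate(naive_predict, samples))
```

Comparing a model's accuracy against such a trivial baseline is one way to tell whether it has learned anything about the semantics of a change rather than surface patterns.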
Who Needs to Know This

Software engineers and AI researchers can use ReDef to improve the accuracy of defect prediction models and to prioritize risky code changes during code review.

Key Insight

💡 Code language models' ability to understand code changes is crucial for accurate defect prediction, and ReDef provides a high-confidence dataset to evaluate and improve these models

Share This
🚨 ReDef: a new benchmark for just-in-time software defect prediction 🚨
Read full paper →