MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

📰 ArXiv cs.AI

arXiv:2602.12705v4 Announce Type: replace-cross Abstract: We present MedXIAOHE, a medical vision-language foundation model designed to advance general-purpose medical understanding and reasoning in real-world clinical applications. MedXIAOHE achieves state-of-the-art performance across diverse medical benchmarks and surpasses leading closed-source multimodal systems on multiple capabilities. To achieve this, we propose an entity-aware continual pretraining framework that organizes heterogeneous

Published 8 Apr 2026
Read full paper → ← Back to News