Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning

📰 ArXiv cs.AI

arXiv:2604.21495v1 Announce Type: cross Abstract: Numerical reasoning over expert-domain tables often exhibits high in-domain accuracy but limited robustness to domain shift. Models trained with supervised fine-tuning (SFT) on specific datasets tend to rely on header-operation shortcuts rather than structural reasoning. We introduce TaNOS, a continual pre-training framework comprising three components: (i) header anonymization to reduce lexical memorization, (ii) operation sketches that provide

Published 25 Apr 2026
Read full paper → ← Back to Reads