Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use

View PDF HTML (experimental)

Abstract:Tool use extends large language models beyond parametric knowledge, but reliable execution requires balancing appropriate reasoning depth with strict structural validity. We approach this problem from a case-based perspective to present CAST, a case-driven framework that treats historical execution trajectories as structured cases. Instead of reusing raw exemplar outputs, CAST extracts case-derived signals to identify complexity profiles for estimating optimal reasoning strategies, alongside failure profiles to map likely structural breakdowns. The framework translates this knowledge into a fine-grained reward design and adaptive reasoning, enabling the model to autonomously internalize case-based strategies during reinforcement learning. Experiments on BFCLv2 and ToolBench demonstrate that CAST improves both schema-faithful execution and task-level tool-use success while reducing unnecessary deliberation. The approach achieves up to 5.85 percentage points gain in overall execution accuracy and reduces average reasoning length by 26%, significantly mitigating high-impact structural errors. Ultimately, this demonstrates how historical execution cases can provide reusable adaptation knowledge for calibrated tool use.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2605.15041 [cs.AI]
	(or arXiv:2605.15041v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2605.15041 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Renning Pang [view email]
[v1] Thu, 14 May 2026 16:36:04 UTC (1,413 KB)