PAPER2025-12-16·arXiv 2512.14474

Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling

Annu Rana, Gaurav Kumar

COMPILED NOTES

Two-phase reasoning: LLMs construct explicit problem models before generating solutions. Reduces constraint violations vs CoT and ReAct across five planning domains.

Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling

Abstract

Large language models often struggle with complex multi-step planning tasks, showing high rates of constraint violations and inconsistent solutions. The authors propose Model-First Reasoning (MFR), a two-phase paradigm where the LLM first constructs an explicit problem representation — entities, state variables, actions, constraints — before generating any solution. The core claim is that many LLM planning failures stem from representational deficiencies rather than reasoning limitations.

Key Contributions

MFR paradigm — separates problem modeling from solution generation
Explicit state tracking instead of implicit chain-of-thought state management
Reduced constraint violations across five planning domains
Diagnosis of planning failures as representational, not reasoning-bound

Methodology

Two phases:

Modeling phase — LLM constructs a structured representation defining:
- Entities in the problem
- State variables
- Actions available
- Constraints that must hold
Solution phase — LLM generates a plan conditioned on the explicit model built in phase 1

Ablation studies validate the criticality of the modeling phase — removing or compressing it collapses performance.

Results

MFR outperforms Chain-of-Thought and ReAct baselines across multiple domains:

Medical scheduling
Route planning
Resource allocation
Logic puzzles
Procedural synthesis

Specific reduction in constraint violations relative to CoT/ReAct reported across all five domains.

Limitations

Abstract does not detail failure modes quantitatively. Open questions: how does MFR compare on open-ended tasks without crisp constraint formulations? Does the explicit-modeling step scale to real-world agentic environments where state is partially observable?

Source: Model-First Reasoning LLM Agents — Annu Rana, Gaurav Kumar, December 2025

RELATED · IN THE BASE