StatProver

An Automated Proof Generation and Self-Correction Agent for Rigorous Statistical Derivations

Recently, the team StatAI Lab, led by Professor Fan Zhou from the School of Statistics and Data Science at Shanghai University of Finance and Economics officially released StatProver. Following the statistical reasoning evaluation benchmark StatEval, this is another practical achievement by the StatAI Lab in enhancing the statistical proving capabilities of Large Language Models (LLMs).

What We Do

Recent benchmarks, such as StatEval, have evaluated state-of-the-art models in this area, showing their limitations in solving research-level proofs. Therefore, a dedicated framework is needed to improve LLM reasoning capabilities for statistical problem-solving. The system we developed aims to solve the issues of logical gaps and formula hallucinations that frequently occur when LLMs handle highly complex statistical derivations, realizing fully automated generation and self-correction from statistical propositions to rigorous LaTeX proofs.

StatProver’s design balances the convenience of automation with the rigor of scientific research: the system not only supports one-click, end-to-end fully automated proof generation, but also introduces a flexible human-AI collaboration mechanism. This workflow allows users to manually intervene at key nodes—such as keyword retrieval, framework generation, and error correction—ensuring that the derivation process remains highly controllable and accurate.

How We Do This

StatProver does not simply have the model generate an answer directly; instead, it ensures the quality of the proof through a robust six-stage pipeline:

StatProver Pipeline Architecture

Highlights

Try It Out!

Official Website : 3 free use on registry, 1 free use per day!

Download Full StatProver Technical Report (PDF)

Contact Us

If you have any questions or are interested in collaboration, please feel free to reach out to our laboratory: