Entry point for Sledgehammer’s external prover interface · Isabelle/ML

Stream: Isabelle/ML

Topic: Entry point for Sledgehammer’s external prover interface

Jiangjing Xu (Nov 27 2025 at 12:12):

Dear Isabelle developers,

I hope you are doing well. I am a student currently working on a university project regarding the reliability and testing of proof assistants. According to the project specification, my task is to test the interface between Isabelle/HOL and external provers, which are primarily invoked via Sledgehammer.

My supervisor recommended that I consult this community to better understand the internal architecture. In particular, I would be very grateful if you could advise me on the following:

1. What is the main ML entry point or API through which Sledgehammer invokes external provers (such as E, Vampire, Z3)?
2. Which modules or functions in the Isabelle code base are responsible for:
   (a) preparing problems for external provers,
   (b) launching the external tools, and
   (c) parsing and integrating their results back into Isabelle?
3. If one wishes to instrument or fuzz the interface (for example, by injecting mutated prover calls or modified problem files), is there a recommended way to call this layer programmatically?

For context, I am working with Isabelle 2025, and this question is specifically motivated by Variant 3 of my project, which focuses on exercising the Sledgehammer–external prover interaction to search for crashes, miscommunication, hangs, or unexpected behaviours.

Thank you very much for your time and guidance. I sincerely appreciate your help as I am still learning the internal structure of Isabelle.

Kind regards,
Jiangjing Xu

Fabian Huch (Nov 27 2025 at 12:36):

Have a look at the function Sledgehammer_Prover_ATP.run_atp (in the file src/HOL/Tools/Sledgehammer/sledgehammer_prover_atp.ML), where all of this happens.

This calls all the components, e.g. ATP_Problem_Generate.generate_atp_problem (in src/HOL/Tools/ATP/atp_problem_generate.ML).

You can simply write Isabelle/ML code to call the involved functions yourself.

Fabian Huch (Nov 27 2025 at 12:41):

If you're new to Isabelle/ML, have a look at the first few sections of the Isabelle/ML Cookbook.

Kevin Kappelmann (Nov 27 2025 at 12:43):

Fabian Huch said:

If you're new to Isabelle/ML, have a look at the first few sections of the Isabelle/ML Cookbook.

That's the 2013 version. Here's the most up-to-date version (2019): https://urbanchr.github.io/Cookbook/

Jiangjing Xu (Nov 27 2025 at 15:31):

Fabian Huch said:

Have a look at the function Sledgehammer_Prover_ATP.run_atp (in the file src/HOL/Tools/Sledgehammer/sledgehammer_prover_atp.ML), where all of this happens.

This calls all the components, e.g. ATP_Problem_Generate.generate_atp_problem (in src/HOL/Tools/ATP/atp_problem_generate.ML).

You can simply write Isabelle/ML code to call the involved functions yourself.

Thank you very much, Mr. Huch; this is extremely helpful.

I will look into Sledgehammer_Prover_ATP.run_atp and the related modules you mentioned, especially ATP_Problem_Generate.generate_atp_problem. This gives me a clear starting point for understanding the full invocation chain.

I appreciate the guidance on calling these components directly from Isabelle/ML as well.

Jiangjing Xu (Nov 27 2025 at 15:33):

Kevin Kappelmann said:

Fabian Huch said:

If you're new to Isabelle/ML, have a look at the first few sections of the Isabelle/ML Cookbook.

That's the 2013 version. Here's the most up-to-date version (2019): https://urbanchr.github.io/Cookbook/

Thank you, Mr. Kappelmann, for the link to the updated Isabelle/ML Cookbook. I really appreciate it, and I will study the 2019 version to familiarise myself with Isabelle/ML before diving deeper into the Sledgehammer interface.

Mathias Fleury (Nov 27 2025 at 16:14):

It seems strange to me to try fuzzing E and Vampire this way, because Isabelle does not really try to reconstruct the proofs

Mathias Fleury (Nov 27 2025 at 16:14):

Fuzzing the SMT code, on the other hand, seems like a lot of fun :-)

Mathias Fleury (Nov 27 2025 at 16:15):

But I expect that this will find a lot of proof reconstruction problems

Mathias Fleury (Nov 27 2025 at 16:16):

before finding problems due to mutation

Mathias Fleury (Nov 27 2025 at 16:17):

BTW, I have a check_smt method somewhere that parses an input and a proof file. That might be useful to start fuzzing (you can mutate text files, which is easier than attempting to mutate the file generation itself).
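
A minimal sketch of the "mutate text files" idea, written in Python rather than Isabelle/ML. Everything here is an assumption made for illustration: the function names mutate_text and mutants are hypothetical, the mutation operators are generic line- and character-level edits, and the sample problem is a hand-written SMT-LIB file, not output of Isabelle's problem generation. It does not call check_smt or any Isabelle function; it only produces reproducible mutated file contents that could then be fed to such a checker.

```python
import random

# Generic mutation operators; not taken from Isabelle or any fuzzing tool.
OPERATORS = ["delete_line", "duplicate_line", "swap_lines", "replace_char"]


def mutate_text(text: str, rng: random.Random, n_mutations: int = 1) -> str:
    """Apply up to n_mutations random line- or character-level edits."""
    lines = text.splitlines()
    for _ in range(n_mutations):
        if not lines:
            break  # nothing left to mutate
        op = rng.choice(OPERATORS)
        i = rng.randrange(len(lines))
        if op == "delete_line":
            del lines[i]
        elif op == "duplicate_line":
            lines.insert(i, lines[i])
        elif op == "swap_lines":
            j = rng.randrange(len(lines))
            lines[i], lines[j] = lines[j], lines[i]
        elif op == "replace_char" and lines[i]:
            # Overwrite one character with a symbol likely to break parsing.
            k = rng.randrange(len(lines[i]))
            lines[i] = lines[i][:k] + rng.choice("()01 xyz") + lines[i][k + 1:]
    return "\n".join(lines) + "\n"


def mutants(text: str, seed: int, count: int) -> list[str]:
    """Return `count` mutants of `text`; the same seed gives the same mutants."""
    rng = random.Random(seed)
    return [mutate_text(text, rng) for _ in range(count)]


# Hand-written sample input (an assumption, not generated by Isabelle).
SAMPLE = """(set-logic QF_UF)
(declare-const p Bool)
(assert (and p (not p)))
(check-sat)
"""

if __name__ == "__main__":
    for n, m in enumerate(mutants(SAMPLE, seed=0, count=3)):
        print(f"--- mutant {n} ---")
        print(m, end="")
```

Seeding the generator keeps every mutant reproducible, so a file that triggers a crash or hang can be regenerated from its seed and index alone.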

Jiangjing Xu (Dec 08 2025 at 09:53):

Mathias Fleury said:

It seems strange to me to try fuzzing E and Vampire this way, because Isabelle does not really try to reconstruct the proofs

Thank you very much, Mr. Fleury, for your thoughtful advice. I really appreciate it.

Your points about proof reconstruction dominating the results and about focusing on SMT fuzzing are very helpful, and I will carefully take them into account when refining my direction. The check_smt method you mentioned also sounds like an excellent starting point.

Many thanks again for your guidance.

Mathias Fleury (Dec 09 2025 at 06:14):

Right, so the fork is at https://github.com/m-fleury/isabelle-emacs/tree/Isabelle2025-reconstruction. It contains changes that add support for a newer cvc5 and the latest veriT version. I will try to merge the changes with the latest RC5 over the weekend.

Last updated: Mar 17 2026 at 13:17 UTC