| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gptme | local/MFDoom/deepseek-r1-tool-calling:8b | ollama | unknown | tool | Patch File | NO_TOOL_CALL | recovered | maybe | 0% | 58.033 | 41.797 | details | |
| gptme | local/MFDoom/deepseek-r1-tool-calling:8b | ollama | unknown | tool | Python File Simple | NO_TOOL_CALL | recovered | maybe | 0% | 16.496 | 24.061 | details | |
| gptme | local/MFDoom/deepseek-r1-tool-calling:8b | ollama | unknown | tool | Shell Date | NO_TOOL_CALL | clean | no | 0% | 19.552 | 19.303 | details | |
| gptme | local/MFDoom/deepseek-r1-tool-calling:8b | ollama | unknown | tool | Shell PWD | NO_TOOL_CALL | clean | no | 0% | 12.189 | 12.139 | details |