| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gptme | local/mistral-nemo:12b | ollama | e7e06d107c6c86ed0cf45445f1790720b5092149c4c95f4d965844e9afbfdc89 | tool | JSON Rank From File | NO_TOOL_CALL | clean | no | 0% | 81.439 | 42.103 | details | |
| gptme | local/mistral-nemo:12b | ollama | e7e06d107c6c86ed0cf45445f1790720b5092149c4c95f4d965844e9afbfdc89 | tool | Web Fetch JSON Raw | PASS | recovered | yes | 100% | 40.951 | 37.897 | details | |
| gptme | local/mistral-nemo:12b | ollama | e7e06d107c6c86ed0cf45445f1790720b5092149c4c95f4d965844e9afbfdc89 | tool | Web Search JSON Ranked | NO_TOOL_CALL | recovered | maybe | 0% | 120.008 | 49.921 | details |