| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gptme | local/qwen3.5:9b | ollama | 6488c96fa5faab64bb65cbd30d4289e20e6130ef535a93ef9a49f42eda893ea7 | tool | JSON Rank From File | PASS | recovered | yes | 100% | 120.008 | 76.234 | details | |
| gptme | local/qwen3.5:9b | ollama | 6488c96fa5faab64bb65cbd30d4289e20e6130ef535a93ef9a49f42eda893ea7 | tool | Web Fetch JSON Raw | PASS | recovered | yes | 100% | 89.005 | 87.25 | details | |
| gptme | local/qwen3.5:9b | ollama | 6488c96fa5faab64bb65cbd30d4289e20e6130ef535a93ef9a49f42eda893ea7 | tool | Web Search JSON Ranked | PASS | recovered | yes | 100% | 120.008 | 112.005 | details |