| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gptme | local/qwen3:8b | ollama | 500a1f067a9f782620b40bee6f7b0c89e17ae61f686b92c24933e4ca4b2b8b41 | tool | JSON Rank From File | TIMEOUT | recovered | yes | 0% | 120.009 | 120.008 | details | |
| gptme | local/qwen3:8b | ollama | 500a1f067a9f782620b40bee6f7b0c89e17ae61f686b92c24933e4ca4b2b8b41 | tool | Web Fetch JSON Raw | PASS | recovered | yes | 100% | 120.009 | 57.836 | details | |
| gptme | local/qwen3:8b | ollama | 500a1f067a9f782620b40bee6f7b0c89e17ae61f686b92c24933e4ca4b2b8b41 | tool | Web Search JSON Ranked | TIMEOUT | recovered | yes | 0% | 72.216 | 120.019 | details |