| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gptme | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | tool | JSON Rank From File | TIMEOUT | clean | yes | 0% | 75.322 | 120.012 | details | |
| gptme | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | tool | Web Fetch JSON Raw | FAIL | recovered | yes | 0% | 19.505 | 17.701 | details | |
| gptme | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | tool | Web Search JSON Ranked | TIMEOUT | clean | yes | 0% | 37.693 | 120.017 | details |