| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| continue-cli | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | markdown | Web Search JSON Ranked | FAIL | | clean | no | 0% | 39.854 | 7.33 | details |
| continue-cli | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | markdown | Weekly Plan Next Week | FAIL | answered without invoking tool | clean | no | 0% | 5.275 | 5.075 | details |