| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| continue-cli | local/gemma4:e4b | ollama | c6eb396dbd5992bbe3f5cdb947e8bbc0ee413d7c17e2beaae69f5d569cf982eb | markdown | Web Search JSON Ranked | FAIL | answered without invoking tool | clean | no | 0% | 211.724 | 162.071 | details |
| continue-cli | local/gemma4:e4b | ollama | c6eb396dbd5992bbe3f5cdb947e8bbc0ee413d7c17e2beaae69f5d569cf982eb | markdown | Weekly Plan Next Week | FAIL | answered without invoking tool | clean | no | 0% | 135.005 | 80.037 | details |