GripProbe Run Summary

| Shell | Model | Backend | Hash | Format | Test | Status | Reason | Trajectory | Invoked | Match | Warmup (s) | Measured (s) | Details |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| continue-cli | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | markdown | JSON Rank From File | FAIL | answered without invoking tool | clean | no | 0% | 35.892 | 6.829 | details |
| continue-cli | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | markdown | Web Fetch JSON Raw | FAIL | answered without invoking tool | clean | no | 0% | 5.776 | 5.827 | details |
| continue-cli | local/qwen2.5:7b | ollama | 845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697e | markdown | Web Search JSON Ranked | FAIL |  | clean | no | 0% | 8.333 | 8.332 | details |