GripProbe Run Summary

ShellModelBackendHashFormatTestStatusReasonTrajectoryInvokedMatchWarmup (s)Measured (s)Details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownPatch File From Prepared PatchFAILcleanno0%43.9628.133details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownPython File QuotedFAILanswered without invoking toolcleanno0%5.9295.526details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownPython File Quoted DEPASScleanyes100%5.3265.528details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownPython File Quoted RUPASScleanyes100%5.9285.576details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownSave FileFAILanswered without invoking toolcleanno0%6.4786.629details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownShell Date DEPASScleanyes100%4.8245.076details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownShell Date RUPASScleanyes100%5.7774.976details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownShell FilePASScleanyes100%5.1255.378details
continue-clilocal/qwen2.5:7bollama845dbda0ea48ed749caafd9e6037047aa19acfcfd82e704d7ca97d631a0b697emarkdownWeb Nonce ProofFAILcleanno0%9.4846.278details