llm-bench — produced artifacts

13/13 projects built · 6 run(s)

20260610-164327

validation: results.md

projectmodelstatusbuildrun time
001-rts-wfc/grok-build_grok-build_xhighgrok-build @xhighokok911s▶ play
001-rts-wfc/ollama-cloud_glm-5_1_xhighglm-5.1 @xhighokok2156s▶ play
001-rts-wfc/ollama-cloud_minimax-m3_xhighminimax-m3 @xhighokok4910s▶ play
001-rts-wfc/openai-codex_gpt-5_5_xhighgpt-5.5 @xhighokok1421s▶ play

20260611-184001

projectmodelstatusbuildrun time
001-rts-wfc/xiaomi-token-plan-ams_mimo-v2_5-pro_highmimo-v2.5-pro @highokok758s▶ play

20260612-171652

validation: results.md

projectmodelstatusbuildrun time
002-rts-wfc/grok-build_grok-build_xhighgrok-build @xhighokok3177s▶ play
002-rts-wfc/ollama-cloud_glm-5_1_xhighglm-5.1 @xhighokok2674s▶ play
002-rts-wfc/ollama-cloud_minimax-m3_xhighminimax-m3 @xhigherrorok3199s▶ play
002-rts-wfc/openai-codex_gpt-5_5_xhighgpt-5.5 @xhighokok3640s▶ play
002-rts-wfc/xiaomi-token-plan-ams_mimo-v2_5-pro_highmimo-v2.5-pro @highokok3004s▶ play

claude-manual-20260610

validation: results.md

projectmodelstatusbuildrun time
claude-fable-5claude-fable-5ok1970s▶ play
claude-opus-4-8claude-opus-4-8ok2648s▶ play

claude-manual-20260611

validation: results.md

projectmodelstatusbuildrun time
claude-opus-4-8claude-opus-4-8ok28000s▶ play