
feat: benchmark Operations Center with live progress dashboard #169

Merged
solderzzc merged 1 commit into develop from feature/benchmark-operations-center
Mar 18, 2026


@solderzzc
Member

  • Redesign generate-report.cjs as a multi-view Operations Center
    • Three tabs: Performance, Quality, Vision
    • Run picker sidebar with model-grouped history + multi-select
    • Comparison tables across selected runs
    • Export to Markdown for community sharing
  • Add live progress mode (auto-refresh + LIVE banner)
    • Intermediate saves after each suite completes
    • Browser auto-opens with pulsing progress indicator
    • Auto-refreshes every 5s during benchmark run
  • Save VLM fixture metadata (filename, response, prompt) per test
  • Embed all data inline for fully self-contained HTML
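The live progress mode and inline data embedding described above could be sketched roughly as follows. This is a minimal illustration, not the code from `generate-report.cjs`: the names `renderReport`, `serializeForInlineScript`, and `window.BENCHMARK_RUNS`, and the shape of the run objects, are all hypothetical. It assumes the report is rewritten after each suite completes, with `live: true` until the final save, and it uses a `<meta http-equiv="refresh">` tag as one simple way to reload the page every 5 seconds.

```javascript
// Hypothetical sketch: none of these names come from generate-report.cjs.

// Escape "<" so a "</script>" inside the data cannot end the inline script tag.
function serializeForInlineScript(data) {
  return JSON.stringify(data).replace(/</g, '\\u003c');
}

// Build one HTML string with the run data embedded inline (no external files).
// While `live` is true, the page reloads itself every `refreshSeconds` seconds.
function renderReport(runs, { live = false, refreshSeconds = 5 } = {}) {
  const refreshTag = live
    ? `<meta http-equiv="refresh" content="${refreshSeconds}">`
    : '';
  const banner = live ? '<div class="live-banner">LIVE</div>' : '';
  return [
    '<!DOCTYPE html>',
    '<html>',
    '<head>',
    '<meta charset="utf-8">',
    refreshTag,
    '<title>Benchmark Operations Center</title>',
    '</head>',
    '<body>',
    banner,
    `<script>window.BENCHMARK_RUNS = ${serializeForInlineScript(runs)};</script>`,
    '</body>',
    '</html>',
  ].filter(Boolean).join('\n');
}

// Assumed run shape, for illustration only.
const runs = [{ model: 'example-model', suite: 'Performance', passed: 3, total: 4 }];

// Intermediate save while the benchmark is still running.
const liveHtml = renderReport(runs, { live: true });

// Final save: no refresh tag and no LIVE banner.
const finalHtml = renderReport(runs);
```

Because the data travels inside the HTML file itself, the same file can be opened from disk or shared without a server. Dropping the refresh tag on the final save stops the page from reloading once the run is over.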

@solderzzc merged commit 884e270 into develop Mar 18, 2026
1 check passed
@solderzzc deleted the feature/benchmark-operations-center branch March 18, 2026 20:11