ComfyUI_frontend

mirror of https://github.com/Comfy-Org/ComfyUI_frontend.git synced 2026-05-11 16:30:57 +00:00

Author	SHA1	Message	Date
snomiao	eda7f22fa8	fix: restore PR files with 'git checkout HEAD' instead of 'git checkout -'. 'git checkout -' uses @{-1}, which requires a previous branch switch. Since we use 'git checkout origin/main -- .' (file checkout, not branch switch), there is no @{-1} ref. Use HEAD to restore from current branch. Also restore proper concurrency group. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 23:26:59 +00:00
snomiao	4f80c88c36	fix: use unique concurrency group to unstick QA runs	2026-03-20 23:20:54 +00:00
snomiao	ee876b470e	feat: integrate comfy-qa skill test plan into QA recording pipeline Pass the comprehensive test plan from .claude/skills/comfy-qa/SKILL.md to Gemini when generating test steps. This gives Gemini knowledge of all 12 QA categories (canvas, menus, sidebar, settings, etc.) so it picks the most relevant tests for each PR. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 22:22:49 +00:00
snomiao	c94f808f2c	fix: use vite build directly to skip nx typecheck dependency nx build runs typecheck as a prerequisite (via @nx/vite/plugin config). Use vite build directly for the main branch comparison build. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 22:02:50 +00:00
snomiao	9e2292410a	fix: skip typecheck when building main branch for QA comparison Main branch may have transient TS errors when built with the PR branch's lockfile. Since we only need the dist for visual comparison, run nx build directly instead of pnpm build (which includes typecheck). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 21:48:38 +00:00
snomiao	7ab13f753c	refactor: replace Codex with direct Playwright recording in QA pipeline Replace the unreliable codex exec approach with a Playwright script (qa-record.ts) that uses Gemini to generate targeted test steps from the PR diff, then executes them deterministically via Playwright's API. Key changes: - New scripts/qa-record.ts: Gemini generates JSON test actions, Playwright executes them with reliable helper functions (menu nav, dialog fill, etc.) - Remove codex CLI and playwright-cli dependencies - Remove 150+ lines of prompt templates from pr-qa.yaml - Firefox headless with video recording (same approach proven locally) - Fallback steps if Gemini fails Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 21:43:03 +00:00
snomiao	9615c0168a	fix: resolve merge conflict in pr-report.yaml	2026-03-20 20:16:32 +00:00
snomiao	cf84d68e8c	feat: auto-generate regression tests from QA reports - Tighten BEFORE prompt to 15s snapshot (show old state only) - Add qa-generate-test.ts: Gemini-powered Playwright test generator - New workflow step: generate .spec.ts and push to {branch}-add-qa-test - Tests assert UIUX behavior (tab names, dirty state, visibility)	2026-03-20 20:15:25 +00:00
snomiao	050fad59a2	feat: before/after video comparison for QA pipeline - Build both main (dist-before/) and PR (dist/) frontends in focused mode - Run QA twice: BEFORE on main branch frontend, AFTER on PR branch - Send both videos to Gemini in one request for comparative analysis - Side-by-side dashboard layout with Before (main) / After (PR) panels - Comparative prompt evaluates whether before confirms old behavior and after proves the fix works - Falls back to single-video mode when no before video available	2026-03-20 16:46:51 +00:00
snomiao	7350353959	fix: make QA videos seekable with faststart and frequent keyframes moov atom was at end of file (8.6MB offset) — browser had to download the entire video before seeking. Keyframes were only every 10 seconds. Add -movflags +faststart (moov before mdat) and -g 60 (keyframe every 2.4s at 25fps) to ffmpeg conversion.	2026-03-20 16:30:04 +00:00
snomiao	0251607c41	fix: issue cards instead of dense table, rename to comfy-qa.pages.dev - Replace 6-column confirmed issues table with vertical card blocks using colored severity/timestamp/confidence badges - Rename Cloudflare Pages project from comfyui-qa-videos to comfy-qa	2026-03-20 16:17:03 +00:00
snomiao	082c647454	fix: seekable video, hide empty cards, PR-aware video review - Remove autoplay/loop so video timeline is seekable - Skip card generation for platforms without recordings - Add --pr-context flag to qa-video-review.ts so Gemini evaluates against PR purpose instead of just describing what happened - Workflow now builds pr-context.txt from PR title/body/diff	2026-03-20 10:47:31 +00:00
snomiao	2b32994541	feat: redesign QA dashboard with modern frontend design OKLCH color tokens, liquid glass card surfaces, Inter + JetBrains Mono typography, grain texture overlay, staggered fade-up animations, pill action buttons with SVG icons, and improved report table styling.	2026-03-20 08:26:09 +00:00
snomiao	d1e3e747f3	fix: make settings pre-seed non-fatal and try both API endpoints The /api/settings endpoint returned 4xx in CI. Try both /api/settings and /settings endpoints, and don't fail the job if neither works.	2026-03-19 16:07:14 +00:00
snomiao	df10e3246b	fix: pre-seed Comfy.TutorialCompleted to skip template gallery in QA The Codex agent was spending 35s browsing the "Getting Started" template gallery instead of testing the PR's changes. Pre-seeding this setting via the ComfyUI API ensures the agent lands directly in the graph editor.	2026-03-19 15:57:40 +00:00
snomiao	224c845b6c	fix: tighten focused QA prompt to only test PR-specific behavior The Codex agent was spending time on login flow, template browsing, and general smoke testing instead of testing the PR's actual changes. Changes: - Add 30-second time budget for video recording - Move video-start AFTER login and editor verification - Explicitly prohibit template browsing and sidebar exploration - Reduce test steps to 3-6 targeted actions - Restructure prompt with clear Instructions/Rules sections	2026-03-19 14:34:01 +00:00
snomiao	d4f22467c0	fix: render markdown in QA reports with marked.js Replace crude sed-based markdown conversion with client-side rendering via marked.js CDN. Adds proper table, list, and code styling for the report section. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-19 12:11:10 +00:00
snomiao	b764c39492	fix: run report job on workflow_dispatch events Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-19 10:08:46 +00:00
snomiao	8b2db18a1b	refactor: replace GPT frame extraction with Gemini native video analysis Replace the OpenAI GPT-based frame extraction approach (ffmpeg + screenshots) with Gemini 2.5 Flash's native video understanding. This eliminates false positives from frame-based analysis (e.g. "black screen = critical bug" during page transitions) and produces dramatically better QA reviews. Changes: - Remove ffmpeg frame extraction, ffprobe duration detection, and all related logic (~365 lines removed) - Add @google/generative-ai SDK for native video/mp4 upload to Gemini - Update CLI: remove --max-frames, --min-interval-seconds, --keep-frames flags - Update env: OPENAI_API_KEY → GEMINI_API_KEY - Update workflow: swap API key secret and model in pr-qa.yaml - Update report: replace "Frames analyzed" with "Video size" - Add note in prompt that brief black frames during transitions are normal Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-19 08:33:07 +00:00
snomiao	0b3f6c9bdc	fix: use fill+click for quick login before video recording playwright-cli doesn't support 'evaluate' command. Instead, instruct Codex to quickly fill the username input and click Next on user-select page BEFORE starting video recording, so the video only shows actual QA testing.	2026-03-18 23:21:41 +00:00
snomiao	2f88a07c75	fix: use evaluate to set localStorage before video recording storageState config doesn't work with playwright-cli. Instead, use evaluate to set Comfy.userId/userName after opening the page, then navigate back. This skips user-select before video-start so the recording only shows actual QA testing.	2026-03-18 23:08:05 +00:00
snomiao	ac7158f25e	fix: pre-seed localStorage to skip user-select in QA runs Write a Playwright storageState JSON with Comfy.userId/userName pre-set so the app loads directly to the graph editor. Saves ~40s per QA run that was wasted on navigating the user-select page.	2026-03-18 22:54:41 +00:00
snomiao	3b4402c2d1	fix: prefer explicit qa-session.webm over corrupt auto-recorded videos The convert step was using find which picked up a 0-byte file from playwright's videos/ directory instead of the valid qa-session.webm. Now prefers qa-session.webm explicitly and skips empty files.	2026-03-18 22:31:54 +00:00
snomiao	8d50345193	fix: improve focused QA prompt to test PR-specific behavior, not random walk	2026-03-18 22:10:17 +00:00
snomiao	b00e58e513	fix: re-add push trigger for sno-skills and sno-qa-* branches	2026-03-18 21:27:36 +00:00
snomiao	f4148d13f1	fix: also install ffprobe for GPT video review frame extraction	2026-03-18 21:13:48 +00:00
snomiao	438af97b3d	fix: use sudo for ffmpeg static binary extraction to /usr/local/bin	2026-03-18 20:58:55 +00:00
snomiao	7e756ec673	fix: use static ffmpeg binary instead of apt-get (avoids dpkg lock hang)	2026-03-18 20:33:50 +00:00
snomiao	87a4316125	fix: add ffmpeg install back (not pre-installed on GH runners)	2026-03-18 20:16:33 +00:00
snomiao	52a825f631	fix: normalize flat artifact download into expected subdirectory	2026-03-18 20:07:58 +00:00
snomiao	7bde727c7b	fix: pre-install chromium and clarify prompt for codex Codex was using pnpm dlx instead of the global playwright-cli. Pre-install chromium in setup step and make prompt explicit about using the global command directly without pnpm/npx.	2026-03-18 19:39:52 +00:00
snomiao	dea1c2c421	fix: add debug output to video convert step	2026-03-18 19:10:47 +00:00
snomiao	7549ab4807	fix: use danger-full-access sandbox for codex on GH Actions	2026-03-18 18:58:46 +00:00
snomiao	3ca4a144ba	fix: use correct codex model name gpt-5.4-mini	2026-03-18 18:50:41 +00:00
snomiao	34c6e15678	feat: switch QA from Claude Code to OpenAI Codex CLI Replace claude --print with codex exec for cheaper QA runs. Uses codex-mini-latest model ($1.50/$6 vs Sonnet $3/$15). Uses existing OPENAI_API_KEY secret (no new secrets needed).	2026-03-18 18:45:05 +00:00
snomiao	799fc9e0bc	fix: default to linux-only QA, full 3-OS only via qa-full label Reduces per-run cost from ~$10-16 to ~$2.50 by defaulting to Linux-only. Use qa-full label or workflow_dispatch for 3-OS runs.	2026-03-18 18:27:57 +00:00
snomiao	c7844f3c74	fix: use explicit video-start/stop, remove ffmpeg install, use gpt-4.1-mini - Replace saveVideo config (didn't produce video) with explicit playwright-cli video-start/video-stop commands in QA prompt - Remove apt-get install ffmpeg step (pre-installed on GH runners) - Switch video review model from gpt-4o to gpt-4.1-mini	2026-03-18 18:18:18 +00:00
snomiao	eec387ea49	fix: use auto video recording and show GPT reports on QA site - Enable saveVideo in playwright-cli config for real video recording - Replace screenshot stitching with webm→mp4 conversion - Move video review step before deploy so reports are included - Add GPT video review reports inline on the Cloudflare Pages site - Each video card now has expandable "GPT Video Review" section	2026-03-18 17:20:55 +00:00
snomiao	9cbbc05042	fix: configure playwright-cli outputDir and improve artifact collection - Set .playwright/cli.config.json with outputDir pointing to screenshots/ - This way bare 'playwright-cli screenshot' auto-saves to the right place - Create screenshot directory before Claude runs (don't rely on Claude) - Collect step now searches working directory for stray PNGs - Simplified prompt: no --filename needed, just 'playwright-cli screenshot'	2026-03-18 16:42:30 +00:00
snomiao	cd491b19d3	fix: stitch screenshots from correct directory and simplify prompt Screenshots were saved to artifact root but stitch looked in frames/. Now: prompt tells Claude to save to screenshots/ dir with numbered names, collect step consolidates PNGs there, stitch step globs from screenshots/. Removed video-start/video-stop (Claude doesn't use them).	2026-03-18 15:50:34 +00:00
snomiao	11575306cc	fix: use playwright-cli video recording and collect default output - Add playwright-cli config with outputDir and saveVideo - Use video-start/video-stop instead of relying on screenshot frames - Add fallback artifact collection from .playwright-cli/ default dir - Simplify prompts to focus on video recording workflow	2026-03-18 15:14:09 +00:00
snomiao	a7d600242b	fix: resolve QA_ARTIFACTS path in prompt so Claude gets the literal path The escaped \$QA_ARTIFACTS in the heredoc produced literal text '$QA_ARTIFACTS' in the prompt. Claude's Bash tool didn't reliably expand this env var, so no screenshots or reports were saved. Remove the escapes so the heredoc expands the variable to the actual path (e.g. /home/runner/work/_temp/qa-artifacts).	2026-03-18 14:22:29 +00:00
snomiao	f8bdda585a	fix: escape backticks in QA prompt heredoc to prevent command substitution Backtick-wrapped playwright-cli examples in the unquoted heredoc were being interpreted as bash command substitution, producing empty prompts. Replace backtick syntax with plain "Run:" prefixed commands.	2026-03-18 13:48:06 +00:00
snomiao	31675b62cc	fix: reorganize QA CI — remove screen recording, merge video-review into report - Remove all Xvfb/ffmpeg screen recording infrastructure from qa job (captured blank display since playwright-cli runs headless) - Add screenshot instructions to QA prompts: Claude saves sequential frames to $QA_ARTIFACTS/frames/ after every interaction - Stitch screenshots into video via ffmpeg in report job (2fps) - Merge video-review job into report job (4 jobs → 3 jobs) - Unified PR comment with video links + video review in <details> collapse - Clean up stale QA_VIDEO_REVIEW_COMMENT markers from prior runs	2026-03-18 09:51:53 +00:00
snomiao	f14252accb	feat: add comfy-qa skill and automated QA CI pipeline Add Claude Code skills and a label-triggered QA workflow: - .claude/skills/comfy-qa/SKILL.md: 12-category QA test plan using playwright-cli for browser automation - .github/workflows/pr-qa.yaml: CI workflow triggered by qa-changes (focused, Linux) or qa-full (3-OS matrix) labels. Records screen via ffmpeg, runs Claude CLI with playwright-cli, deploys video gallery to Cloudflare Pages, posts PR comment with GIF thumbnails, and runs OpenAI vision-based video review - scripts/qa-video-review.ts: frame extraction + GPT-4o analysis - scripts/qa-video-review.test.ts: unit tests for video review - knip.config.ts: resolve knip errors for ingest-types package	2026-03-17 06:25:44 +00:00
Christian Byrne	46dad2e077	ops: restrict PyPI publishing to bi-weekly ComfyUI releases (#9948) ## Summary Restrict PyPI publishing of `comfyui-frontend-package` to bi-weekly ComfyUI release cycles only, instead of every nightly version bump. ## Changes - What: Move `publish_pypi` job from `release-draft-create.yaml` to `release-biweekly-comfyui.yaml` 1. Removed `publish_pypi` job from `release-draft-create.yaml` (no longer publishes on every merged Release PR) 2. Added `publish-pypi` job to `release-biweekly-comfyui.yaml` with tag polling, build, publish, and PyPI availability confirmation 3. Gated `create-comfyui-pr` on `publish-pypi` success so the ComfyUI requirements bump PR is only created after the package is confirmed available 4. Updated ComfyUI PR body to confirm PyPI availability instead of warning about a pending release PR - Breaking: None — nightly releases still create GitHub releases and publish npm types; only PyPI publishing timing changes - Dependencies: None ## Review Focus - The `publish-pypi` job uses `if: always() && needs.resolve-version.result == 'success'` to run even when `trigger-release-if-needed` is skipped (tag already exists) - Tag polling (30min timeout) waits for the version bump PR to be merged before building from the tagged commit - PyPI propagation polling (15min timeout) confirms the package is installable before creating the ComfyUI PR Fixes COM-16778 ┆Issue is synchronized with this [Notion page](https://www.notion.so/PR-9948-ops-restrict-PyPI-publishing-to-bi-weekly-ComfyUI-releases-3246d73d36508198b00fcc247ac5b58c) by [Unito](https://www.unito.io) --------- Co-authored-by: GitHub Action <action@github.com>	2026-03-16 13:11:15 -07:00
Johnpaul Chiwetelu	f68d8365a6	chore: enable auto-merge on backport PRs (#10108) ## Summary - Adds `gh pr merge --auto --squash` after backport PR creation in the backport workflow, so backport PRs merge automatically once checks pass - Uses `|| echo "::warning::..."` fallback to avoid failing the workflow if auto-merge can't be enabled (e.g. repo setting not configured) ## Test plan - [ ] Trigger backport workflow on a test PR with `needs-backport` label - [ ] Verify auto-merge is enabled on the created backport PR ┆Issue is synchronized with this [Notion page](https://www.notion.so/PR-10108-chore-enable-auto-merge-on-backport-PRs-3256d73d3650814eb6e5fb2bdf3c5ec7) by [Unito](https://www.unito.io)	2026-03-16 12:57:48 -07:00
Johnpaul Chiwetelu	ed5e0a0b51	chore: replace team CODEOWNERS with external PR review workflow (#10104) ## Summary Remove team assignments from CODEOWNERS to reduce notification noise for internal PRs. Add a workflow that requests team review only when external contributors open PRs. ## Changes - What: Strip `@Comfy-org/comfy_frontend_devs` and `@Comfy-Org/comfy_maintainer` from all CODEOWNERS entries (keep individual user assignments). Add `pr-request-team-review.yaml` workflow that uses `pull_request_target` to request team review for non-collaborator PRs. - Dependencies: None ## Review Focus - The workflow uses `pull_request_target` but does not check out or execute any untrusted code — it only runs `gh pr edit --add-reviewer`. - The `author_association` check excludes OWNER, MEMBER, and COLLABORATOR — internal PRs will not trigger team review requests. ┆Issue is synchronized with this [Notion page](https://www.notion.so/PR-10104-chore-replace-team-CODEOWNERS-with-external-PR-review-workflow-3256d73d3650813b887ac16b5e97b4c4) by [Unito](https://www.unito.io)	2026-03-16 15:49:58 +01:00
Christian Byrne	a7218b2922	fix: fix perf CI pipeline — z-score baselines, force-push staleness, baseline storage (#9886) ## Summary Fixes three critical issues with the CI performance reporting pipeline that made perf reports useless on PRs (demonstrated by PR #9248 — deep watcher removal merged without useful perf signal). ## Changes ### 1. Fix z-score baseline variance collection (`0/5 runs`) Root cause: PR #9305 added z-score statistical analysis code to `perf-report.ts`, but the historical data download step was placed in the wrong workflow file. The report is generated in `pr-perf-report.yaml` (a `workflow_run`-triggered job), but the historical download was in `ci-perf-report.yaml` (the test runner) — different runners, different filesystems. Fix: Implement `perf-data` orphan branch storage: - On push to main: save `perf-metrics.json` to `perf-data` branch with timestamped filename - On PR report: fetch last 5 baselines from `perf-data` branch into `temp/perf-history/` - Rolling window of 20 baselines, oldest pruned automatically - Same pattern used by `github-action-benchmark` (33.7k repos) ### 2. Fix force-push comment staleness Root cause: `cancel-in-progress: true` kills the perf test run before it uploads artifacts. The downstream report workflow only triggers on `conclusion == 'success'` — cancelled runs are ignored, so the comment from the first successful run goes stale. Fix: - Change `cancel-in-progress: false` — with GitHub's queue depth of 1, rapid pushes (A,B,C,D) run A and D, skipping B and C - Add SHA validation in `pr-perf-report.yaml` — before posting, check if the workflow_run's head SHA still matches the PR's current head. Skip posting stale results. ### 3. Add permissions for baseline operations - `contents: write` on CI job (needed for pushing to perf-data branch) - `actions: read` on both workflows (needed for artifact/baseline access) ## One-time setup required After merging, create the `perf-data` orphan branch: ```bash git checkout --orphan perf-data git rm -rf . echo '# Performance Baselines' > README.md mkdir -p baselines git add README.md baselines git commit -m 'Initialize perf-data branch' git push origin perf-data ``` The first 2 pushes to main after setup will build up variance data, and z-scores will start appearing in PR reports (threshold is `historical.length >= 2`). ## Testing - YAML validated with `yaml.safe_load()` - `perf-report.ts` `loadHistoricalReports()` already reads from `temp/perf-history/<index>/perf-metrics.json` — no code changes needed - All new steps use `continue-on-error: true` for graceful degradation ┆Issue is synchronized with this [Notion page](https://www.notion.so/PR-9886-fix-fix-perf-CI-pipeline-z-score-baselines-force-push-staleness-baseline-storage-3226d73d365081538424c7945e71f308) by [Unito](https://www.unito.io)	2026-03-15 02:46:10 -07:00
Christian Byrne	4bdf67ca21	fix: restore fork PR lint/format CI workflow (#9846) ## Problem The lint/format CI workflow was broken for fork PRs in two ways: ### 1. Node version mismatch in setup-frontend action The `setup-frontend` shared action (created in #8377) was missed when Node version was standardized to `.nvmrc` in #9521. It still used `node-version: 'lts/*'` instead of `node-version-file: '.nvmrc'`. ### 2. Fork PRs with lint issues silently passed CI Fork PRs with auto-fixable lint/format issues got a green checkmark despite having unfixed issues: 1. Auto-fix steps (`lint:fix`, `format`) fix issues in the workspace 2. `Commit changes` is correctly skipped for forks (can't push to fork branches) 3. `Final validation` passes because it runs on the already-fixed workspace 4. The `Comment on PR about manual fix needed` step tries to post a comment via `actions/github-script`, but fork PRs have a read-only `GITHUB_TOKEN` — the comment silently fails (`continue-on-error: true`) 5. Result: workflow reports success, contributor thinks their code is clean ## Fix - setup-frontend: Use `node-version-file: '.nvmrc'` instead of `node-version: 'lts/*'` - ci-lint-format: Replace the broken fork comment step with an explicit `exit 1` that fails CI and prints clear fix instructions in the log. This follows the principle from `.github/AGENTS.md`: fork PRs can't post comments, so don't try. ## Testing - [ ] Verify fork PRs with clean code still pass - [ ] Verify fork PRs with lint issues now properly fail (instead of silently passing) ┆Issue is synchronized with this [Notion page](https://www.notion.so/PR-9846-fix-restore-fork-PR-lint-format-CI-workflow-3226d73d3650811cb5bfe9f1f989cc0c) by [Unito](https://www.unito.io)	2026-03-13 06:14:16 -07:00
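
The z-score baseline comparison and the rolling baseline window described in commit a7218b2922 could be sketched roughly as follows. This is a minimal illustration, not the code in `perf-report.ts`: the names `zScore` and `pruneBaselines`, the use of sample variance, and the assumption that timestamped filenames sort chronologically are all hypothetical. Only the `historical.length >= 2` threshold and the window of 20 baselines come from the commit message.

```typescript
// Hypothetical result shape; the real report structure may differ.
interface ZScoreResult {
  mean: number
  stddev: number
  z: number
}

// Compare the current metric against historical baselines.
// Returns null when there is not enough data to estimate variance.
function zScore(current: number, historical: number[]): ZScoreResult | null {
  // Threshold taken from the commit message: historical.length >= 2.
  if (historical.length < 2) return null

  const mean = historical.reduce((sum, v) => sum + v, 0) / historical.length

  // Assumption: sample variance (n - 1 denominator).
  const variance =
    historical.reduce((sum, v) => sum + (v - mean) * (v - mean), 0) /
    (historical.length - 1)
  const stddev = Math.sqrt(variance)

  // Identical baselines give zero variance, so no z-score is defined.
  if (stddev === 0) return null

  return { mean, stddev, z: (current - mean) / stddev }
}

// Keep only the newest `keep` baselines (rolling window of 20 by default).
// Assumption: filenames start with a sortable timestamp.
function pruneBaselines(filenames: string[], keep = 20): string[] {
  return [...filenames].sort().slice(-keep)
}
```

With baselines of 8, 10, and 12 and a current value of 12, the mean is 10, the sample standard deviation is 2, and the resulting z-score is 1.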


317 Commits