mirror of
https://github.com/1038lab/ComfyUI-QwenTTS.git
synced 2026-05-14 01:34:23 +00:00
2.2 KiB
2.2 KiB
update v1.1.4
- new workflow work with new released ComfyUI-QwenAR
Workflow
update v1.1.3
- Dependence update and bug fixed
Update v1.1.0
Highlights
- Voice Clone nodes now support
VOICEinputs from the Voices Library for stable reuse. - New Tools bundle: Create Voice, Load Voice, Whisper STT, and Voice Instruct presets (EN + CN).
- Advanced nodes now expose attention selection (auto / sage_attn / flash_attn / sdpa / eager).
- README expanded with ComfyUI
extra_model_paths.yamlguidance for custom model locations. - Audio Duration node rewritten: cleaner logic, seconds-based outputs, optional frame calculation.
- 5 new sample workflows (v1.1.0):
- Clone_Voice.json — Voice Clone demo workflow (2 variants).
- Create_your_voice.json — Create and save your own voice profile.
- QwenTTS_Nodes.json — Overview of all custom nodes in this repo.
- QwenTTS_sample_workflow.json — Sample nodes for Voice Clone, Voice Design, and Custom Voice.
- Voice_design.json — Advanced Voice Design + preset voice instruct nodes (EN/ZH).
New / Updated Nodes
- Create Voice (QwenTTS)
- Build and save voice prompts to
.ptinComfyUI/output/qwen3-tts_voicesby default.
- Build and save voice prompts to
- Load Voice (QwenTTS)
- Load saved voices or use a custom path and output
VOICE.
- Load saved voices or use a custom path and output
- Whisper STT (QwenTTS)
- Transcribe
AUDIOto text with multiple model sizes.
- Transcribe
- Voice Clone (QwenTTS) / Voice Clone (QwenTTS) Advanced
- Added optional
voiceinput;reference_audiois only required if no voice is provided.
- Added optional
- Voice Instruct (QwenTTS)
- English preset builder from
voice_instruct.json.
- English preset builder from
- 声音风格指引 (QwenTTS)
- Chinese preset builder from
voice_instruct_zh.json.
- Chinese preset builder from
Audio Duration
- Outputs:
duration_int(seconds),duration_float(seconds),frames,audio_path. - Optional
fpsinput enables frame calculation.
