snicolast b3c66eb036 readme
2025-10-07 09:57:49 +13:00
2025-09-22 12:51:09 +12:00
2025-10-03 10:52:10 +13:00
2025-09-12 16:44:17 +12:00
2025-09-22 12:51:09 +12:00
2025-09-12 16:23:19 +12:00
2025-10-07 09:57:49 +13:00
2025-09-13 21:00:23 +01:00

ComfyUI-IndexTTS2

Lightweight ComfyUI wrapper for IndexTTS 2 (voice cloning + emotion control). Nodes call the upstream inference code so behaviour stays matched with the original repo.

Original repo: https://github.com/index-tts/index-tts

Updates

  • 2025-09-22: Added IndexTTS2 Advanced node exposing sampling, speed, seed, and other generation controls.

Install

  • Clone this repository into ComfyUI/custom_nodes/
  • Inside your ComfyUI Python environment:
    pip install wetext
    pip install -r requirements.txt
    

Models

Nodes

  • IndexTTS2 Simple speaker audio, text, optional emotion audio/vector; outputs audio + status string. Auto-selects device, FP16 on CUDA.
  • IndexTTS2 Advanced Simple inputs plus overrides for sampling, speech speed, pauses, CFG, seed.
  • IndexTTS2 Emotion Vector eight sliders (0.01.4, sum <= 1.5) producing an emotion vector.
  • IndexTTS2 Emotion From Text requires ModelScope and local QwenEmotion; turns short text into an emotion vector + summary.

Examples

  • Speaker audio -> IndexTTS2 Simple -> Preview/Save Audio
  • Speaker + emotion audio -> IndexTTS2 Simple -> Save
  • Emotion Vector -> IndexTTS2 Simple -> Save
  • Emotion From Text -> IndexTTS2 Simple -> Save

ComfyUI-IndexTTS2 nodes

Troubleshooting

  • Windows only so far; DeepSpeed is disabled.
  • Install wetext if the module is missing on first launch.
  • Emotion vector sum must stay <= 1.5.
Description
Lightweight ComfyUI wrapper for IndexTTS 2 (voice cloning + emotion control). The nodes call the original IndexTTS2 inference and keep behavior faithful to the repo.
Readme 20 MiB
Languages
Python 98.1%
Cuda 1%
C 0.6%
JavaScript 0.2%