Files
ComfyUI/comfy/text_encoders
rattus ae79e33345 llama: use a more efficient rope implementation (#12434)
Get rid of the cat and unary negation and inplace add-cmul the two
halves of the rope. Precompute -sin once at the start of the model
rather than every transformer block.

This is slightly faster on both GPU and CPU bound setups.
2026-02-12 19:56:42 -05:00
..
2026-02-06 19:48:20 -05:00
2025-04-15 10:32:21 -04:00
2025-08-18 22:38:34 -04:00
2025-04-15 12:13:28 -04:00
2026-01-19 22:32:40 -05:00
2025-08-18 22:38:34 -04:00
2025-04-15 10:32:21 -04:00
2025-04-15 10:32:21 -04:00
2025-08-18 22:38:34 -04:00