Commit Graph

144 Commits

Author SHA1 Message Date
Juha Jeronen
4c6f843ff9 Talkinghead performance improvements and refactoring (#207)
* talkinghead: fix and improve THA3 manual poser

* server.py: no, don't yet use fp16 for talkinghead

* talkinghead: remove wxPython dependency from live mode

* comment out unused functions

* add TODO list

* coding style

* remove unused import

* add TODO marker

* message wordings

* fix typos in variable names

* talkinghead updates

* talkinghead updates

* Empty commit

* presentation order, sectioning

* fix the inMotion flag update

* mark a TODO

* refactor

* remove done TODO items

* mark a TODO

* comment wording

* pause animation while loading a new image

* parser doesn't belong here, not a command-line app anymore

* message wording

* use finally

* remove superfluous "global" declarations

* lots of cleanup

* remove silly sys.path entry

* improve docstring

* oops

* app.py now only serves the live mode for the plugin

* talkinghead live mode: remove ifacialmocap stuff (unused)

* improve comment

* list walking is so 1990s

* use double quotes consistently

* remove now-unused ifacialmocap-related files from the repo

* remove done TODO item

* improve docstring

* update comment

* remove now-unused function

* update comment

* improve docstring

* add TODO marker

* oops, typo

* add --talkinghead-model command-line option to server.py

Default is 'auto': float16 on GPU, float32 on CPU.

* talkinghead: auto-install THA3 models if needed

* remove tha3/models from git repo (have autodownload now)

* Add hf-hub as explicit dependency

* Add THA models to gitignore

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2023-12-21 23:48:25 +02:00
Cohee
47a5489142 Use pipeline for text summarization 2023-12-20 01:57:38 +02:00
Cohee
423f51e3f8 Use pipeline for image captioning 2023-12-20 01:28:53 +02:00
Cohee
98535e6fa3 Add links to search results 2023-12-11 23:52:25 +02:00
Cohee
aded5b4363 Add support for DuckDuckGo search engine in web search API 2023-11-28 22:39:35 +02:00
Cohee
05cab1c918 Add websearch module 2023-11-28 20:36:02 +02:00
Cohee
51a680946f Fix chroma embedding function 2023-11-16 18:28:52 +02:00
Cohee
beb434ae78 Fix chromadb purge 2023-10-18 23:54:37 +03:00
Cohee
60045b599f Fix double memory consumption for classify module 2023-09-03 16:45:20 +03:00
Cohee
5d51e752f4 #144 Increase max content length limit 2023-09-01 14:36:36 +03:00
Cohee
d68dd41e12 Merge pull request #133 from Tony-sama/neo
RVC expression-based dynamic voice
2023-08-24 21:27:18 +03:00
joe
63528152ef Ignore Auth for TalkingHead Fix 2023-08-24 18:08:45 +09:00
Tony Ribeiro
650b05190a Added dedicated script for classify feature. Clean call to classifier module in RVC. 2023-08-21 17:15:48 +02:00
Tony Ribeiro
16d267c916 Merge branch 'SillyTavern:neo' into neo 2023-08-21 05:21:00 +02:00
deffcolony
4bc0238f5c Update server.py 2023-08-18 23:31:42 +02:00
deffcolony
7311748c1f Python Colorama for some console prints 2023-08-18 23:23:14 +02:00
Tony Ribeiro
b3dd6f8ce3 Merge branch 'SillyTavern:neo' into neo 2023-08-18 17:43:32 +02:00
deffcolony
e3b51dfbab adds green color for url line
This will make sure the link is more visible in console
2023-08-18 16:49:04 +02:00
Tony Ribeiro
fa462c1979 Prototype of RVC + emotion classification pipeline 2023-08-18 01:10:57 +02:00
Tony Ribeiro
b5fcde2611 Added RVC request function to allow upload models from ST. Added parameter --max-content-length=X (Mb) to set server max request data size accordingly if needed. 2023-08-17 04:44:49 +02:00
Tony Ribeiro
185fc1334f Added parameter --rvc-save-file to save last rvc input/output audio file into data/tmp. For research purpose and easy sharing of TTS/RVC sample files 2023-08-17 01:43:13 +02:00
Tony Ribeiro
4992ce0bd3 Added Coqui STT module request function for listing local models and generating tts using it. 2023-08-16 19:20:55 +02:00
Tony Ribeiro
da6bbf879e Added RVC module function to fix models folder organisation (if possible) 2023-08-16 16:38:59 +02:00
Tony Ribeiro
fd6ef64d80 Updated RVC voice conversion function to handle new parameters 2023-08-15 21:16:59 +02:00
Tony Ribeiro
75d48a9638 Added argument '--coqui-models=model1,model2' allowing to download given coqui-api models and keep the last one loaded ready for first tts request. 2023-08-15 17:20:23 +02:00
Tony Ribeiro
84ed0f5de3 remove debug message 2023-08-15 15:43:58 +02:00
Tony Ribeiro
a403eb0247 Corrected coqui gpu option to be actually taken into account ^^ 2023-08-15 15:43:00 +02:00
Tony Ribeiro
c800eae3e8 Cleanup remaining old coqui implementation 2023-08-14 05:32:34 +02:00
Tony Ribeiro
46fecb5b48 Merge branch 'neo' of https://github.com/Tony-sama/SillyTavern-extras into neo 2023-08-14 04:04:55 +02:00
Tony Ribeiro
a440177f25 reduced coqui api call to minimum, added model download 2023-08-14 04:04:46 +02:00
Tony Ribeiro
f87ac34928 Merge branch 'SillyTavern:neo' into neo 2023-08-14 04:01:46 +02:00
Cohee
53020b34f7 #120 Fix coqui module overwriting system function 2023-08-13 18:53:03 +03:00
Tony Ribeiro
db63c58c30 Added missing route to populate ST coqui UI settings and download/check models of coqui api. 2023-08-13 02:19:47 +02:00
Tony Ribeiro
1389d3806d Started refactoring of Coqui-tts extension. 2023-08-12 06:06:55 +02:00
joe
712e5508da Clean up and prevent double load 2023-08-11 09:15:27 +09:00
joe
7ad0bb5598 live2d -> talkinghead name change 2023-08-11 06:50:59 +09:00
Tony Ribeiro
60a8e5c9c6 Add monkey patched fairseq package to run on python 3.11 (what is needed for our use of RVC at least) 2023-08-10 02:58:52 +02:00
Tony Ribeiro
0d00b5fdd0 Add module managing RVC extension to convert audio sent by ST using voice model 2023-08-09 03:38:13 +02:00
Cohee
9a650c4cec Merge pull request #111 from SillyTavern/main
Remove persist call for chroma
2023-08-08 17:44:12 +03:00
Cohee
5ed3013e45 Remove persist call for chroma 2023-08-08 17:43:15 +03:00
Cohee
36e6bfaf44 Merge branch 'main' into neo 2023-08-05 15:29:50 +03:00
Cohee
0c09107dd4 Add Python 3.11 recommendation nudge 2023-08-05 15:06:24 +03:00
joe
dbc7e07c0a Emotion Animation changes Staging 2023-08-05 17:02:53 +09:00
joe
5f4671f8e8 Coqui TTS Cpu/Gpu Flag Fix 2023-08-04 16:24:50 +09:00
joe
d1e2efedba Perfomance Increase and cleanup 2023-08-04 08:32:02 +09:00
joe
05a12822a7 WIP Updates to CTRL C hang up 2023-08-02 09:58:22 +09:00
Cohee
bfdf62765d Load live2d by posting a file 2023-08-01 23:57:32 +03:00
joe
ef8e0493e8 added live2d Arg for gpu or cpu mode 2023-08-01 07:32:32 +09:00
Cohee
38b6403ca1 Port chromadb update from neo 2023-07-31 22:14:42 +03:00
Cohee
d47cb539ad Merge pull request #96 from Tony-sama/neo
Restored speech recognition streaming mode as an independant module.
2023-07-31 20:42:40 +03:00