Juha Jeronen
4c6f843ff9
Talkinghead performance improvements and refactoring (#207)
* talkinghead: fix and improve THA3 manual poser
* server.py: no, don't yet use fp16 for talkinghead
* talkinghead: remove wxPython dependency from live mode
* comment out unused functions
* add TODO list
* coding style
* remove unused import
* add TODO marker
* message wordings
* fix typos in variable names
* talkinghead updates
* talkinghead updates
* Empty commit
* presentation order, sectioning
* fix the inMotion flag update
* mark a TODO
* refactor
* remove done TODO items
* mark a TODO
* comment wording
* pause animation while loading a new image
* parser doesn't belong here, not a command-line app anymore
* message wording
* use finally
* remove superfluous "global" declarations
* lots of cleanup
* remove silly sys.path entry
* improve docstring
* oops
* app.py now only serves the live mode for the plugin
* talkinghead live mode: remove ifacialmocap stuff (unused)
* improve comment
* list walking is so 1990s
* use double quotes consistently
* remove now-unused ifacialmocap-related files from the repo
* remove done TODO item
* improve docstring
* update comment
* remove now-unused function
* update comment
* improve docstring
* add TODO marker
* oops, typo
* add --talkinghead-model command-line option to server.py
Default is 'auto': float16 on GPU, float32 on CPU.
* talkinghead: auto-install THA3 models if needed
* remove tha3/models from git repo (have autodownload now)
* Add hf-hub as explicit dependency
* Add THA models to gitignore
---------
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2023-12-21 23:48:25 +02:00
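The 'auto' default described for `--talkinghead-model` above can be sketched as follows. This is a minimal illustration, not the code from server.py: only the option name and the float16-on-GPU / float32-on-CPU rule come from the commit message. The choice names `float16` and `float32`, the helper `resolve_precision`, and the `gpu_available` flag are assumptions made for the example.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: only the option name and its "auto" default
    # are taken from the commit message.
    parser = argparse.ArgumentParser(description="sketch of a model-precision option")
    parser.add_argument(
        "--talkinghead-model",
        default="auto",
        choices=["auto", "float16", "float32"],  # assumed value names
        help="Model precision; 'auto' picks float16 on GPU, float32 on CPU.",
    )
    return parser


def resolve_precision(choice: str, gpu_available: bool) -> str:
    """Turn the option value into a concrete precision (hypothetical helper)."""
    if choice == "auto":
        return "float16" if gpu_available else "float32"
    # An explicit choice is passed through unchanged.
    return choice


# argparse exposes "--talkinghead-model" as the attribute "talkinghead_model".
args = build_parser().parse_args(["--talkinghead-model", "auto"])
precision = resolve_precision(args.talkinghead_model, gpu_available=False)
```

Keeping the resolution in a small pure function means the rule can be checked without a GPU; a real server would pass in whatever its backend reports for device availability.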
Cohee
47a5489142
Use pipeline for text summarization
2023-12-20 01:57:38 +02:00
Cohee
423f51e3f8
Use pipeline for image captioning
2023-12-20 01:28:53 +02:00
Cohee
98535e6fa3
Add links to search results
2023-12-11 23:52:25 +02:00
Cohee
aded5b4363
Add support for DuckDuckGo search engine in web search API
2023-11-28 22:39:35 +02:00
Cohee
05cab1c918
Add websearch module
2023-11-28 20:36:02 +02:00
Cohee
51a680946f
Fix chroma embedding function
2023-11-16 18:28:52 +02:00
Cohee
beb434ae78
Fix chromadb purge
2023-10-18 23:54:37 +03:00
Cohee
60045b599f
Fix double memory consumption for classify module
2023-09-03 16:45:20 +03:00
Cohee
5d51e752f4
#144 Increase max content length limit
2023-09-01 14:36:36 +03:00
Cohee
d68dd41e12
Merge pull request #133 from Tony-sama/neo
RVC expression-based dynamic voice
2023-08-24 21:27:18 +03:00
joe
63528152ef
Ignore Auth for TalkingHead Fix
2023-08-24 18:08:45 +09:00
Tony Ribeiro
650b05190a
Added dedicated script for classify feature. Clean call to classifier module in RVC.
2023-08-21 17:15:48 +02:00
Tony Ribeiro
16d267c916
Merge branch 'SillyTavern:neo' into neo
2023-08-21 05:21:00 +02:00
deffcolony
4bc0238f5c
Update server.py
2023-08-18 23:31:42 +02:00
deffcolony
7311748c1f
Python Colorama for some console prints
2023-08-18 23:23:14 +02:00
Tony Ribeiro
b3dd6f8ce3
Merge branch 'SillyTavern:neo' into neo
2023-08-18 17:43:32 +02:00
deffcolony
e3b51dfbab
adds green color for url line
This will make sure the link is more visible in console
2023-08-18 16:49:04 +02:00
Tony Ribeiro
fa462c1979
Prototype of RVC + emotion classification pipeline
2023-08-18 01:10:57 +02:00
Tony Ribeiro
b5fcde2611
Added RVC request function to allow uploading models from ST. Added parameter --max-content-length=X (MB) to set the server's max request data size accordingly if needed.
2023-08-17 04:44:49 +02:00
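A size option given in megabytes, like `--max-content-length` above, usually has to be converted to bytes before it is handed to the web framework. The sketch below shows one way to do that. It assumes the unit means 1024 * 1024 bytes, uses a made-up default of 100, and uses a plain dict as a stand-in for the framework's configuration (Flask, for example, reads a byte limit from its `MAX_CONTENT_LENGTH` config key). None of these details are confirmed by the commit itself.

```python
import argparse

BYTES_PER_MB = 1024 * 1024  # assumption: the option's unit is mebibytes


def max_content_length_bytes(megabytes: int) -> int:
    """Convert a size in megabytes to bytes (hypothetical helper)."""
    if megabytes <= 0:
        raise ValueError("max content length must be positive")
    return megabytes * BYTES_PER_MB


parser = argparse.ArgumentParser()
# The default of 100 is illustrative only.
parser.add_argument("--max-content-length", type=int, default=100)
args = parser.parse_args(["--max-content-length=500"])

# Stand-in for a framework config mapping; the limit is stored in bytes.
config = {"MAX_CONTENT_LENGTH": max_content_length_bytes(args.max_content_length)}
```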
Tony Ribeiro
185fc1334f
Added parameter --rvc-save-file to save the last RVC input/output audio file into data/tmp. For research purposes and easy sharing of TTS/RVC sample files.
2023-08-17 01:43:13 +02:00
Tony Ribeiro
4992ce0bd3
Added Coqui STT module request function for listing local models and generating tts using it.
2023-08-16 19:20:55 +02:00
Tony Ribeiro
da6bbf879e
Added RVC module function to fix models folder organisation (if possible)
2023-08-16 16:38:59 +02:00
Tony Ribeiro
fd6ef64d80
Updated RVC voice conversion function to handle new parameters
2023-08-15 21:16:59 +02:00
Tony Ribeiro
75d48a9638
Added argument '--coqui-models=model1,model2' to download the given coqui-api models and keep the last one loaded, ready for the first TTS request.
2023-08-15 17:20:23 +02:00
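The comma-separated form of `--coqui-models` and the "keep the last one loaded" behaviour can be illustrated with a short sketch. The function names `parse_model_list` and `plan_downloads` are hypothetical, and the example only plans the work; it does not call any Coqui API.

```python
from typing import Optional


def parse_model_list(value: str) -> list[str]:
    """Split 'model1,model2' into names, ignoring blanks and stray spaces."""
    return [name.strip() for name in value.split(",") if name.strip()]


def plan_downloads(value: str) -> tuple[list[str], Optional[str]]:
    """Return the models to download and the one to keep loaded.

    Following the commit message, the last listed model is the one
    kept ready for the first TTS request.
    """
    models = parse_model_list(value)
    keep_loaded = models[-1] if models else None
    return models, keep_loaded


to_download, keep_loaded = plan_downloads("model1,model2")
```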
Tony Ribeiro
84ed0f5de3
remove debug message
2023-08-15 15:43:58 +02:00
Tony Ribeiro
a403eb0247
Corrected coqui gpu option to be actually taken into account ^^
2023-08-15 15:43:00 +02:00
Tony Ribeiro
c800eae3e8
Cleanup remaining old coqui implementation
2023-08-14 05:32:34 +02:00
Tony Ribeiro
46fecb5b48
Merge branch 'neo' of https://github.com/Tony-sama/SillyTavern-extras into neo
2023-08-14 04:04:55 +02:00
Tony Ribeiro
a440177f25
reduced coqui api call to minimum, added model download
2023-08-14 04:04:46 +02:00
Tony Ribeiro
f87ac34928
Merge branch 'SillyTavern:neo' into neo
2023-08-14 04:01:46 +02:00
Cohee
53020b34f7
#120 Fix coqui module overwriting system function
2023-08-13 18:53:03 +03:00
Tony Ribeiro
db63c58c30
Added missing route to populate ST coqui UI settings and download/check models of coqui api.
2023-08-13 02:19:47 +02:00
Tony Ribeiro
1389d3806d
Started refactoring of Coqui-tts extension.
2023-08-12 06:06:55 +02:00
joe
712e5508da
Clean up and prevent double load
2023-08-11 09:15:27 +09:00
joe
7ad0bb5598
live2d -> talkinghead name change
2023-08-11 06:50:59 +09:00
Tony Ribeiro
60a8e5c9c6
Add monkey-patched fairseq package to run on Python 3.11 (at least the parts needed for our use of RVC)
2023-08-10 02:58:52 +02:00
Tony Ribeiro
0d00b5fdd0
Add module managing RVC extension to convert audio sent by ST using voice model
2023-08-09 03:38:13 +02:00
Cohee
9a650c4cec
Merge pull request #111 from SillyTavern/main
Remove persist call for chroma
2023-08-08 17:44:12 +03:00
Cohee
5ed3013e45
Remove persist call for chroma
2023-08-08 17:43:15 +03:00
Cohee
36e6bfaf44
Merge branch 'main' into neo
2023-08-05 15:29:50 +03:00
Cohee
0c09107dd4
Add Python 3.11 recommendation nudge
2023-08-05 15:06:24 +03:00
joe
dbc7e07c0a
Emotion Animation changes Staging
2023-08-05 17:02:53 +09:00
joe
5f4671f8e8
Coqui TTS Cpu/Gpu Flag Fix
2023-08-04 16:24:50 +09:00
joe
d1e2efedba
Performance Increase and cleanup
2023-08-04 08:32:02 +09:00
joe
05a12822a7
WIP Updates to CTRL C hang up
2023-08-02 09:58:22 +09:00
Cohee
bfdf62765d
Load live2d by posting a file
2023-08-01 23:57:32 +03:00
joe
ef8e0493e8
added live2d Arg for gpu or cpu mode
2023-08-01 07:32:32 +09:00
Cohee
38b6403ca1
Port chromadb update from neo
2023-07-31 22:14:42 +03:00
Cohee
d47cb539ad
Merge pull request #96 from Tony-sama/neo
Restored speech recognition streaming mode as an independent module.
2023-07-31 20:42:40 +03:00