Comments (8)
Thank you @harshhpareek, this really got me rolling. A few more notes for anyone else who steps down this path:
- Electron 2 will not launch, it crashes repeatedly; upgrading to 4 fixed this, so change it in
package.json
accordingly. - Ensure
ffmpeg
is installed via your system package manager, eg.apt install ffmpeg
npm run install
npm audit fix
pip3 install ffmpeg-python
is needed for output to conclude properly, or you need to disable it in settings (it was enabled by default for me)- If you want to use ffmpeg, you need to change the ffmpeg command call(s) in
python/audio_post.py
- Wherever you find
ffmpeg_path=...
set it toffmpeg_path="ffmpeg"
so that it finds the version installed in your system PATH.
- Wherever you find
- If you want to use ffmpeg, you need to change the ffmpeg command call(s) in
- The waveglow models are archived on nexus (I used this; not sure if there are implications I am unaware of, but it does generate
- The requirements files have a few dependencies not available on linux (pywin32; pywin32-ctypes). I commented these out and did
pip3 install requirements.txt
from xva-synth.
For the ffmpeg thing, I ship the ffmpeg.exe in the compiled version on Steam / Nexusmods. For linux, I suppose it wouldn't help anyway even if I did include it in the repo. I don't use linux, but maybe an easier thing to do rather than changing code might be to place a simlink between the place where the exe would be, and wherever your ffmpeg is installed on linux (maybe - I'm not sure).
The waveglow models are a legacy thing, they are no longer needed (not since v2.0). You can still use them if you wish, however. Nowadays, the per-voice individual HiFi-GAN vocoders are preferable, as they are fine-tuned specifically to the voice, rather than using some off-the-shelf waveglow one-size-fits-all vocoder (and they are faster). In upcoming model versions, I think this will go away also, with the models being end-to-end.
I'm working on the next major version (v3), I'll finally update the Electron version to make that easier. Are you sure about tqdm not being available on linux?
from xva-synth.
Yeah, you're probably right about symlinking, I was going for quick and dirty to make sure it would work on my setup. You are also correct about tqdm, that misunderstanding was the result of something else -- having installed on a second machine this morning, I realized that.
On the topic of waveglow, I don't have options for any models besides WaveGlow, Big WaveGlow, and quick and dirty? How can I enabled the HiFi-GAN vocoders (sorry if I've just missed the instructions, I will have another look of course)? For testing, I am only using the Female Dunmer voice from Morrowind, just to note in case that has something to do with it?
EDIT: Also, how rude of me to not lead with this, but thank you for open-sourcing this incredible project. That is mighty generous, and then sharing your models and research tidbits too. Words can't even express the gratitude.
from xva-synth.
The hifigan models are the xxxxx.hg.pt files, where the xxxx is the voice ID. Most if not all new voices since about v1.1 of the app have been trained and released together with this additional file (on nexus, the description will show "Model: FastPitch1.1+HiFi"). When using the app, the default behaviour (can be changed in settings) is to use hifigan when the voice is loaded if available, though you can change the vocoder in the "Vocoder" dropdown (the lightning bolt next to it shows that it's available).
from xva-synth.
In v1.0.3, I had to add models/waveglow_256channels_universal_v4.pt
from the nexus page manually to get this working.
from xva-synth.
Thank you for your detailed post regarding this. I admit, this section was a little bare.
It is true, CUDA and pytorch are necessary dependencies. I am unsure what the Electron quick-start is required for.
Models can be downloaded from the nexus pages, and they have the correct file structure required. The gpu error is caused by an earlier error in the code causing some other code not to run. The earlier error is the mising waveglow checkpoint which you correctly point out must be downloaded from nvidia and placed in the models folder.
I use v2.0.0 (this is an old project). Keep it there for behaviour consistent to mine. I use CUDA 10.1 but this shouldn't matter, so long as it matches the pytorch version you download.
from xva-synth.
Regarding Electron quick start: i meant that some dependency is missing, which that project installed
from xva-synth.
Thank you kindly. I will try to route any other questions through discord to prevent cluttering github with non-issues.
from xva-synth.
Related Issues (20)
- Can this work offline? HOT 3
- Cannot use xVASynth Fuz Ro Bork plugin v1.2 anymore since xVASynth v3.0.0. HOT 1
- xVASynth v3.0.0. / xVASynthVoices.json HOT 2
- [ Feature Request ] An option to enable command-line-interface/cli. HOT 1
- Loading new model does not reset Styles HOT 1
- Implement Smoothness parameter
- Bug: Phoneme editor container stretches longer than its content HOT 1
- Nexus Mods Premium - File integrity check HOT 1
- Error generating batch voice for Cirilla HOT 1
- Drag & Drop voice model archives on sidebar - progressbar
- Allow plugin to change emotional modifiers to whole sequence
- Nexus Mods Premium - install stalls if no game folder found HOT 2
- ffmpeg: MP3 format crash with Deesing HOT 2
- ARPABet dictionaries are not loaded for v3 models HOT 2
- AM cleanup crash v3 models if sentence begins with "Am" HOT 1
- ARPAbet "the orc" fusion on v3 voices
- xVAPitch/v3 models <PAD> symbols ignored by modifier if done one by one HOT 2
- Add audio streaming support HOT 2
- Wav2Vec2, FastPitch 1, and FastPitch 1.1 code does not remove '.exe' from expected executable paths when not running on Windows
- Intel ARC GPU support HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from xva-synth.