Comments (4)
Thanks!
There are two possible way to have interjections:
- If the prompt has sounds like that, it's possible that the model will add that to the generation spontaneously
- if you wrote these in the target transcript, the model will just follow
from voicecraft.
Thanks! There are two possible way to have interjections:
- If the prompt has sounds like that, it's possible that the model will add that to the generation spontaneously
- if you wrote these in the target transcript, the model will just follow
So theoretically, VoiceCraft can be used to generate singing melody (vocal only) ?
from voicecraft.
Thanks! There are two possible way to have interjections:
- If the prompt has sounds like that, it's possible that the model will add that to the generation spontaneously
- if you wrote these in the target transcript, the model will just follow
So theoretically, VoiceCraft can be used to generate singing melody (vocal only) ?
I think it's unlikely to generate singing voice out-of-the-box, just because the training data is speech data. You could finetune it on singing data
from voicecraft.
Cool, I will try with singing data. Thanks
from voicecraft.
Related Issues (20)
- URL's unresponsive
- File Not Found Error when running ./data/phonemize_encodec_encode_hf.py file HOT 1
- nvm
- VoiceCraft Fine-tune dataset preparation HOT 4
- Simplify installation HOT 1
- espeak not installed on system even when attempting to hardcode path to espeak HOT 1
- Hugging Face demo no longer works
- Batch Inference HOT 2
- use facebook's pretrained encodec model HOT 7
- about silence tokens during inference
- Why inference TTS doesn't need to mask? HOT 2
- Any one have convertted the model to onnx?
- espeak issues on macOS Sonoma 14.2.1 HOT 1
- Have you tried to not delayed stacking input (Use delayed stacking for generation, but not on input)
- Data preperation problem
- Question regarding the how gradient accumulation is done. (It looks like we didn't /accumulation_steps when backprop loss )
- omageconf error
- Working in WSL but 10min+ inference
- I finetuned voicecraft on commonvoice-french, here are some of my findings/thoughts
- Total duration of training dataset HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from voicecraft.