Hello how are you?
Finally with your explanations I managed to use Zero Shot ASS,
However, when examining the results, using the test keys "vocals",
I got unattractive results, I didn't understand why, being that in the examples of the videos they sound very good,
So my question is as follows,
In the "Data/Query" folder, to get decent results, how many examples should I put there?
Because I did as follows,
In "mixture.wav" is the complete instrumental with the vocals,
And in the "Query" folder I put 15 examples of clean vocals (from the singer in question),
So am I doing it wrong?
NOTE: Each example is on average 7 to 15 seconds long in .wav format
I tested it on other examples and got the same pattern (unattractive results),
When I refer to unattractive results, I mean almost no final changes, the files in the "wavoutput" folder are practically unmodified.
I only tested the model (htsat_audioset_2048d.ckpt) because the other two models are incompatible with this code, I received checkpoint errors (because the script is not standardized to accept them).
So if you can let me know why I would be grateful,
My thanks in advance,
Lucas Rodrigues.