用的是b站视频链接上传步骤四也有错误，不知道是否有关，同样贴上 <a target="_blank" rel="noopener noreferrer n

<a href="https://github.com/Plachtaa/VITS-fast-fine-tuning/files/10834060/speaker_link

<a target="_blank" rel="noopener noreferrer nofollow" href="https://user-images.github

我用你的文件试了一下，没有任何问题，标注也正常生成了 <a href="https://github.com/Plachtaa/VITS-fast-fine-tun

步骤三报错 about vits-fast-fine-tuning HOT 16 OPEN

plachtaa commented on August 28, 2024

步骤三报错

from vits-fast-fine-tuning.

Comments (16)

Plachtaa commented on August 28, 2024

步骤3这个错误说明视频没下下来，看下你的txt

from vits-fast-fine-tuning.

Mylech commented on August 28, 2024

speaker_link2.txt
是格式错了么

from vits-fast-fine-tuning.

Mylech commented on August 28, 2024

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

收到，我会试试的

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

我用你的文件试了一下，没有任何问题，标注也正常生成了
long_character_anno.txt

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

经测试可能是因为whisper没能正常运行导致的，刚刚把模型型号从large换成了medium，你现在再试试看

from vits-fast-fine-tuning.

Mylech commented on August 28, 2024

又试了一遍，步骤三还是步骤三不一样

transcribing ./denoised_audio/vangogh_29936.wav...

transcribing ./denoised_audio/vangogh_583238.wav...

2023-02-26 16:42:38.096489: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-02-26 16:42:39.331181: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-02-26 16:42:39.331316: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-02-26 16:42:39.331336: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

这个打印的信息是正确的，看一下有没有生成long_character_anno.txt这个文件，有的话就没问题了

from vits-fast-fine-tuning.

Mylech commented on August 28, 2024

好，明晚试试，感谢

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

好，明晚试试，感谢

用你给的链接把梵高模型炼好了，效果还不错，在这里下载：
https://drive.google.com/drive/folders/1OICqnvcge55Hd7Ja8eIE0PzQZr4MHBRx?usp=sharing
感谢你为debug做出的贡献，爱你🤗

from vits-fast-fine-tuning.

Mylech commented on August 28, 2024

谢谢谢谢OTZ

from vits-fast-fine-tuning.

Unmserae commented on August 28, 2024

我也是同样的错误用的视频链接没有看到下面生成long_character_anno.txt那个文件，还是说我没找到吗？导入视频也是一样的结果
这是我的链接文件speaker_link2.txt
Running this codeblock will perform automatic vocal seperation & annotation. Since this step uses Whisper & Demucs, it may take a while to complete.

[MoviePy] Writing audio in ./raw_audio/madoka_652407.wav

100%|██████████| 5890/5890 [00:03<00:00, 1750.28it/s]

[MoviePy] Done.
speaker_link2.txt

[MoviePy] Writing audio in ./raw_audio/madoka_645071.wav

100%|██████████| 5890/5890 [00:02<00:00, 2184.38it/s]

[MoviePy] Done.

[MoviePy] Writing audio in ./raw_audio/madoka_271353.wav

100%|██████████| 5890/5890 [00:02<00:00, 2476.39it/s]

[MoviePy] Done.

Important: the default model was recently changed to htdemucs the latest Hybrid Transformer Demucs model. In some cases, this model can actually perform worse than previous models. To get back the old default model use -n mdx_extra_q.
Selected model is a bag of 1 models. You will see that many progress bars per track.
Separated tracks will be stored in /content/VITS-fast-fine-tuning/VITS-fast-fine-tuning/VITS-fast-fine-tuning/separated/htdemucs
Separating track raw_audio/madoka_645071.wav
100%|██████████████████████████████████████████████| 269.09999999999997/269.09999999999997 [00:14<00:00, 18.44seconds/s]
Important: the default model was recently changed to htdemucs the latest Hybrid Transformer Demucs model. In some cases, this model can actually perform worse than previous models. To get back the old default model use -n mdx_extra_q.
Selected model is a bag of 1 models. You will see that many progress bars per track.
Separated tracks will be stored in /content/VITS-fast-fine-tuning/VITS-fast-fine-tuning/VITS-fast-fine-tuning/separated/htdemucs
Separating track raw_audio/madoka_652407.wav
100%|██████████████████████████████████████████████| 269.09999999999997/269.09999999999997 [00:13<00:00, 19.90seconds/s]
Important: the default model was recently changed to htdemucs the latest Hybrid Transformer Demucs model. In some cases, this model can actually perform worse than previous models. To get back the old default model use -n mdx_extra_q.
Selected model is a bag of 1 models. You will see that many progress bars per track.
Separated tracks will be stored in /content/VITS-fast-fine-tuning/VITS-fast-fine-tuning/VITS-fast-fine-tuning/separated/htdemucs
Separating track raw_audio/madoka_271353.wav
100%|██████████████████████████████████████████████| 269.09999999999997/269.09999999999997 [00:13<00:00, 19.92seconds/s]
2023-03-04 05:08:21.936022: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-03-04 05:08:23.588622: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-03-04 05:08:23.588800: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-03-04 05:08:23.588833: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
transcribing ./denoised_audio/madoka_645071.wav...

transcribing ./denoised_audio/madoka_652407.wav...

transcribing ./denoised_audio/madoka_271353.wav...

2023-03-04 05:12:11.501043: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-03-04 05:12:13.146523: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-03-04 05:12:13.146706: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/lib/python3.8/dist-packages/cv2/../../lib64:/usr/lib64-nvidia
2023-03-04 05:12:13.146741: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
Warning: no short audios found, this IS expected if you have only uploaded long audios, videos or video links.
this IS NOT expected if you have uploaded a zip file of short audios. Please check your file structure or make sure your audio language is supported.

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

打印的信息应该是对的，没找到就刷新一下，我会用你链接的测试一遍

from vits-fast-fine-tuning.

Unmserae commented on August 28, 2024

打印的信息应该是对的，没找到就刷新一下，我会用你链接的测试一遍

刷新重试了一下，现在多了这个
Traceback (most recent call last):
File "long_audio_transcribe.py", line 45, in
code = file.rstrip(".wav").split("_")[1]
IndexError: list index out of range

from vits-fast-fine-tuning.

Plachtaa commented on August 28, 2024

是没有问题的，如果还是不行建议检查一下GPU是否开启，或者重启会话。

from vits-fast-fine-tuning.

Unmserae commented on August 28, 2024

是没有问题的，如果还是不行建议检查一下GPU是否开启，或者重启会话。

好的，谢谢大佬~

from vits-fast-fine-tuning.

步骤三报错 about vits-fast-fine-tuning HOT 16 OPEN

Comments (16)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent