Comments (4)
Addendum: I converted the audio using the method you provided, shown below, but the same error still occurs.
from asrt_speechrecognition.
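This is clearly caused by the audio file being too long. As the ASRT project documentation states, a single speech sample is currently limited to at most 16 seconds. Anything longer can easily make the model's input too large and lead to out-of-memory errors, especially on less capable GPUs. If you have longer audio, first cut it into shorter segments.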
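The cutting step described above could be sketched as follows. This is a minimal illustration using only Python's standard `wave` module, not code from the ASRT project. The function name `split_wav` and the output naming scheme are hypothetical, and the input is assumed to be an uncompressed PCM WAV file.

```python
import wave

MAX_SECONDS = 16  # the limit quoted in the reply above


def split_wav(src_path, dst_prefix, max_seconds=MAX_SECONDS):
    """Split a PCM WAV file into consecutive chunks of at most max_seconds.

    Chunks are written as <dst_prefix>_000.wav, <dst_prefix>_001.wav, ...
    (a hypothetical naming choice). Returns the list of paths written.
    """
    written = []
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(src.getframerate() * max_seconds)
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # end of file reached
            out_path = f"{dst_prefix}_{index:03d}.wav"
            with wave.open(out_path, "wb") as dst:
                # Copy channels, sample width and rate; the frame count
                # in the header is corrected when the file is closed.
                dst.setparams(params)
                dst.writeframes(frames)
            written.append(out_path)
            index += 1
    return written
```

Calling `split_wav("long.wav", "part")` would write `part_000.wav`, `part_001.wav`, and so on. The cuts fall at fixed intervals, so a word may be split across two chunks; cutting at silences would need extra logic that is not shown here.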
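Hello, I followed your advice: I cut a 15-second clip and converted it to WAV format, but the same error still appears, and the first number in it is even larger, which is strange. I will try shortening the clip further. Thank you for your reply!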
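Hello, I tried again with an 8-second video. My conversion process was as follows:
This produced a WAV file, but when I feed it into prediction, the following error still appears:
If duration were the problem, 8 seconds should be within the limit. Could my video-to-audio conversion be wrong? I would appreciate your guidance, thank you!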
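One way to check whether the conversion is at fault is to inspect the header of the resulting WAV file. The sketch below uses Python's standard `wave` module. The target format in `looks_compatible` (16 kHz, mono, 16-bit) is an assumption that this thread does not confirm; only the 16-second limit is stated here. The function names are hypothetical.

```python
import wave


def describe_wav(path):
    """Return basic header fields of a PCM WAV file as a dict."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        frames = wav.getnframes()
        return {
            "sample_rate_hz": rate,
            "channels": wav.getnchannels(),
            "bits_per_sample": wav.getsampwidth() * 8,
            "duration_s": frames / rate,
        }


def looks_compatible(info, rate=16000, channels=1, bits=16, max_seconds=16):
    """Compare the header with an ASSUMED input format.

    The defaults for rate, channels and bits are assumptions; adjust
    them to whatever the project documentation actually specifies.
    """
    return (
        info["sample_rate_hz"] == rate
        and info["channels"] == channels
        and info["bits_per_sample"] == bits
        and info["duration_s"] <= max_seconds
    )
```

If `describe_wav` reports, for example, a 44.1 kHz stereo file, then a short clip still contains far more samples than a 16 kHz mono clip of the same length, which could explain an error that persists after the audio is shortened. Note that `wave` reads only uncompressed PCM files and raises `wave.Error` on other encodings.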