Comments (4)
Currently we store `0.0` for non-media files and for media files that are not WAV or FLAC, and that value also goes into the DataFrame. But when we request the duration with `deps.duration(file)`, we return `None` for those files:
https://github.com/audeering/audb/blob/master/audb/core/dependencies.py#L273
So apparently some media files of audioset have a duration of `0.0` (is this expected?), which then causes the problem you describe.
The fix you suggest in #25 avoids calling `Dependencies.duration()`, and therefore the error does not occur. The question is whether we should stay with the current implementation of `Dependencies.duration()` or whether it is safer to return `0.0` instead of `None`.
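To make the trade-off concrete, here is a minimal sketch of the two behaviors discussed above. `ToyDependencies`, `duration_or_none()` and `duration_or_zero()` are hypothetical names invented for this illustration; this is not the audb implementation, only an assumption about how a stored `0.0` that is reported as `None` can break downstream arithmetic.

```python
from typing import Dict, Optional


class ToyDependencies:
    """Hypothetical stand-in for the dependency table (not the audb class)."""

    def __init__(self, durations: Dict[str, float]):
        # The table stores 0.0 whenever no duration is available
        self._durations = durations

    def duration_or_none(self, file: str) -> Optional[float]:
        # Mirrors the behavior described above: a stored 0.0 becomes None
        value = self._durations[file]
        return value if value else None

    def duration_or_zero(self, file: str) -> float:
        # The alternative under discussion: return the stored value unchanged
        return float(self._durations[file])


deps = ToyDependencies({"a.wav": 1.5, "empty.wav": 0.0, "notes.txt": 0.0})
files = ["a.wav", "empty.wav"]

# Summing durations fails as soon as one file reports None
try:
    total_none = sum(deps.duration_or_none(f) for f in files)
except TypeError:
    total_none = None

# With 0.0 the same computation needs no special case
total_zero = sum(deps.duration_or_zero(f) for f in files)
```

Returning `0.0` keeps the return type uniform, at the cost that callers can no longer tell "no duration available" from "empty file" by the return value alone.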
from audb.
Here's a list of files for which duration is 0.0:
audioset/eval_segments/1eHDeRxypFE_120.000.wav
audioset/unbalanced_train_segments/-dG6nL9hxCI_30.000.wav
audioset/unbalanced_train_segments/0vcR8EJPkNg_30.000.wav
audioset/unbalanced_train_segments/1Xhai0AqqYc_30.000.wav
audioset/unbalanced_train_segments/1YLFyYYjOBo_350.000.wav
audioset/unbalanced_train_segments/1eOwK_vnSNs_530.000.wav
audioset/unbalanced_train_segments/1zTnrk9WbWw_22.000.wav
audioset/unbalanced_train_segments/3UpqEQK8L2g_420.000.wav
audioset/unbalanced_train_segments/3Uuz8uckuZI_150.000.wav
audioset/unbalanced_train_segments/3Uz50faafeQ_110.000.wav
audioset/unbalanced_train_segments/3Vfu_ZGAIj0_490.000.wav
audioset/unbalanced_train_segments/3W-qiMM0qb4_320.000.wav
audioset/unbalanced_train_segments/3WDtoYwcNQU_210.000.wav
audioset/unbalanced_train_segments/3XfKg7nY8qU_590.000.wav
audioset/unbalanced_train_segments/3sSblRfEG2o_20.000.wav
audioset/unbalanced_train_segments/3zMB6xg7ouA_540.000.wav
audioset/unbalanced_train_segments/4rubpXyUYWo_520.000.wav
audioset/unbalanced_train_segments/5K6G403jyqE_30.000.wav
audioset/unbalanced_train_segments/5SVRRZZLpLQ_270.000.wav
audioset/unbalanced_train_segments/5jhswDK5HY8_190.000.wav
audioset/unbalanced_train_segments/5vqWfePO8r0_60.000.wav
audioset/unbalanced_train_segments/6-B1MeRJ0fM_30.000.wav
audioset/unbalanced_train_segments/6bQ5bFruUSU_470.000.wav
audioset/unbalanced_train_segments/7XfsRZmrPTk_380.000.wav
audioset/unbalanced_train_segments/8MXI-oFA1wk_30.000.wav
audioset/unbalanced_train_segments/8Slj7B_LDfQ_30.000.wav
audioset/unbalanced_train_segments/9_AdiGNxqk0_30.000.wav
audioset/unbalanced_train_segments/9wsqa5iuvUk_580.000.wav
audioset/unbalanced_train_segments/BTxswE3Ktho_140.000.wav
audioset/unbalanced_train_segments/BaM_Lb-anUM_30.000.wav
audioset/unbalanced_train_segments/BjLqOn28WM8_540.000.wav
audioset/unbalanced_train_segments/CmgWyoNh_LU_30.000.wav
audioset/unbalanced_train_segments/DUtaiHX5acw_100.000.wav
audioset/unbalanced_train_segments/Dm1ZmRNHNsU_30.000.wav
audioset/unbalanced_train_segments/ER9r34AjR6s_500.000.wav
audioset/unbalanced_train_segments/Fli8wjBFV2M_230.000.wav
audioset/unbalanced_train_segments/FnYkF2oI_f0_530.000.wav
audioset/unbalanced_train_segments/HUfTrnZ-RSk_30.000.wav
audioset/unbalanced_train_segments/HkWYbsFGDsY_470.000.wav
audioset/unbalanced_train_segments/IZjp4g5t_vo_170.000.wav
audioset/unbalanced_train_segments/JTDe2G1VAx4_110.000.wav
audioset/unbalanced_train_segments/K-TjW8HcSU4_370.000.wav
audioset/unbalanced_train_segments/LAnLq0qql2o_180.000.wav
audioset/unbalanced_train_segments/LhsnmpK0xkQ_490.000.wav
audioset/unbalanced_train_segments/MOV1oBlyYpU_160.000.wav
audioset/unbalanced_train_segments/OHJDtmjrCYM_250.000.wav
audioset/unbalanced_train_segments/PAHEldIA5Yk_270.000.wav
audioset/unbalanced_train_segments/QaBgMk4ccos_210.000.wav
audioset/unbalanced_train_segments/Qb04wAv0c_4_270.000.wav
audioset/unbalanced_train_segments/QcGpyXhIENI_80.000.wav
audioset/unbalanced_train_segments/QfILH7zmFxk_290.000.wav
audioset/unbalanced_train_segments/RFzGhgLcpv0_50.000.wav
audioset/unbalanced_train_segments/TV2NtVguqeE_180.000.wav
audioset/unbalanced_train_segments/WNtE9AvBOKg_560.000.wav
audioset/unbalanced_train_segments/WtFGPDqfOgE_560.000.wav
audioset/unbalanced_train_segments/XQoLtCFZG-c_160.000.wav
audioset/unbalanced_train_segments/YlwxbqtJST4_13.000.wav
audioset/unbalanced_train_segments/_vrj7xAGCUM_30.000.wav
audioset/unbalanced_train_segments/aEVbRV_c9bk_24.000.wav
audioset/unbalanced_train_segments/aPvNMALHyaI_290.000.wav
audioset/unbalanced_train_segments/b4TNdsnx6B8_30.000.wav
audioset/unbalanced_train_segments/bj0u9F6DEhc_250.000.wav
audioset/unbalanced_train_segments/bjLiWLXtXK4_27.000.wav
audioset/unbalanced_train_segments/cnzMmXjihdQ_10.000.wav
audioset/unbalanced_train_segments/eqnveHA0hrI_200.000.wav
audioset/unbalanced_train_segments/fVZVIZc2pAM_200.000.wav
audioset/unbalanced_train_segments/gT-IsJN7uis_460.000.wav
audioset/unbalanced_train_segments/gVEix6vNL7k_120.000.wav
audioset/unbalanced_train_segments/gp-5bdPzSDc_350.000.wav
audioset/unbalanced_train_segments/h3XDhqBwU1A_150.000.wav
audioset/unbalanced_train_segments/hZ0_tspxeX0_390.000.wav
audioset/unbalanced_train_segments/iYWzoGSLYPg_380.000.wav
audioset/unbalanced_train_segments/ileDehaisZE_520.000.wav
audioset/unbalanced_train_segments/izFhj77r7Uc_480.000.wav
audioset/unbalanced_train_segments/jt4Z46pknFo_60.000.wav
audioset/unbalanced_train_segments/lGhZqXf8zw0_240.000.wav
audioset/unbalanced_train_segments/lrv96Tkaifc_50.000.wav
audioset/unbalanced_train_segments/mDPXk7MF6YE_170.000.wav
audioset/unbalanced_train_segments/oAl41EK2Kd0_440.000.wav
audioset/unbalanced_train_segments/r2PcA0hjBZk_30.000.wav
audioset/unbalanced_train_segments/s8s2tnpTD4s_390.000.wav
audioset/unbalanced_train_segments/s9GfRQr3OAk_60.000.wav
audioset/unbalanced_train_segments/tKRkWDLfzZg_80.000.wav
audioset/unbalanced_train_segments/tpYfNeSrVu0_310.000.wav
audioset/unbalanced_train_segments/wM5Qf5xXT8w_410.000.wav
audioset/unbalanced_train_segments/y8Vmi3tHgaw_50.000.wav
audioset/unbalanced_train_segments/yYJZk8LMCxk_310.000.wav
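A list like the one above could be produced by filtering the dependency table for WAV or FLAC files whose stored duration is `0.0`. The following is a sketch under assumptions: the table is modeled as a plain dict from file path to duration, `zero_duration_audio()` is a hypothetical helper, and the two `example/` entries are made up for illustration.

```python
import os
from typing import Dict, List


def zero_duration_audio(durations: Dict[str, float]) -> List[str]:
    """Return WAV/FLAC files whose stored duration is 0.0 (hypothetical helper)."""
    audio_extensions = {".wav", ".flac"}
    return sorted(
        file
        for file, duration in durations.items()
        if os.path.splitext(file)[1].lower() in audio_extensions
        and duration == 0.0
    )


table = {
    "audioset/eval_segments/1eHDeRxypFE_120.000.wav": 0.0,
    "audioset/unbalanced_train_segments/cnzMmXjihdQ_10.000.wav": 0.0,
    "example/ok.wav": 10.0,      # hypothetical file with a valid duration
    "example/labels.csv": 0.0,   # hypothetical non-media file, 0.0 is expected here
}

suspicious = zero_duration_audio(table)
```

Non-media files are excluded on purpose, since `0.0` is the expected stored value for them.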
Good point, I also think that `audb.Dependencies.duration()` should never return `None`. The same holds for `sampling_rate()`, `bit_depth()` and `channels()`. I opened #27 to address this.
Regarding the empty audioset files, I created an issue.
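The idea that `duration()`, `sampling_rate()`, `bit_depth()` and `channels()` should never return `None` can be sketched as follows. This is an illustration only and not the change made in #27: `ToyDependencies` and its row layout are assumptions, and the getters simply return the stored value with a fixed type.

```python
from typing import Dict, Union

Row = Dict[str, Union[int, float]]


class ToyDependencies:
    """Hypothetical table whose getters always return a number, never None."""

    def __init__(self, rows: Dict[str, Row]):
        self._rows = rows

    def bit_depth(self, file: str) -> int:
        return int(self._rows[file]["bit_depth"])

    def channels(self, file: str) -> int:
        return int(self._rows[file]["channels"])

    def duration(self, file: str) -> float:
        return float(self._rows[file]["duration"])

    def sampling_rate(self, file: str) -> int:
        return int(self._rows[file]["sampling_rate"])


deps = ToyDependencies(
    {
        # Hypothetical rows: a regular audio file and a non-media file
        "a.wav": {"bit_depth": 16, "channels": 1, "duration": 1.5, "sampling_rate": 16000},
        "notes.txt": {"bit_depth": 0, "channels": 0, "duration": 0.0, "sampling_rate": 0},
    }
)

# Every getter yields a plain number, so no None checks are needed downstream
values = [
    deps.bit_depth("notes.txt"),
    deps.channels("notes.txt"),
    deps.duration("notes.txt"),
    deps.sampling_rate("notes.txt"),
]
```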
Fixed by #27