Comments (18)
You need to see for yourself if there is a problem with the preprocessed files generated.
from looking-to-listen-at-the-cocktail-party.
@JusperLee your GPU size was in TB or GB ?
from looking-to-listen-at-the-cocktail-party.
one gpu was in 64GB
from looking-to-listen-at-the-cocktail-party.
@JusperLee okay..All the preprocessing has been done according to your readme only. Can you give possible solutions to this ?
from looking-to-listen-at-the-cocktail-party.
Due to the different experimental environments, I think you need to carefully check your code, including testing and preprocessing. This code did not show the above problems during my experiment.
from looking-to-listen-at-the-cocktail-party.
@JusperLee what was the accuracy of your audio visual model ?
from looking-to-listen-at-the-cocktail-party.
from looking-to-listen-at-the-cocktail-party.
@JusperLee Do you have your personal paper available for this in english ?
from looking-to-listen-at-the-cocktail-party.
Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation
from looking-to-listen-at-the-cocktail-party.
@JusperLee How to use this model on a new video? Can you share that code?
from looking-to-listen-at-the-cocktail-party.
You just need to change the input video.
from looking-to-listen-at-the-cocktail-party.
@JusperLee For that new video we again need to do the face embedding like all the preprocessing from start right ?
from looking-to-listen-at-the-cocktail-party.
Yes
from looking-to-listen-at-the-cocktail-party.
@JusperLee For how many speakers it will work?
from looking-to-listen-at-the-cocktail-party.
@JusperLee I'm getting following error while running the test.py file
Traceback (most recent call last):
File "test.py", line 92, in
T = utils.fast_istft(F,power=False)
File "/home/lenovo/Downloads/Looking-to-Listen-at-the-Cocktail-Party-master/utils.py", line 75, in fast_istft
data = istft(real_imag_shrink(data))
File "/home/lenovo/Downloads/Looking-to-Listen-at-the-Cocktail-Party-master/utils.py", line 30, in istft
Total[start:end] = Total[start:end] + data[i, :] * windows
ValueError: operands could not be broadcast together with shapes (257,) (512,)
from looking-to-listen-at-the-cocktail-party.
- This model does not limit the number of people mixed.
- The different dimensions may be a problem with your own operation, which I cannot help.
from looking-to-listen-at-the-cocktail-party.
@JusperLee At the place of 257 there should be 512 right?
from looking-to-listen-at-the-cocktail-party.
I think you can check this out. These questions are very basic.
from looking-to-listen-at-the-cocktail-party.
Related Issues (20)
- How to get dataset_train.txt? HOT 3
- X,Y co-ordinates will not be present in test videos as obvious HOT 1
- can face embeddings be provided in this repo HOT 1
- 作者您好 HOT 3
- istft error in model/utils/utils.py HOT 3
- The voice generation after STFT in AO_model is not 298*257*2. Why are the numbers in the first column different? HOT 3
- hi,when i run test,a error :unknown loss function:loss_func.What is going on, can it be solved? HOT 2
- something wrong in test code ? HOT 1
- 对cRM没有通过sigmoid将数值压缩到0-1? HOT 3
- 你好,在知乎回答中注意到你在视频帧扩展数据维度与音频保持一致时提到使用最近邻内插法,但是在代码中好像你使用的还是双线性内插法 HOT 3
- Colab script error in test.py part
- requirements.txt internal conflicts
- error in test.py file HOT 16
- type error
- Fit_generator() error in tensorflow 2.4.0 but not in 2.0.0
- Size of num_gpu while training AVmodel? HOT 4
- operands error HOT 2
- Output of the test.py file HOT 1
- Hardware used to train HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from looking-to-listen-at-the-cocktail-party.