Coder Social home page Coder Social logo

neural_srp's People

Contributors

egrinstein avatar yezhangyinge avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

neural_srp's Issues

Problems encountered in reproducing the model

Hi, I've tried to reproduce neural-srp and encountered some problems:

  1. when I proprocess the tau-nigens dataset, the following error occurs, and when I change the value of self._nb_unique_classes from 2 to
    3 it have been solved, Does it mean that the samples in the dataset do not have more than 2 sound source activities at most?
tnb_classes[frame_ind, active_event] = 1
IndexError: index 2 is out of bounds for axis 1 with size 2
  1. But When I tried to visualize the tau using the neural-srp-multi.bin, it reported that:
target_doas = target_doas.view( target_doas.shape[0], target_doas.shape[1], 3, max_nb_doas ).transpose(-1, -2)
RuntimeError: shape '[1, 50, 3, 2]' is invalid for input of size 500

Is it because I change the value of self._nb_unique_classes?

  1. When I tried to loading the doanet.bin to visualize the tau, it occured:
    c884ce4b050eff7fd256d6c8227d20f
    how can I solve it?

Thank you in advance, looking forward to your reply!

Questions bout the feature shape transform during normalization

Hello, after reading your code, i have some questions:

In GccExtractor, the feat matric shape is

gcc_feat = np.zeros((nb_frames, self._nb_bins, n_output_channels))
gcc_feat = gcc_feat.transpose((0, 2, 1))

and, in Preprocessor::extract_all_feature, the feat is trans to

        feat = self._gcc_extractor(audio_in)
        nb_frames = feat.shape[1]
        feat = feat.transpose((0, 2, 1)).reshape((nb_frames, -1))

which not make sense. ( i mean, the nb_frames refers to different meanings

I can't get it, could u help me about it ๏ผŸ If there is something wrong in this version of code? Thanks!!!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.