Coder Social home page Coder Social logo

oxford-cs-deepnlp-2017 / practical-2 Goto Github PK

View Code? Open in Web Editor NEW
112.0 18.0 93.0 209 KB

Oxford Deep NLP 2017 course - Practical 2: Text Classification

Home Page: https://www.cs.ox.ac.uk/teaching/courses/2016-2017/dl/

nlp natural-language-processing deep-learning machine-learning oxford

practical-2's People

Contributors

iassael avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

practical-2's Issues

the first question

Hello,

My name is Quei-An, an Msc student. For the first question of practical 2, I don't understand much what "starting from random embeddings" means. You mention GloVe afterwards, so I suppose that stands for word embeddings, but how can we train all word embeddings with such limited amount of data (only ~2000 talks)?

I tried somehow to implement the model with respect to the classification problem, but firstly I can only feed in one input at a time because each document has different size (number of words), secondly there's so little data that the accuracy turns out to be so poor. Then you mention "Training in batches", so I start getting confused.

Could you please clarify?

If I use fixed word representations (e.g. word2vec) then everything seems OK, I get about 56% accuracy by trying different hyperparameters.

Thank you and have a nice weekend.

my neural net only predicts 'ooo'.

I implemented the most basic neural net (following the instructions) and it is not performing very well.
I'm using Bag of Means to do document embedding which uses a Word2Vec model trained on the ted text.

I suspect that I have some sort of bug, as I'm a beginner with PyTorch. If the instructors don't mind, I'd like to share my code: [removed]
It is mostly modeled off this tutorial.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.