Coder Social home page Coder Social logo

book-genre-classification's Introduction

Book-Genre-Classification

Classification of books into genres using cover and title.

We used the base convolution layers of pre-trained Xception model on ImageNet dataset to extract the cover features and used pre-trained GloVe model to get the feature vectors from pre-processed titles. The Xception model implementation was taken from keras and glove library for Python was used.

A logistic regression model was used for the final classification. Scikit-learn was used to implement this. Using only the cover features, we got an accuracy of around 67%, using only GloVe around 86% and finally using both of these gave the best accuracy of 87%, beating even human accuracy.

The main conclusion was that the title of the book tells us a lot more about the contents of the book when compared to it's cover. The entire project was done in Python.

Xcetion+Glove1 jupyter notebook has the necessary code. Many features were extracted using on the fly code and saved into pickle files, which are being imported in the given jupyer notebook. Since some of these saved pickle files are pre-trained GloVe vectors, they are around 250 MB, which exceed GitHub's requirment that each file be less than 25 MB. Because of this, we are not able to provide them right now.

The dataset for the book covers was obtained from : https://github.com/uchidalab/book-dataset

Main collaborators for this project were: Ganesh Biradar, Aravind Varier

book-genre-classification's People

Contributors

biradarganesh25 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.