Coder Social home page Coder Social logo

krishnapriyakanduri / scrape-ml Goto Github PK

View Code? Open in Web Editor NEW

This project forked from gss0c24/scrape-ml

0.0 0.0 0.0 589 KB

For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.

License: MIT License

Python 3.46% Jupyter Notebook 96.54%

scrape-ml's Introduction

IMDB Movie review Scrapping

Scrapping the movie review ✏️ using python programming language💻.

For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.

Introduction

Semi-supervised-sequence-learning-Project 💻 replication process is done over here and for further analysis creation of new data is required.

  • The following script includes the following.
  • Movie_review_imdb_scrapping.ipynb - Script to scrap the data from imdb website
  • rename_files.ipynb - Script to rename the scrapped text files as per the requirements
  • convert_texts_to_csv.ipynb - Python script to make a CSV file from the txt files for SVM processing

Dependencies

install Beautifulsoup using pip install beautifulsoup4

Installation

1️⃣ Fork the Semi-supervised-sequence-learning-Project/ repository
Follow these instructions on how to fork a repository

2️⃣ Cloning the repository
Once you have set up your fork of the /Semi-supervised-sequence-learning-Project repository, you'll want to clone it to your local machine. This is so you can make and test all of your personal edits before adding it to the master version of /Semi-supervised-sequence-learning-Project.

Navigate to the location on your computer where you want to host your code. Once in the appropriate folder, run the following command to clone the repository to your local machine.

git clone [email protected]:your-username/sanjay-kv/Semi-supervised-sequence-learning-Project.git.git

Final Dataset

1️⃣ Here is the Link to Final Dataset: Drive Link

scrape-ml's People

Contributors

sanjay-kv avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.