Web Scraping

This repository contains Python scripts that demonstrate web scraping on a range of websites. Web scraping is the automated extraction of data from web pages, and each script here applies it to a different use case.

Contents

  1. Amazon Bestsellers: Scrape the bestseller products from different categories on Amazon.
  2. Kindle Store: Scrape Kindle books and their details from the Amazon Kindle store.
  3. Internshala: Extract internship listings and details from Internshala's website.
  4. Times of India Headlines: Scrape the latest headlines from the Times of India news website.
  5. UCI Machine Learning Repository: Collect machine learning datasets and information from the UCI ML Repository.
  6. Yahoo Finance: Scrape stock market data and financial information from Yahoo Finance.
  7. Airline Review Data: Extract customer reviews and ratings of airlines from a review website.
  8. HackerEarth: Scrape coding challenges and competitions from the HackerEarth platform.
  9. Swiggy Restaurants: Collect restaurant details and cuisines from the Swiggy food delivery website.
  10. Geeks for Geeks: Scrape coding tutorials and articles from Geeks for Geeks.
  11. eBay Products: Extract product listings and details from eBay.
  12. IMDb Movies: Scrape movie details and ratings from IMDb.
  13. Zomato Restaurants: Collect restaurant details and ratings from Zomato.
  14. ESPN Sports News: Scrape sports news and updates from ESPN.
  15. House Price (Magic Bricks): Extract real estate property details and prices from the Magic Bricks website.
  16. World Health Organization (WHO): Scrape the latest disease outbreak news list from the WHO website.
  17. Smart India Hackathon Data Scraper: Extract and compile problem statements and associated details from the Smart India Hackathon competition.
  18. Booking.com: Extract reviews of hotels in New Delhi from Booking.com.
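
As an illustration of the general pattern these scripts follow (parse a page's HTML, select the elements of interest, collect their text), here is a minimal sketch that uses only Python's standard library. It is not taken from the repository: the `HeadlineParser` class, the `extract_headlines` helper, the `headline` CSS class, and the sample HTML are all hypothetical, and the actual scripts may use different libraries and selectors. The sample parses an inline string so that it runs without network access.

```python
from html.parser import HTMLParser


class HeadlineParser(HTMLParser):
    """Collect the text of every <h2> whose class list contains "headline".

    The tag name and class are assumptions made for this example only.
    """

    def __init__(self):
        super().__init__()
        self.headlines = []
        self._in_headline = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        classes = (dict(attrs).get("class") or "").split()
        if tag == "h2" and "headline" in classes:
            self._in_headline = True
            self._buffer = []

    def handle_data(self, data):
        # Text inside nested tags (e.g. <em>) is also delivered here.
        if self._in_headline:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "h2" and self._in_headline:
            self.headlines.append("".join(self._buffer).strip())
            self._in_headline = False


def extract_headlines(html):
    """Return the headline strings found in an HTML document."""
    parser = HeadlineParser()
    parser.feed(html)
    parser.close()
    return parser.headlines


# Hypothetical page content standing in for a downloaded news page.
SAMPLE_HTML = """
<html><body>
  <h2 class="headline">First story</h2>
  <p>Body text that should be ignored.</p>
  <h2 class="headline featured">Second <em>story</em></h2>
  <h2 class="other">Not a headline</h2>
</body></html>
"""

if __name__ == "__main__":
    print(extract_headlines(SAMPLE_HTML))
```

To run this against a real site, the HTML string would come from an HTTP response instead of `SAMPLE_HTML`, and the tag and class checks would need to match that site's markup. Check the site's terms of use and `robots.txt` before scraping it.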

Datasets Available on Kaggle

Airline Review Dataset

Explore reviews for 491 different airlines with this dataset. The dataset provides valuable insights into the experiences of passengers with various airlines.

Link: Airline Review Dataset

House Price Dataset

This dataset consists of 188K rows of house prices from cities all over India. It's a great resource for analyzing housing trends and prices in different Indian cities.

Link: House Price Dataset

SIH Challenge Set

This dataset comprises a diverse collection of 563 problem statements presented during the Smart India Hackathon, offering valuable insights into innovative challenges across various domains in India.

Link: SIH Challenge Set

Hotel Reviews

Explore a dataset of 7,000 hotel reviews, each accompanied by a rating, giving a broad view of guest experiences.

Link: Hotel Reviews
