Coder Social home page Coder Social logo

news_scraping's Introduction

News_Scraping

Please read the instructions.txt carefully before attempting the tasks

A good data scientist not only has extensive knowledge of machine learning, and deep learning, but also has the ability to extract and gather data from various sources and store it in a useable format. This task will introduce you to the first step of all data science tasks, data collection. One method of data collection is web scraping, which you will be working on in this task.

Problem Statement This project involves collecting data from various online sources. You are asked to collect relevant news data on different stocks, collect financial news headlines. The second part of the project is data cleaning and pre processing. You are asked to present a clean and usable dataset.

Instructions

  • Refer to beautiful soup's online documentation or refer to youtube videos if you run into a problem instead of using ChatGPT
  • Do not alter any prewritten code or comments
  • Be sure to add comments to make your code legible and to let the mentors understand what approach you have taken
  • Only use google colab to run the code

Procedure

  1. Fork and clone this repository onto your local device
  2. Open the .ipynb file on google colab
  3. Once you are done with the task, download as .ipynb and store it in a folder along with required files
  4. Name your file as your Enrollment number
  5. Push this file to forked repo and then send PR
  6. Your code will be reviewed by the mentors. Points will be granted once the PR is accepted and merged

Help

For any query feel free to contact [email protected]. You can also interact with the mentors and the geekhaven community on discord

news_scraping's People

Contributors

shashank1985 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.