GoogleNewsScraper scrapes articles from Google News RSS feeds that match a given query.
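To illustrate what scraping an RSS feed for a query involves, here is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the function names (`build_feed_url`, `fetch_feed`, `parse_feed`), the sample feed, and the use of the public `https://news.google.com/rss/search` endpoint are assumptions made for illustration.

```python
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Assumption: the public Google News RSS search endpoint.
# The notebook in this repository may build its URL differently.
GOOGLE_NEWS_RSS = "https://news.google.com/rss/search"


def build_feed_url(query):
    """Return the RSS search URL for a query (hypothetical helper)."""
    return f"{GOOGLE_NEWS_RSS}?{urlencode({'q': query})}"


def fetch_feed(url, timeout=10):
    """Download the raw feed bytes. Requires network access."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def parse_feed(xml_text):
    """Extract title, link and publication date from each RSS <item>."""
    root = ET.fromstring(xml_text)
    articles = []
    for item in root.iter("item"):
        articles.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": item.findtext("pubDate", default=""),
        })
    return articles


# A tiny hand-written feed so the parser can be tried offline.
SAMPLE_FEED = """<rss version="2.0"><channel>
<item>
  <title>Example headline</title>
  <link>https://example.com/a</link>
  <pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate>
</item>
</channel></rss>"""

if __name__ == "__main__":
    print(build_feed_url("climate change"))
    for article in parse_feed(SAMPLE_FEED):
        print(article["title"], article["link"])
```

To try it against a live feed, pass the result of `fetch_feed(build_feed_url("your query"))` to `parse_feed`.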
├── gnews-scaper                        # Root folder
    ├── src/                            # Source code directory
        ├── GoogleNewsScaper/           # Google News scraper directory
            ├── GoogleNewsScaper.ipynb  # IPython notebook with the code that scrapes Google News for a given query
    ├── .gitignore                      # .gitignore file
    ├── requirements.txt                # Library dependencies
- Clone the repository into a folder
git clone [email protected]:kishore-s-15/news-scaper.git
- Change to the project root directory.
- Create a virtual environment
On Windows, run
python -m venv env
On Linux and macOS, run
python3 -m venv env
- Activate the virtual environment
On Windows, run
env\Scripts\activate.bat
On Linux and macOS, run
source env/bin/activate
- Install the project's dependencies in the virtual environment
pip install -r requirements.txt
- Then run the following command
On Windows, run
python src\GoogleNewsScaper\GoogleNewsScaper.py
On Linux and macOS, run
python3 src/GoogleNewsScaper/GoogleNewsScaper.py
This should start scraping the news articles and print them to the console.
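If the script fails with import errors, one quick check is whether the interpreter is actually running inside the virtual environment created in the steps above. The sketch below is not part of the repository, and the helper name `in_virtualenv` is hypothetical. It relies on the standard `sys.prefix` and `sys.base_prefix` attributes, which differ when a `venv` environment is active.

```python
import sys


def in_virtualenv():
    """Return True when running inside a venv-style virtual environment."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("interpreter:", sys.executable)
```

If it reports that no virtual environment is active, rerun the activation command for your platform before installing the dependencies.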