Coder Social home page Coder Social logo

rafaelmoraes003 / tech-news Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 21 KB

Analysis and manipulation of news data from a technology website obtained through data scraping using Python.

Dockerfile 1.66% Python 98.34%
crawler python web-scraping https mongodb parsel pymongo data-scraping

tech-news's Introduction

Tech News

This project aims to query a website that contains news about technology. To do this, data scraping was used, which is a technique for collecting data from online platforms. The data is captured from the scripts that are generated by the pages and programs that β€œscrape” the information. After the scraping is finished, the data is saved in a database.

With the data already saved and structured, the program allows to search by title, date, tags and news category.

An interactive menu is available so that the user can do the processes more easily.

Technologies used

python logo mongodb logo

How to use the application

Clone the application using the git clone command. After that, enter the project folder using the command cd tech-news.

How to run the application

  1. Create the virtual environment for the project
  • python3 -m venv .venv && source .venv/bin/activate
  1. Install the dependencies
  • python3 -m pip install -r dev-requirements.txt

Running the MongoDB database through Docker πŸƒ 🐳

In the root folder of the project, use the command docker-compose up -d mongodb.

Using the menu

  1. In the terminal, use the command:
  • python3 -m tech_news.menu

This command will bring up the menu, which contains several options on how to view the data that was collected from the scrape. If this is your first time using the application, first use option 0 on the menu to populate the database.

tech-news's People

Contributors

rafaelmoraes003 avatar trybe-tech-ops avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.