Coder Social home page Coder Social logo

finnstue / europeanstartups-scraper Goto Github PK

View Code? Open in Web Editor NEW

This project forked from darideveloper/europeanstartups-scraper

0.0 0.0 0.0 484 KB

Python scraper for extract data from the page europeanstartups, using python, and a google chrome data with a premium account already logged.

License: MIT License

Python 100.00%

europeanstartups-scraper's Introduction



Europeanstartups Scraper

Europeanstartups Scraper

Python scraper for extract data from the page europeanstartups, using python, and a google chrome data with a premium account already logged.

Project type: client


Table of Contents
  1. Build With
  2. Media
  3. Details
  4. Install
  5. Settings
  6. Run
  7. Roadmap

Build with

Media

web page 1

web page 2

sample terminal

sample csv file

Details

The project extract all results from the page https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union, and save the output data in a csv file.

The project is a python script, that use a google chrome data with a premium account already logged, to extract the data from the page.

The data extract is:

  • NAME
  • DEALROOM SIGNAL
  • MARKET
  • TYPE
  • LAUNCH DATE
  • VALUATION
  • FUNDING
  • LOCATION
  • LAST ROUND
  • REVENUE
  • STATUS
  • GROWTH STAGE
  • EMPLOYEES
  • OWNERSHIP
  • MARKET CAP
  • DEBT
  • URL WEBSITE
  • LINKEDIN PROFILE
  • TWITTER PROFILE
  • FIRM VALUATION
  • TAGS

Install

Prerequisites

Installation

  1. Clone the repo
    git clone https://github.com/darideveloper/europeanstartups_scraper.git
  2. Install python packages (opening a terminal in the project folder)
    python -m pip install -r requirements.txt 

Settings

Create a .env file with the following content

 CHROME_PATH = C:/Users/<<your-user-name>>/AppData/Local/Google/Chrome/User Data/ #the chrome path is the folder where chrome data its installed
 SHOW_BROWSER = True # Show (True) or Hide (False) the google chrome window

Run

  1. Go to https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and create an account (if you have problems with your email, try with a proton email)
  2. Activate the premium trial or buy a premium account
  3. be sure to keep the account logged in the browser.
  4. Open a terminal in the project folder
  5. Run the project folder with python:
    python .
  6. Wait until the script finish, and check the output.csv file in the project folder (note: while the script its running, you can't use google chrome).

Roadmap

  • Use chrome data fror avoid login in the page
  • Extract all data from the page
  • Save output data in csv file

europeanstartups-scraper's People

Contributors

darideveloper avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.