Coder Social home page Coder Social logo

big-commerce's Introduction

Big Commerce - Scrape latam marketplace

Big commerce is a scraping tool for marketplace ecommerce in latam

Prerequisites

  • python >= 3.7.4
  • Pip Installed (Recommended >= 21.0.1)
  • Install Python packages with pip and requirements.txt
    $ pip install -r requirements.txt

Running the Scraper

Run command:

$ python main.py

alt text

  1. Select marketplace to scraper (Equivalent a --marketplace option)
  2. select marketplace country (Equivalent a --country option)
  3. select products category to scrape

whats happen?

  1. Bigcommerce crawl the subcategories of the selected category and save the data in a csv file in root directory ./output/{PID}-{MARKETPLACE}-{COUNTRY}-categories.csv

    What does it scrape?

    Value Description
    id category id in marketplace
    name category name in marketplace
    href category link in marketplace
    hierarchy level in category tree in marketplace (Init 1 hierarchy)
    parent parent of category in category tree
    index order category in category tree
    subcategories number of child categories
  2. Big commerce crawl all the products of each category, browses through all the pages of the category and scraper the page of each product, in each iteration of the category pages, the product data is saved in a csv file in the directory ./output/{PID}-{MARKETPLACE}-{COUNTRY}-products.csv

    What does it scrape?

    Value Description
    name name of product
    link link product in marketplace
    price_simbol currency symbol
    price price product
    price_discount product discount percentage
    best_seller indicates if the product have flag best sellers
    promotional indicates if the product is sponsored
    category_id product category in the marketplace
    number_sales public number of sales in the marketplace
    seller product seller
    rating product rating
  3. Play time with data

Options

Optional argument Description
--marketplace name marketplace to scrape
--country Country code where you want to scrape marketplace, available: co, mx
--recursive Browse product pages recursively, available: True or False

Marketplace and available commands:

$ python main.py --help

Contributing

  1. Fork the repo on GitHub
  2. Clone the project to your own machine
  3. Commit changes to your own branch
  4. Push your work back up to your fork
  5. Submit a Pull request so that we can review your changes

Join our supergroup on Telegram https://t.me/joinchat/BwvX7B03bTg4Y2Fh

big-commerce's People

Contributors

91-julian-sanchez avatar

Stargazers

Stanley Melgar avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.