Coder Social home page Coder Social logo

samimakela / iba-cocktails Goto Github PK

View Code? Open in Web Editor NEW

This project forked from rasmusab/iba-cocktails

0.0 0.0 0.0 942 KB

The International Bartenders Association (IBA) Official Cocktails in CSV and JSON format

License: Other

Shell 0.01% R 0.12% HTML 99.87%

iba-cocktails's Introduction

IBA Cocktails in CSV and JSON format

This repo includes all the International Bartenders Association (IBA) Official Cocktails in CSV and JSON format as of 2023, from two different sources: The IBA website and Wikipedia's list of IBA cocktails. My take on the difference between these sources is that the IBA website is more "official" (it's their list, after all), but the Wikipedia recipes are easier to follow.

Files scraped from The IBA website that you'll find in the iba-web folder:

  • iba-cocktails-web.csv: A CSV file with one row per cocktail. This means the ingredients are all smushed into a single column as a comma-separated list.
  • iba-cocktails-ingredients-web.csv: A CSV file with one row per cocktail ingredient. For example, there are three rows for the Margarita's (1) tequila, (2) triple sec, and (3) lime juice. Here each ingredient description has also been lightly parsed into its quantity, unit, and ingredient. For example, 15 ml Freshly Squeezed Lime Juice has quantity: 15, unit: ml, and ingredient: Freshly Squeezed Lime Juice.
  • iba-cocktails-web.json: A JSON list with one dictionary/object per cocktail. This JSON includes the combined information from iba-cocktails-web.csv and iba-cocktails-ingredients-web.csv.

Files scraped from Wikipedia's list of IBA cocktails that you'll find in the wikipedia folder:

Included here are also the R scripts I used to scrape this data. These scripts are guaranteed to work on my computer, on the 5th of Match 2023, at least once.

If you want to run the scraping and cleaning script from scratch, you could either run the scripts 01_scrape.R and 02_clean.R interactively, and hope that you just happen to have the right version of R and the required packages installed. Or, if you have docker installed, you can run ./dockerized-scrape-iba-web.sh and dockerized-scrape-iba-wikipedia.sh. However, if Wikipedia changes or IBA changes their website just a tiny bit from how it was on 2023-03-05 then these scripts are likely going to fail, anyway.

Also, see teijo/iba-cocktails for another IBA cocktails dataset.

iba-cocktails's People

Contributors

rasmusab avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.