Coder Social home page Coder Social logo

phenomismatch / sightings-database Goto Github PK

View Code? Open in Web Editor NEW
5.0 3.0 1.0 10.22 MB

This is a database of sightings of birds, moths, and butterfiles in support of the phenology mismatch project.

License: MIT License

Python 97.66% R 2.34%
python3 pandas sqlite3 phenology ebird maps bbs pollard postgresql naba

sightings-database's Introduction

pheno-mismatch

Causes, consequences, and cross-scale linkages of climate-driven phenological mismatch across three tropic levels

Repository contribution

This repository contains scripts to help research and outreach efforts. There are other GitHub repositories for this project maintained by the various research labs.

Database

All data is stored in either a SQLite3 or PostgreSQL (PostGIS) database. We have gathered and consolidated the data from the sources below. Each source dataset records different things and therefore contains very different data. Our schema distills the common information needed for our study into a relational model and keeps the unique data, for further analysis, in NoSQL JSON blobs which are attached to each relational record.

  1. North American Breeding Bird Survey (BBS).
  2. MAPS: Monitoring Avian Productivity and Survivorship (MAPS).
  3. eBird Basic Dataset. This dataset has been culled to take everything between 20⁰ and 90⁰ latitude and -95⁰ and -50⁰ longitude. We are also only taking "complete" and "approved" checklists. Additionally, we are also only taking data for ~120 migratory bird species.
  4. Pollard butterfly dataset.
  5. NABA butterfly dataset.
  6. Caterpillar Counts dataset.
  7. NestWatch dataset.
  8. USGS Bird Banding Laboratory dataset.

There are 5 primary tables in the database:

  1. The Taxa table that hold data specific to each taxon.
  2. The Dataset table that hold information about where and when we extracted the data.
  3. The Place table contains where observations occur. Most datasets sample data repeatedly at designated locations. NoSQL place data is contained in the place_json field.
  4. The Event table holds when, how, and by whom data about the observation. NoSQL event data is in the event_json field.
  5. The Count table holds what was observed and how many birds or leps were observed during the event. count_json holds the NoSQL data.

Some record counts for the datasets:

Dataset Place Records Event Records Count Records Notes
BBS 5,690 122,925 > 6M
MAPS 1,224 619,335 > 2M
NestWatch 65,063 503,510 647,212
Pollard 760 86,996 86,958
NABA 1,132 2,135 305,810
eBird 1,986,208 16,820,802 > 120M Culled from > 650M records

Samples in R

See this R script for how to access the sqlite database in R.

Samples in Python

Most of the scripts in the lib directory access the database. I have moved common code into this library. A some sample_queries that use this library are in this Python script.

Output image

sightings-database's People

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

rafelafrance

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.