The Ocearch site compiles information about Shark locations by tagging individual sharks with trackers that 'ping' with their location. There are hundreds of sharks tracked, and tens of thousands of locational pings in total.
This Binder-enabled Github repo offers three levels of challenges to analyze the Shark data.
The easy challenge offers you already-cleaned data and asks five questions that can be answered with basic Pandas methods.
Easy questions:
- How many total pings are in the Ocearch data?
- How many unique species of sharks are in the data set?
- What is the name, weight, and species of the heaviest shark?
- When and where was the very first ping?
- Excluding results with 0 distance traveled: what's the minimum, average, and maximum travel distances?
The intermediate challenge forces you to do some data cleaning before being able to answer the Easy Challenge questions, and asks five more questions that require more complex Pandas methods, in particular using groupby
.
Intermediate questions:
- Which shark had the most pings?
- Which shark has been pinging the longest, and how long has that been?
- Which shark species has the most individual sharks tagged?
- What is the average length and weight of each shark species?
- Which shark has the biggest geographic box (largest distance from min lat/lon to max lat/lon, not dist_traveled)?
The hard challenge asks you to pull and parse data from the Ocearch api directly before answering the Easy, Intermediate, and Hard questions. The Hard questions will be added in a future date.