Coder Social home page Coder Social logo

clarkcga / multi-temporal-crop-classification-training-data Goto Github PK

View Code? Open in Web Editor NEW
0.0 3.0 3.0 15.43 MB

This repository contains the pipeline for generating a training dataset for land cover and crop type segmentation using USDA CDL data.

License: Apache License 2.0

Jupyter Notebook 99.16% Dockerfile 0.84%
earth-observation foundation-models geospatial-analytics geospatial-data remote-sensing

multi-temporal-crop-classification-training-data's People

Contributors

hamedalemo avatar kordi1372 avatar mcecil avatar

Watchers

 avatar  avatar  avatar

multi-temporal-crop-classification-training-data's Issues

Query and export HLS data

Using the definition of tiles from here, we need to retrieve corresponding scenes from HLS dataset.
Specification:

  • Find three best cloud-free scenes during the growing season. (growing season needs to be defined after we decide on what crop types to include, 1)
  • export the scenes for each chip. Scenes should include all the bands from HLS data at 30m spatial resolution.

Compare the grids of HLS and CDL

Both HLS and CDL are at 30m spatial resolution.
We need to confirm if they are using the same coordinate system and grid. If not, decide how to project one to the other. Preferably, we should project HLS to CDL as CDL is a categorical data layer and more noise will be added to the dataset if we reproject it.

The outcome of this task should be:

  • the final AOI for our training dataset, which can be a subset of the GFM AOI. (the AOI needs to be a multiply of 224 x 224 pixel chips)
  • The projection and coordinate system for the training dataset

Download CDL

Retrieve CDL data (latest year available) for the region that GFM is trained on (AOI).
If needed, crop the dataset to the AOI and store under /data.

Define tiles for the training dataset

Divide the training dataset AOI (from here) to tiles of 224 x 224 pixels (at 30m spatial resolution).
The outcome can be a GeoJSON which stores all of these tiles, or any other compatible data format.
We will use the bounding boxes of these tiles to chip the CDL dataset and query HLS scenes.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.