iscn / socdrahr2 Goto Github PK
View Code? Open in Web Editor NEWThis is an R package that haromonizes some digitial soil repositories.
License: BSD 2-Clause "Simplified" License
This is an R package that haromonizes some digitial soil repositories.
License: BSD 2-Clause "Simplified" License
see 07dc166
The ISCN database takes a long time to load. Is there anyway to speed this up?
Also looking for ways to reduce the file size.
Add documentation to the variable table from ISCN3. Add a description and any common units to the master variable table. Including:
The header data for the Pangaea download could use a robust parser. Below is an example of a hard coded parser for the C-PEAT project but it sure does feel like a regular data tree with a general solution somewhere.
https://github.com/ISCN/soilDataR/blob/c869479a3d01f1cce0209ab99e5daf84ef8c6d47/R/readCPEAT.R#L146
When trying to run the code through are on my Windows operating system, I run into many errors with the trying to run functions that have to open excel files and anywhere where the code calls for lines with "/n". I believe this is because Windows uses "/n/r" to indicate both new lines and return.
435 topsoil samples (0–10 cm); network includes 87 globally distributed plots and five soil replicates per plot
Delgado-baquerizo, Manuel; Ochoa-Hueso, Raúl (2020): Dataset of "Soil element coupling is driven by ecological context and atomic mass". figshare. Dataset. https://doi.org/10.6084/m9.figshare.13138811.v1 with paper citation: https://onlinelibrary.wiley.com/doi/full/10.1111
About 800 samples with P measured using Hadley method (physiochemical fractionation). Scrapped both the Chinese and English literature. We would need to expand the fractionation descriptions.
Hou, E., Tan, X., Heenan, M. et al. A global dataset of plant available and unavailable phosphorus in natural soils derived by Hedley method. Sci Data 5, 180166 (2018). https://doi.org/10.1038/sdata.2018.166
I just uploaded the HeckmanSwansonBiscuitBurn.rmd in commit 73284e1 can you check it out when you get the chance @ktoddbrown
Currently the package structure is messy.
Currently identified tasks:
dependent libraries need to be imported not called via 'library' in the R code.
better and revised function documentation
Commented one of the tables on the report, starting at line 66. see commit 1ade55f @ktoddbrown
Treatments need to be pulled out to their own data frame.
Given the below example
site_name | profile_name | soil org C | soil org C (sigma) | soil org C (unit) | soil org C (method) | bulk density (g cm-3) | pH | time on shelf (yrs) |
---|---|---|---|---|---|---|---|---|
Super secret site | sediment_10 | 0.3 | 1.2 | g cm-3 | lab method 4 | 1 | 7 | 0 |
Super secret site | sediment_30 | 0.1 | 2 | g cm-3 | lab method 4 | 1.02 | 3 | 0 |
Super secret site | sediment_10 | 0.3 | 1.2 | mg cm-3 | lab method 6 | 0.5 | 7 | 10 |
with this key:
header | var | dataframe | class | type | hardUnit |
---|---|---|---|---|---|
site_name | site_name | field | factor | NA | NA |
profile_name | profile_name | field | factor | NA | NA |
soil org C | soc | sample | numeric | value | NA |
soil org C (sigma) | soc | sample | numeric | sigma | NA |
soil org C (unit) | soc | sample | factor | unit | NA |
soil org C (method) | soc | sample | character | method | NA |
bulk density (g cm-3) | bd_sample | sample | numeric | value | g cm-3 |
pH | pH | sample | numeric | value | NA |
time on shelf (yrs) | storage_time | treatment | character | value | yr |
the process from template function breaks. You can hack it by renaming the profiles uniquely and treating the treatment as if it was a measurement, see the soils-long-tail-recovery/hackathon/example.*
I substituted out the select function that we were calling before and added updated comments. see commit 71ec46f @ktoddbrown
We need to take the three table key developed here: https://github.com/ESIPFed/soil_data_model_survey/tree/master/data and rewrite the data harmonization script. In theory this should be pretty straight forward.
Working on the threeTblKey
branch we need to do the following
readKeyedData
to readOneTableKeyData
and make a note that this is depreciated.tidyverse
?Investigate https://rstudio.github.io/distill/website.html to create a webpage front for this repo.
We deleted the duplicate dataset_layer and dataset_profile. See commit edb3b06 @ktoddbrown
see d6d4e6a
Link below of spreadsheet with summary of changes made between ISCN3 to ISCN3.5, as well as status, contributor, file name, and other notes.
https://docs.google.com/spreadsheets/d/1x9ZC6zcAhcCtkrzO2jd-LtBsPMFVT_0LPqnB-v1w-ZM/edit?usp=sharing
Do we have this in ISCN?
http://icp-forests.net/page/level-ii
Apparently they have soil chem data from almost 800 forests? Though admittedly, I have no clue what soil chem actually includes, but can always find out.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.