ecologicaltraitdata / ets Goto Github PK
View Code? Open in Web Editor NEWA Data Standard for Ecological Trait Datasets
Home Page: https://ecologicaltraitdata.github.io/ETS/
License: Creative Commons Attribution 4.0 International
A Data Standard for Ecological Trait Datasets
Home Page: https://ecologicaltraitdata.github.io/ETS/
License: Creative Commons Attribution 4.0 International
@aostrow
A sub-folder 'bexis' now holds information for the BExIS trait template. The XML template in this folder should be updated every time the BExIS template is updated to the current version of ETS.
You could also drop your script for producing the XML template here, or we create an R-based script to produce the new version of the template any time the ETS reaches a new release.
The metadata fields should be more clearly defined. The current authorLastname
and authorFirstname
aimed for compatibility with TRY. However, this is not sufficient to capture e.g. contact e-mail etc. Maybe some ISO Standard can be considered here?
for integration with BEXIS2
update xml file in https://github.com/EcologicalTraitData/ETS/blob/master/bexis/traitTemplate.xml to reflect changes made in v0.10
see: https://terminologies.gfbio.org/terms/ets/pages/VERSIONS.html
refer to source file: https://github.com/EcologicalTraitData/ETS/blob/master/ETS.csv
in v0.7 I noticed several spelling issues in the definitions or comments of the term. I wasn't sure whether to modify the .csv file and make a pull request or list them here.
One general question is whether the text is written in American English (AE) or British English (BE). So far it is a mixture of both, but it should be consistent.
Words that are currently written in BE are:
Words that are currently written in AE are:
One of these two blocks should be corrected.
General spelling errors:
Most dates in the document are formatted according to the American notation of month/day/year. There are however three cases, that use the format day/month/year, causing ambiguity and wrong dates.
Two terms use the DateIssued of "5/9/2017" with is supposed to be the fifth of September. Both terms have the version "v0.4". The other terms from version 4 have issue dates from 8/24/2017 till 10/24/2017. Version 3 terms were all created on 7/7/2017.
Two terms have the DateModified of "1/11/2017" which is supposed to be the first of November, as the terms were all created on 7/7/2017.
Seven terms have the DateModified of "15/11/2017" which is an obvious error, no ambiguity here.
To avoid such errors in the future, I would suggest to switch to the ISO 8601 format of YY-MM-DD (related XKCD).
In Discussion with David of GFBio, we figured that this can be much simplified. The controlled vocabulary could be listed in the 'Definition' field, with an unambiguous definition of the terms we are adding to the DwC options, i..e 'ExpertKnowledge', 'MuseumSpecimen'.
after renaming the double record terms (#23), the terms traitNameStd
, traitValueStd
, traitUnitStd
and scientificNameStd
should have their new versions traitName
, traitValue
, traitUnit
and scientificName
in field replacedBy
.
All documentation pages should use relative links to point to the terms (and link back to the top of page), to work on gfbio.org and github.com alike.
a release branch is created for v0.9 that includes only
This will be used to receive a new DOI at ZENODO.org.
The human-readable GFBio release is a static website including the full vocabulary, as well as instructions for contribution (About page), and also including the best practice guidelines and examples. The version history is only showing the change log, not the static websites of previous versions.
The development branch maintains an own website for preview (hosted on Github Pages), which shows the current version of the terms as well as viewports for all previous versions. This site will have a big disclaimer that this is not the official release channel and users of the standard should refer to gfbio.org.
Hi,
AmNat requires in-text citations to be (Author, Year) (https://www.journals.uchicago.edu/journals/an/instruct#aims), but the AmNat csl file produces (Author Year) in-text citations.
Thanks,
Stefano
We should define the URI Scheme of the terms.
First of all, see #3.
Second, we have several options for setting our own URI addresses.
In my opinion the latter is the most sustainable solution. It seemed to me that it would be solved in the same process as importing the RDF ontology, and GFBio would be very open to do this.
Any opinions here? Otherwise I would push forward option 3.
URIs often do not match the case of the Term. Infact spelling is fairly inconsistent. The URI root is http://terminologies.gfbio.org/terms/ETS/
and complemented by a string identical to the Term itself (with mixed case spelling) as in valueType
, or modified to a lowercase string as in the case of traitvaluestd
. This irrgularity can cause wrong calls of the API or broken reference to the human readable website.
The URI resolves to the human readable website only if it is correctly mapped to the lower case URL of style https://terminologies.gfbio.org/terms/ets/pages/#term
.
This should aim for a minimal consensus vocabulary for trait thesaury.
The reviewer comments pointed out that the common way of keeping original author entries along with standardized entries by having a 'verbatim' term next to the standard term. i.e. we will rename terms or change definitions
This has some messy side effects on the compatibility of the current and future version of ETS. But at this stage it is possible without doing much harm. Any thoughts on this?
the version column for the term expectedUnit states "y0.4" instead of "v0.4"
add a term unitID
to contain a pointer to a metric ontology.
Changes on both git page and csv
date issued
is missing for several terms (e.g. MeasurementOrFact
, taxonID
) - check if this is okdate issued
or date modified
are in different formats (month/day or day/month)Traitlist
: all valueTypes
are missingscientificName
- Definition
: "Original character string provided as species name by the data owner (kept for reference and continueity)" [typo in continuity]occurrenceID
- Definition
: a bona fide global unique identifier [maybe not clear what "bona fide" means?]warnings
, relationSource
, eventID
: put capital letters at the beginning of sentences (for harmonisation)datasetID
and statisticalMethod
- Comment
: "e.g." could be "Examples" (for harmonisation)taxonRank
, kingdom
, phylum
, class
, order
, family
, genus
: some have a date issued
and some have a date modified
but the date is the same. Should they be all date issued
?author
- Comment
: Examples: [remove or fill the field]valueType
; only "Date" have a capital letteryear
, month
, day
- Description
: description is not harmonised: e.g. "The four-digit year, at which" vs "The ordinal month, in which" vs "The integer day of the month on which"Mismatches: in git but not csv or vice versa
description
fields have a space before the beginning of the text (in locationID
, eventDate
, statisticalMethod
, Exploratory
)To include instructions for how to contribute, should be referenced from the README and will automatically be highlighted from the issues page.
Am 11.04.2019 um 08:00 schrieb Prof Brian Ripley:
This concerns packages
[...] traitdataform [...]
which are failing their checks in a strict Latin-1 locale: see the debian-clang results. (Several of these seem to stem from vcr.)
On Linux, such a locale can be ensured via LC_CTYPE=en_US (which may need installing for distros that micro-package). AFAWK it cannot be done on Windows.
- The character in don't is an (ASCII) apostrophe, not a right quote:
don’t
(with a right quote) is used in packages [...] (and others not failing).
en and em dashes are not portable, found in packages
[...] traitdataform [...] .Using \uxxxx coding for non-ASCII chars in R character strings should help in some cases (see 'Writing R Extensions').
Please correct before May 10 to safely retain the package on CRAN.
This link in the README does not load https://terminologies.gfbio.org/terminology/?ontology=ETS. Is there a new link or alternative way to reach this page?
once the csv is updated,
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.