Coder Social home page Coder Social logo

microbiomedata / mixs-6-2-release-candidate Goto Github PK

View Code? Open in Web Editor NEW
5.0 5.0 0.0 9.34 MB

Proposed, Harmonized MIxS 6.2

Home Page: https://github.com/GenomicsStandardsConsortium/mixs6.2_release_candidate

License: MIT License

Makefile 19.71% Python 72.08% HTML 0.49% JavaScript 0.63% CSS 1.04% Jinja 6.05%

mixs-6-2-release-candidate's People

Contributors

turbomam avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

mixs-6-2-release-candidate's Issues

Required terms must have examples and preferably validation constraints

generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'api' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'basin' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'coll_site_geo_feat' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'collection_date' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'env_broad_scale' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'env_local_scale' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'env_medium' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'iwf' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'microbial_biomass_meth' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'occup_density_samp' is a required property
generated/mixs_v6.xlsx.examples.yaml::$.exhaustive_test_set[0]: 'water_cut' is a required property

When I'm talking about schemasheets here, I mean...

  • generated/GSC_MIxS_6_usage_populated_no_blank_cols.tsv
  • which is generated by generate_and_populate_template in project.Makefile
  • that doesn't retain extra schemasheet header rows, for annotation inner_keys, internal_separators (for lists)

misc

  • infer enumerations as long as string_serialization starts with [, contains |s and doesn't include , : or (
  • make sure examples from slots using the enum match a PV form the enum
  • #55
  • #56

GSC narrative about slot combining etc.

GSC/MIxS TWG people have is said that any slot from any checklist or environmental package can be combined together in any submission, and even that submissions with non-MIxS slots should be accepted

The problem with this is that submitter may think of their own term names/column heading for a variable, not realizing that MIxS already has a term with the same semantics. That reduces the FAIRness of the data.

@only1chunts suggested rating submissions with 0 to 5 stars

I will experiment some more with extra slots in closed and open schemas

see also

terms sharing MIxS IDs

  • MIXS:0000018: depth & soil_depth
  • MIXS:0000091: associated_resource & url
  • MIXS:0000110: samp_stor_temp & samp_store_temp
  • MIXS:0000116: samp_stor_dur & samp_store_dur
  • MIXS:0000335: soil_text_measure & texture
  • MIXS:0000336: soil_texture_meth & texture_meth
  • MIXS:0000338: tot_n_meth & tot_nitro_cont_meth
  • MIXS:0000525: tot_car & tot_carb
  • MIXS:0000689: tot_phos & tot_phosphate
  • MIXS:0000755: samp_stor_loc & samp_store_loc
  • MIXS:0001107: samp_name & sample_name
  • MIXS:0001139: Food_source & food_sourc
  • MIXS:0001184: Food_Product_type & food_product_type
  • MIXS:0001225: samp_collect_method & sample_collec_method

create combination classes

  • checklists should be mixins
    • try that with environmental package classes too?
  • addition of combinations for all checklists X all environmental packages will make the schema painfully large

make sure provided examples pass validation

  • automate
  • rules for when to trust the example vs when to trust the Value syntax, Expected value, etc?
  • some provided examples might actually contain multiple examples, potentially split with multiple different delimiters

Interpretation of Requirement codes

from https://github.com/GenomicsStandardsConsortium/mixs/wiki/5.-MIxS-checklists

  • not applicable (-): descriptor is not applicable for a given checklist type

    • implemented in LinkML by deleting an association between the named term on a row in the XLSX sheet and the class implied
  • C conditional mandatory (C): descriptor must be present for compliance with the checklist, but only when applicable to the study, i.e. if this item is not applicable for the study the metadata data will still be checklist compliant even if it is left out

    • implemented in LinkML as recommended: true
  • E environment-dependent (E): descriptor must be present depending on the environment the original sample was obtained from

    • globally implemented as recommended: true
    • in the slot usage of the named term and the implied class, implemented as required: true
  • M mandatory (M): descriptor must be present for compliance with the checklist_

    • implemented in LinkML as required: true
  • X optional (X): descriptor may be present, not mandatory for compliance with checklist

    • implemented in LinkML as required: false and recommended: false

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.