Coder Social home page Coder Social logo

sdmx-csv's People

Contributors

dosse avatar olavtenbosch avatar sosna avatar tzaphkiel avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

sdmx-csv's Issues

SDMX-CSV data without series/data

This is somehow related to #5 but the business case here is outside SDMX REST.

Would it be valid to have SDMX-CSV containing only dataset level data attributes ?

Handling of missing values

from @sosna
I could not find the SDMX syntax to be used to identify missing values in the SDMX-CSV documentation.

The appropriate approach for missing values in SDMX-CSV should be clarified.
The solution should also distinguish between “values to be set to 'missing'” and “values not to be changed when appending”.

Related ticket in SDMX-XML: sdmx-twg/sdmx-ml#32
Related Ticket in SDMX-JSON: sdmx-twg/sdmx-json#122

Why dataflow only?

The SDMX-CSV spec states: "For the first column, the dataflow column, always is the term DATAFLOW".

Typically, SDMX data messages require a reference to structural metadata but this reference can be a data structure, a dataflow or a provision agreement. Considering this, is there a reason why SDMX-CSV accepts one of these 3 types only (i.e. dataflow)?

Also, the minimum required to parse an SDMX data message is a data structure. So forcing a dataflow might seem to push the bar a bit too high, especially in cases where dataflows are not available.

Last, CSV messages have no header and therefore no way to pass the provider ID (unless I’m mistaken). At least, if you could reference a provision agreement instead of a dataflow, there would be a work-around.

A solution would be to:

  • Rename the 1st column Structure
  • Specify the artefact type (e.g. Dataflow=BIS:CBS(1.0) instead of BIS:CBS(1.0))

What do you think?

Improve the narrative structure of the SDMX-CSV data specification

The SDMX-CSV data specification should have an overview like this:

Column Name Description Order
STRUCTURE Data structure type (dataflow, datastructure, dataprovision) 1
STRUCTURE_ID Artefact identification information 2
STRUCTURE_NAME Localized name of the artefact 3 (Optional)
ACTION Action code (I - Information, A - Append, R - Replace, D - Delete) 4 (Optional)
SERIES_KEY Series key for the observation 5 (Optional)
OBS_KEY Observation key 6 (Optional)
Remaining Columns Component IDs or localized names
Special cases:
EXAMPLE: text Extra column with the localized name for the EXAMPLE component
EXAMPLE[] Multi-valued
EXAMPLE[de;fr] Multi-lingual
PARENT.CHILD Nested metadata attribute
PARENT[].CHILD Nested metadata attribute of multi-valued parent attribute

And then discuss them in detail, based on the type of column. The current document is very difficult to follow as it is at the moment.

(In doubt, please handle this as a public review comment on SDMX 3.1 once the comment period begins.)

SDMX-CSV data without observations/data

Dear all,

In SDMX REST data queries it is possible include the query parameter detail=serieskeysonly or detail=nodata.
If SDMX-CSV is requested with detail=serieskeysonly or detail=nodata then what should be the output ?
Could an SDMX-CSV data file exist without any observations ?

Thanks in advance

Escaping the character used as separator between dimension values in a key

The field guide states the following about keys (the highlight is mine):

The column will contain the combination of IDs/values for all the dimensions, order by their order in the data structure definition and separated by a dot character (.), e.g. M.USD.EUR.SP00.2020-01

However, the guide does not specify what to do in case a dimension value contains a dot, i.e. the character to be used as separator. This cannot be the case in case of coded dimensions (as the . is not an allowed character for an SDMX code), but could be the case if:

  • The dimension is uncoded;
  • The dimension takes its values from a Valuelist, instead of a Codelist.

So, in case a dimension value contains a dot, what should the service provider do, when building the series and/or observation keys if SDMX-CSV data messages?

Thank you.

Migrate the documentation to readthedocs.org

Decision has been taken to migrate the documentation to readthedocs.org.

We should take this opportunity to consolidate and improve the current documentation.
This repository will details the CSV format part of the full documentation glued together in sdmx-im.

Public review feedback received from OECD (18/09/2017)

SDMX Standard must assure possibility to validate the content, everything else should be implementation-specific

  • Make header mandatory
  • Make use of IDs mandatory. An allowed variant could be ID+Name to respect the main use case of dissemination
  • Fix the separator to comma

Change the description of the repository

I suppose this statement is obsolete

This repository is used for maintaining the draft proposal for SDMX-CSV message specifications. This is an early draft and there are no guarantee it will lead to a future SDMX-CSV specification.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.