You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would be very useful if {STATcubeR} could support "SDMX archives" which are generated from STATcube. sdmx archives consist of a metadata component called the "structure definition" and a data part which contains the actual cell values. In order to support that, we would need to add parsers for the xml-based data format.
The generated archives are more or less compatible with the CRAN package rsdmx: https://cran.r-project.org/package=rsdmx, which could be used as a starting point to develop parsers.
Possible usuage: parser function sdmx_table() which generates an object of class sc_data (the parent class for OGD and STATcube-API datasets)
x <- STATcubeR::sdmx_table("path/to/sdmx_archive.zip")
class(x)
#> [2] "sdmx_table" "sc_data" "R6"
There are several advantages of the sdmx format compared to the API
The structure definition contains information about hierarchical classifications, which are not available via the API
The download option "sdmx archive" is available even if STATcube is used as a "guest user"
SDMX is used by other SuperSTAR products such as SuperCROSS
The last point is probably the most compelling one since a direct interface to SuperCROSS would be very helpful for the internal workflows of statistics austria
The text was updated successfully, but these errors were encountered:
this is the first step to resolving
#27 by adding a function that creates
sc_table() like objects based on sdmx
archives
The sdmx format contains all metadata
that is necessary for STATcubeR to reuse
the existing $tabulate() workflow and this
first version already provides support for
various features via the base class (sc_data)
- $tabulate() to aggregate data
- $total_codes() to set/unset total codes
- $recoder to recode datasets (change labels)
change codes, toggle visibility of
elements, reorder elements, etc.
- importing german and english labels
simultaniously (both languages are included
in a zip download) and allowing to swhitch
between them using $language<-().
New features
- sdmx arcives provide a $parent column
in the $fields() table which are used
to represent hierarchical classifications.
Previously, this was only possible with
od_table()
There are still some improvements. See
the issue #27 for more details
- properly parse time variables -
currently they are treated as generic
categories.
- parse element annotations (detailed
descriptions for classification
elements) and add them to
$field()$de_desc just like with
OGD dataset
- parse value annotations (see #39)
- provide a print/fromat method
- add a reasonable logic for total
codes that takes the parent codes into
account
- fill meta$measures$fun and
$meta$measures$precision based on
the sdmx metadata
- modify very long codes which use
the @-symbol (probably for escapes)
- extend documentation
- possibly check SuperCROSS compability
It would be very useful if
{STATcubeR}
could support "SDMX archives" which are generated from STATcube. sdmx archives consist of a metadata component called the "structure definition" and a data part which contains the actual cell values. In order to support that, we would need to add parsers for the xml-based data format.The generated archives are more or less compatible with the CRAN package rsdmx: https://cran.r-project.org/package=rsdmx, which could be used as a starting point to develop parsers.
Possible usuage: parser function
sdmx_table()
which generates an object of classsc_data
(the parent class for OGD and STATcube-API datasets)There are several advantages of the sdmx format compared to the API
The last point is probably the most compelling one since a direct interface to SuperCROSS would be very helpful for the internal workflows of statistics austria
The text was updated successfully, but these errors were encountered: