Skip to content

biolab-helper/datasets

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Orange data set repository

A collection of descriptions of data sets that are served in Data Set widget in Orange and programs for generating the descriptions from a given data set.

Each data set is described with a record that contains the following attributes:

  • name: data set file name (without extension)
  • title: a short title (less then 40 characters)
  • description: description of the data set
  • collection: name of original repository
  • references: any references to essential publications of the data set
  • tags: a list of tags
  • target: the type of target variable ("categorical", "numeric", or null)
  • version: data set version
  • year: year when the data set was first published
  • instances: number of data instances
  • missing: does data contain any missing values?
  • variables: number of all variables (including target and meta variables)
  • source: the web page from where the data set was obtained
  • url: the web address of the Orange-readable file with the data set

Following is an example of description record for the wine data set. Note that the description would most often be longer and would contain at least a paragraph of text:

{
    "name": "wine",
    "title": "Wine tasting",
    "description": "Wine profiling data with attributes from chemical analysis.",
    "collection": "UCI",
    "references": [
        "Smit J, Miles C, Novak J (2016) On health impact of red wine, Altruism 18(3):42-142.",
    ],
    "tags": ["classification", "small"],
    "target": "categorical",
    "version": "1.0",
    "year": 1991,
    "instances": 178,
    "missing": false,
    "variables": 13,
    "source": "https://archive.ics.uci.edu/ml/datasets/Wine",
    "url": "http://my.web.server/wine.xls"
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%