Skip to content
Arnold Kuzniar edited this page May 2, 2019 · 30 revisions

Inventory of tools and data formats

This is an overview of useful data-related software that is being used in being considered for projects at the Netherlands eScience Center.

Software I/O formats API Description Projects Engineers
OpenRefine CSV, TSV, RDF GREL Data cleaning & munging ODEX4all, candYgene Arnold
Virtuoso Universal Server (OSE) CSV, TSV, XML, RDF, JSON PL/SQL, SPARQL Object-relational DB, RDF Quad Store ODEX4all, candYgene, EOSCpilot LOFAR, GTCG Arnold
Neo4j CSV, TSV Cypher Graph database GTCG Arnold
SQLite CSV, TSV, GFF SQL (embedded) Relational database candYgene Arnold
Apache Solr TXT, XML Solr client APIs (RESTful) full-text search platform candYgene Arnold
SIGA.py GFF->SQLite DB->RDF Python Command-line tool to transform (semantify) genome annotations ODEX4all, candYgene Arnold
SAMtools SAM, BAM
VCF/BCFtools VCF, BCF
Clone this wiki locally