-
Notifications
You must be signed in to change notification settings - Fork 13
/
DESCRIPTION
81 lines (81 loc) · 3.32 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
Package: git2rdata
Title: Store and Retrieve Data.frames in a Git Repository
Version: 0.4.1
Authors@R: c(
person("Thierry", "Onkelinx", , "[email protected]", role = c("aut", "cre"),
comment = c(ORCID = "0000-0001-8804-4216", affiliation = "Research Institute for Nature and Forest (INBO)")),
person("Floris", "Vanderhaeghe", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-6378-6229", affiliation = "Research Institute for Nature and Forest (INBO)")),
person("Peter", "Desmet", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-8442-8025", affiliation = "Research Institute for Nature and Forest (INBO)")),
person("Els", "Lommelen", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-3481-5684", affiliation = "Research Institute for Nature and Forest (INBO)")),
person("Research Institute for Nature and Forest (INBO)", , , "[email protected]", role = c("cph", "fnd"))
)
Description: The git2rdata package is an R package for writing and reading
dataframes as plain text files. A metadata file stores important
information. 1) Storing metadata allows to maintain the classes of
variables. By default, git2rdata optimizes the data for file storage.
The optimization is most effective on data containing factors. The
optimization makes the data less human readable. The user can turn
this off when they prefer a human readable format over smaller files.
Details on the implementation are available in vignette("plain_text",
package = "git2rdata"). 2) Storing metadata also allows smaller row
based diffs between two consecutive commits. This is a useful feature
when storing data as plain text files under version control. Details
on this part of the implementation are available in
vignette("version_control", package = "git2rdata"). Although we
envisioned git2rdata with a git workflow in mind, you can use it in
combination with other version control systems like subversion or
mercurial. 3) git2rdata is a useful tool in a reproducible and
traceable workflow. vignette("workflow", package = "git2rdata") gives
a toy example. 4) vignette("efficiency", package = "git2rdata")
provides some insight into the efficiency of file storage, git
repository size and speed for writing and reading.
License: GPL-3
URL: https://ropensci.github.io/git2rdata/,
https://github.com/ropensci/git2rdata/,
https://doi.org/10.5281/zenodo.1485309
BugReports: https://github.com/ropensci/git2rdata/issues
Depends:
R (>= 3.5.0)
Imports:
assertthat,
git2r (>= 0.23.0),
methods,
yaml
Suggests:
ggplot2,
knitr,
microbenchmark,
rmarkdown,
testthat
VignetteBuilder:
knitr
Config/checklist/communities: inbo
Config/checklist/keywords: git; version control; plain text data
Encoding: UTF-8
Language: en-GB
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.3.2
Collate:
'clean_data_path.R'
'datahash.R'
'display_metadata.R'
'git2rdata_package.R'
'write_vc.R'
'is_git2rdata.R'
'is_git2rmeta.R'
'list_data.R'
'meta.R'
'print.R'
'prune.R'
'read_vc.R'
'recent_commit.R'
'reexport.R'
'relabel.R'
'rename_variable.R'
'update_metadata.R'
'upgrade_data.R'
'utils.R'
'verify_vc.R'