Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Very large input data files in REMIND #218

Open
mikapfl opened this issue Feb 16, 2023 · 1 comment
Open

Very large input data files in REMIND #218

mikapfl opened this issue Feb 16, 2023 · 1 comment
Assignees

Comments

@mikapfl
Copy link
Contributor

mikapfl commented Feb 16, 2023

Hi,

in REMIND, modules/35_transport/edge_esm/input/ is very large (1.1 GB) after input data is downloaded. That is somewhat wasteful of storage resources on the cluster and probably also pretty inefficient when reading the data.
As far as I understand, only a small handful of rather small files is read into GAMS code using $include, while most files (and in particular, the really big ones like pref.cs4r) are either never read or read in R. Only for reading into GAMS, the uncompressed cs4r file format is really necessary, for reading into R, the compressed binary format .mz would be more appropriate. Maybe you can check if you can use that to reduce the size of the input data? In my testing, this would reduce the input data size from 1.1 GB to something like 60 MB, which would be fantastic.

Cheers

Mika

@orichters
Copy link
Contributor

@johannah-pik, @jmuessel: I also see these large files in the output folders and wonder if I really need the data for all gdp and EDGE-T scenarios in pref.cs4r when this folder clearly contains only a specific one.

Any progress planned on this? Maybe you could ask RSE to help with it? Thanks!

@johannah-pik johannah-pik self-assigned this Oct 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants