determine what to do when techMD detects corrupt files #378

Open
andrewjbtw opened this issue Aug 23, 2022 · 0 comments

[This is a stub for future work.]

Sometimes deposited files are already corrupt, for example with missing end-of-file data or bad headers. This can stall the techMD process, because the metadata extraction tools report errors when they try to characterize the files.

We don't have a good process for handling this situation right now. We can skip the techMD step and still get the files deposited, but it seems like we should have some kind of techMD field that we could query for known-corrupt files, and then follow up with users where that's possible.
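As a rough illustration of the "techMD field" idea, here is a minimal Python sketch, not the actual techMD implementation. It assumes the extraction tool is run as an external command that exits non-zero on failure. The `TechMdRecord` class, the `characterization_status` field, and the `"ok"`/`"failed"` values are all hypothetical names made up for this example.

```python
import subprocess
import sys
from dataclasses import dataclass, field


@dataclass
class TechMdRecord:
    # Hypothetical record shape; the real techMD schema may differ.
    filename: str
    characterization_status: str  # "ok" or "failed" (assumed values)
    tool_errors: list = field(default_factory=list)


def characterize(filename, command, timeout=60):
    """Run an extraction command and record failure instead of stalling."""
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout
        )
    except subprocess.TimeoutExpired:
        return TechMdRecord(filename, "failed", [f"timed out after {timeout}s"])
    except OSError as exc:
        # The tool could not be started at all.
        return TechMdRecord(filename, "failed", [str(exc)])
    if result.returncode != 0:
        message = result.stderr.strip() or f"exit code {result.returncode}"
        return TechMdRecord(filename, "failed", [message])
    return TechMdRecord(filename, "ok")


def known_corrupt(records):
    """Return the filenames whose characterization failed."""
    return [r.filename for r in records if r.characterization_status == "failed"]


if __name__ == "__main__":
    # Stand-in commands; a real pipeline would call its extraction tools here.
    good = characterize("good.tif", [sys.executable, "-c", "print('ok')"])
    bad = characterize(
        "bad.tif",
        [sys.executable, "-c", "import sys; sys.stderr.write('bad header'); sys.exit(1)"],
    )
    print(known_corrupt([good, bad]))
```

The point of the sketch is that a failed characterization becomes data we can query later, rather than an error that blocks the deposit. A failed status only means the tool could not characterize the file, so it is a hint that the file may be corrupt, not proof.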

Note that this ticket is specifically about situations where a file is already corrupt prior to deposit, not corruption that might be caused by hardware or software failures during accessioning. Our system should be catching any corruption that happens during accessioning by verifying file checksums as files make their way through the pipeline.
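For contrast, the checksum verification mentioned above can be sketched roughly like this. This is a generic Python illustration and not the accessioning pipeline's actual code. The function names and the choice of SHA-256 are assumptions for the example.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Compute a hex digest by reading the file in chunks."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify_fixity(path, expected_hex, algorithm="sha256"):
    """Return True if the file still matches the digest recorded earlier."""
    return file_digest(path, algorithm) == expected_hex.lower()
```

A check like this only shows that a file has not changed since its digest was recorded. A file that was already corrupt at deposit time will pass it, which is why this ticket needs a separate mechanism.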
