Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Should we add tooling for a search + load mementos pipeline? #17

Open
Mr0grog opened this issue Dec 3, 2019 · 0 comments
Open

Should we add tooling for a search + load mementos pipeline? #17

Mr0grog opened this issue Dec 3, 2019 · 0 comments
Labels
question Further information is requested

Comments

@Mr0grog
Copy link
Member

Mr0grog commented Dec 3, 2019

We have a lot of code we left behind in web-monitoring-processing around knitting together the search and get_memento methods in a high-performance way across many threads. I think it’s good that we didn’t include any of that here to start with, but would it make sense to add some of that back in eventually?

i.e. Some tooling that supports the workflow:

┌──────────────────┐   ┌ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ┐   ┌─────────────────┐
│   Sequence of    │     ┌─────────────┐   ┌─────────────┐     │   Sequence of   │
│     URLs/URL     │──▶│ │   search    │──▶│ get_memento │ │──▶│    Mementos     │
│     Patterns     │     └─────────────┘   └─────────────┘     │                 │
└──────────────────┘   └ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ┘   └─────────────────┘

It should be significantly more abstract than the way it is currently implemented in https://github.com/edgi-govdata-archiving/web-monitoring-processing/blob/master/web_monitoring/cli.py, of course. 😉

@Mr0grog Mr0grog added the question Further information is requested label Oct 21, 2020
@Mr0grog Mr0grog moved this to Backlog in Wayback Roadmap Dec 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
Status: Backlog
Development

No branches or pull requests

1 participant