jobs
util
.coveragerc
.dockerignore
.flake8
.gitignore
.isort.cfg
.pydocstyle.ini
Dockerfile
Makefile
README.md
config.py
conftest.py
package-lock.json
package.json
pyproject.toml
requirements-dev.txt
requirements.txt
worker.py

README.md

Stencila Hub Worker

Purpose

The worker service processes jobs sent to one or more job queues. It is designed to run either inside or outside the cluster in which the other services are running. For example, users may wish to run their own worker instances, listening to their account's virtual host on the broker.

Approach

The worker is a Celery process. Jobs are defined as Python classes in the jobs directory. However, jobs are not restricted to being implemented in Python; they may make use of other processes (see SubprocessJob), Docker containers, or Kubernetes pods.
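
As a rough illustration of this approach, the sketch below shows how a job written as a Python class might hand its work to another process. The Job base class, its do and run methods, and EchoJob are hypothetical names chosen for this example; they are not the worker's actual classes (see the jobs directory and SubprocessJob for those), and Celery registration is left out entirely.

```python
import subprocess
import sys
from typing import List


class Job:
    """Hypothetical minimal base class; NOT the worker's real Job class."""

    name = "job"

    def do(self, *args, **kwargs):
        raise NotImplementedError("Subclasses implement the work in do()")

    def run(self, *args, **kwargs):
        # A real worker would also report status and handle errors here.
        return self.do(*args, **kwargs)


class EchoJob(Job):
    """Hypothetical job that does its work in a child process."""

    name = "echo"

    def do(self, words: List[str]) -> str:
        # The current Python interpreter stands in for an external program
        # so that the example runs anywhere; a real job could call any
        # executable available on the worker.
        script = "import sys; print(' '.join(sys.argv[1:]))"
        command = [sys.executable, "-c", script, *words]
        completed = subprocess.run(
            command, capture_output=True, text=True, check=True
        )
        return completed.stdout.strip()


if __name__ == "__main__":
    print(EchoJob().run(["hello", "worker"]))
```

Keeping the external call inside a single method means the surrounding class can stay the same whether the work is done in Python, in a subprocess, or in a container.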

Testing

Unit tests

Each of the jobs should have at least one *_test.py file. You can run all the tests like this:

make test

Or, with coverage, using:

make cover

If you want to run tests individually, use pytest directly, e.g. to run only the tests for the convert job:

./venv/bin/pytest jobs/convert

Some of the jobs, in particular those in jobs/pull, involve making HTTP requests. To speed up test runs and to allow them to be run offline, we use pytest-recording to record requests and their responses. To enable this for a test, add the @pytest.mark.vcr decorator and run the test once with --record-mode=rewrite, e.g.

./venv/bin/pytest --record-mode=rewrite jobs/pull/elife_test.py

The generated YAML files in the cassettes folder should be committed. If need be, you can run the tests again in rewrite mode to update the cassettes.

Kubernetes sessions

Some jobs, notably Kubernetes sessions, are most easily developed and tested by trying them out on a cluster. You can create a new session pod at the command line using:

./venv/bin/python3 -m jobs.session.kubernetes_session --debug

This will create a pod in the jobs namespace of the current kubectl context. You can then check the details of that pod, e.g.

kubectl describe pod -n jobs session-f304ed49bcddc038d49d1be5e9227e86

Or shell into it using kubectl exec e.g.

kubectl exec -it -n jobs session-f304ed49bcddc038d49d1be5e9227e86 -- bash
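
If you find yourself repeating these commands for different pods, a small helper along the following lines can build them for you. This is only a convenience sketch: describe_command and exec_command are hypothetical functions, not part of the worker, and the pod name shown is a placeholder.

```python
import shlex
from typing import List


def describe_command(pod: str, namespace: str = "jobs") -> List[str]:
    """Build the kubectl command that shows the details of a session pod."""
    return ["kubectl", "describe", "pod", "-n", namespace, pod]


def exec_command(pod: str, namespace: str = "jobs", shell: str = "bash") -> List[str]:
    """Build the kubectl command that opens an interactive shell in a pod."""
    return ["kubectl", "exec", "-it", "-n", namespace, pod, "--", shell]


if __name__ == "__main__":
    pod_name = "session-<id>"  # placeholder; use the name of your own pod
    # Print shell-quoted commands; pass the lists to subprocess.run to execute.
    print(shlex.join(describe_command(pod_name)))
    print(shlex.join(exec_command(pod_name)))
```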
