Europeana Embedding API

Spring-Boot2 wrapper around legacy Python code to generate Embeddings. The wrapper makes the performance of a single request a bit slower, but makes the API more stable and capable of processing multiple requests at the same time (at the cost of increasing memory usage)

Prerequisites

Java 17
Maven^*
Europeana parent pom
Python3.6

^{* A Maven installation is recommended, but you could use the accompanying mvnw (Linux, Mac OS) or mvnw.cmd (Windows)
files instead.}

Build

mvn clean install (add -DskipTests) to skip the unit tests during build

Run locally

The application has a Tomcat web server that is embedded in Spring-Boot. Either select the EmbeddingsApplication class in your IDE and 'run' it

or

go to the application root where the pom.xml is located and excute
./mvnw spring-boot:run (Linux, Mac OS) or mvnw.cmd spring-boot:run (Windows)

For local debugging

Launch a Python process manually. For this either use the Dockerfile in the python folder or make sure Python 3.6 is installed. When using Docker to launch Python:

Don't forget to map the port specified in the test-run.sh file.
In the Executor class modify the 127.0.0.1 address to the IP of the Docker container and comment out the createProcess method.
In the EmbeddingsService class, comment out the Python 3.6 check in the checkRequirements method.

Deployment to Kubernetes (for testing purposes)

Generate a Docker image using the project's Dockerfile
Configure the application by generating a embedding.user.properties file and placing this in the k8s folder. After deployment this file will override the settings specified in the embedding.properties file located in the src/main/resources folder. The .gitignore file makes sure the .user.properties file is never committed.
Configure the deployment by setting the proper environment variables specified in the configuration template files in the k8s folder
Deploy to Kubernetes infrastructure.

Deployment to a physical server

For good performance we recommend deploying the Embedding API on a server that has a recent NVIDIA card. This will speed up embedding generation significantly.

Prerequisites

NVIDIA graphics drivers are installed
NVIDIA Container Toolkit is installed (and the system is restarted after installation)
Run nvidia-smi to check if the GPU can be accessed
Docker is installed

Installation

Copy the the project's docker-compose.yml file to the server and run docker-compose up.

License

Licensed under the EUPL 1.2. For full details, see LICENSE.md.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Europeana Embedding API

Prerequisites

Build

Run locally

For local debugging

Deployment to Kubernetes (for testing purposes)

Deployment to a physical server

Prerequisites

Installation

License

About

Releases

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
k8s		k8s
python		python
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
README.md		README.md
docker-compose.yml		docker-compose.yml
mvnw		mvnw
mvnw.cmd		mvnw.cmd
owasp-suppress.xml		owasp-suppress.xml
pom.xml		pom.xml

License

europeana/embedding-api

Folders and files

Latest commit

History

Repository files navigation

Europeana Embedding API

Prerequisites

Build

Run locally

For local debugging

Deployment to Kubernetes (for testing purposes)

Deployment to a physical server

Prerequisites

Installation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Contributors 2

Languages