A standalone crawler that crawls only .gov.si web sites using Playwright.
cp .env.example .env
Edit .env file if necessary. Number of threads can be set using the N_THREADS parameter.
docker-compose up -d ieps-db
pip install virtualenv
python<version> -m venv <virtual-environment-name>
source env/bin/activate
Alternatively you can set it up using Pycharm.
pip install -r requirements.txt
playwright install
python migrate.py
python main.py
You can run PgAdmin Docker container with the following command:
docker-compose up -d pgadmin
Access the pgadmin4 via your favorite web browser by visiting the URL. Use the [email protected] as the email address and root as the password to log in.