For a general overview of the pipeline welding process, see the repo README.
For the setup steps that need to be performed once at the start of a project, see the setup REAME.
Updates (or reverts) your recipe to include any desired changes.
- Allows you to make changes to your recipe from dataset to dataset (or batch to batch).
If you would like your recipe to include any updates to the official recipe:
git fetch upstream
git checkout master
git merge upstream/master
git push
If you would like your recipe to include any updates that you have made:
git checkout UPDATED-BRANCH
or
git checkout <commit_hash>
A repository for each dataset/batch.
- Retain all code, configuration files, computational environments, and directory structure that a standard Pooled Cell Painting workflow expects and produces.
Click "Use this template".
Enter a name for your new repository that includes your batch name and click "Create repository from template".
Now, fork and clone the forked repository to your favorite file system (local, AWS, GCP, etc.).
Next, we create a submodule in the repository we just created.
Adding a submodule initiates the weld.
- Link the processing code (recipe) with the data (current repo).
- Require a manual step to update the recipe to enable asynchronous development.
See below:
# In your terminal, clone the repository you just created (THIS REPO)
USER="INSERT-USERNAME-HERE"
REPO="INSERT-NAME-HERE"
git clone [email protected]:$USER/$REPO.git
# Navigate to this directory
cd $REPO
# Add the Recipe Submodule
git submodule add [email protected]:$USER/pooled-cell-painting-profiling-recipe.git recipe
Refer to "Adding a submodule" for more details.
Lastly, we commit the submodule to github.
Committing this change finalizes the weld initialization.
- Track the submodule (recipe) version with the current repository.
See below:
# Add, commit, and push the submodule contents
git add .gitmodules recipe
git commit -m 'link recipe submodule to initialize weld'
git push
Data is processed and figures and data are output.
- Track the submodule (recipe) version with the current data repository.
- Activate conda environment. (
conda activate pooled-cell-painting
) - Manually update the configuration yaml documents for your specific batch.
- Yaml documents with reasonable default values are available in the config/ folder.
- Do NOT change the location of the .yaml files.
- Additional documentation for each of the parameters is available in the config/docs/ folder.
- Execute
weld.py
(see below)
python weld.py