- Check the quality of the data using Reads_QC (Hafnia spp project in VIGASP)
- Check the species identification using Species Abundance pipeline
- Use the reference data (Hafnia_PostiveControl project in VIGASP) given by Girum to find a cut-off for species identification process
- Build Genome Assembly for the Hafnia data in Hafnia spp project in VIGASP
- Download the Gene sequences for the list of AMR genes given by Girum
- Download the Gene sequences for the list of Virulence genes given by Girum
- Run Abricate on the AMR and VirulenceGenes and tabulate the results.
- Use CoreGenomeSNP_Phylogeny pipeline to build the Tree
-
Check the quality of the data using Reads_QC - DONE 94 Hafnia samples from "Hafnia spp." project was analysed using "Reads_QC" pipeline Results:
- Coverage
- Phred Score
- QC%
- Predicted Species (using proportion of reads mapped) using Kraken2 and Bracken
-
Check the species identification using Species Abundance pipeline - DONE
- Predicted species using Kraken2 and Bracken (kind of redundant results from Reads_QC)
-
Use the reference data to find a cut-off for species identification process - DONE
- Girum sent us 3 Hafnia strains to use as reference to decide on Cut-off for species confirmation
- The 3 sample were analyzed using Reads_QC pipeline and "ProportionofReads" mapped used as cut-off for species identification prediction
-
Download the Gene sequences for the list of AMR genes.
-
Update: Many of the nucleotide_IDs are outdated and not able to download only the gene sequences
-
Suggestions from Håkon: Use NCBIAmrPlus for AMR genes
-
Use ALPPACA for the SNP analysis and phylogenetics.