MoGAAAP: A modular Snakemake workflow for automated genome assembly and annotation with quality assessment
Abstract
With the current speed of sequencing, there is a desire for standardised and automated genome assembly and annotation to produce high-quality genomes as input for comparative (pan)genomics. Therefore, we created a convenience pipeline using existing tools that creates annotated genome assemblies from HiFi (and optionally ultra-long ONT and/or Hi-C) reads for a set of related accessions as well as a related reference genome. Our pipeline is species-agnostic and generates an extensive quality assessment report that can be used for manual filtering and refinement of the assembly and annotation. It includes statistics for individual completeness and contamination assessments as well as a concise pangenome view. The pipeline is implemented in Snakemake and available with a GPLv3 license at GitHub under github.com/dirkjanvw/MoGAAAP and at Zenodo under doi.org/10.5281/zenodo.14833021.
Related articles
Related articles are currently not available for this article.