Mind the gaps – ignoring errors in long read assemblies can critically affect protein prediction

Information
Authors: 
Warr, A. & Watson, M.
Journal: 
Nature Biotechnology
Journal publication date: 
2019
DOIs: 
https://doi.org/10.1038/s41587-018-0004-z
Abstract

Long read, single molecule sequencing technologies are now routinely used for whole-genome sequencing and assembly. However, even after multiple rounds of correction, many errors can remain which can critically affect protein coding regions, resulting in significantly altered and often truncated protein predictions.