Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes.
Fiche publication
Date publication
novembre 2020
Journal
BMC bioinformatics
Auteurs
Membres identifiés du Cancéropôle Est :
Dr POCH Olivier, Dr THOMPSON Julie, Pr COLLET Pierre
Tous les auteurs :
Meyer C, Scalzitti N, Jeannin-Girardon A, Collet P, Poch O, Thompson JD
Lien Pubmed
Résumé
Recent advances in sequencing technologies have led to an explosion in the number of genomes available, but accurate genome annotation remains a major challenge. The prediction of protein-coding genes in eukaryotic genomes is especially problematic, due to their complex exon-intron structures. Even the best eukaryotic gene prediction algorithms can make serious errors that will significantly affect subsequent analyses.
Mots clés
Error correction, Gene prediction, Genome annotation, Primates, Protein sequence errors
Référence
BMC Bioinformatics. 2020 Nov 10;21(1):513