Understanding the causes of errors in eukaryotic protein-coding gene prediction: a case study of primate proteomes.

Fiche publication


Date publication

novembre 2020

Journal

BMC bioinformatics

Auteurs

Membres identifiés du Cancéropôle Est :
Dr POCH Olivier, Dr THOMPSON Julie, Pr COLLET Pierre


Tous les auteurs :
Meyer C, Scalzitti N, Jeannin-Girardon A, Collet P, Poch O, Thompson JD

Résumé

Recent advances in sequencing technologies have led to an explosion in the number of genomes available, but accurate genome annotation remains a major challenge. The prediction of protein-coding genes in eukaryotic genomes is especially problematic, due to their complex exon-intron structures. Even the best eukaryotic gene prediction algorithms can make serious errors that will significantly affect subsequent analyses.

Mots clés

Error correction, Gene prediction, Genome annotation, Primates, Protein sequence errors

Référence

BMC Bioinformatics. 2020 Nov 10;21(1):513