• Home
  • UCSC journals portal
  • ANID repository
  • UCSC Thesis Repository
  • English
  • Español
  • Log In
    Have you forgotten your password?
  1. Home
  2. Productividad Científica
  3. Publicaciones Científicas
  4. annotate_my_genomes: an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing
 
Options
annotate_my_genomes: an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing
Dr. Farkas-Pool, Carlos 
Facultad de Medicina 
Recabal, Antonia
Mella, Andy
Candia-Herrera, Daniel
González-Olivero, Maryori
Jonathan-Haigh, Jody
Tarifeño-Saldivia, Estefanía
Caprile, Teresa
10.1093/gigascience/giac099
GigaScience
2022
Background: The advancement of hybrid sequencing technologies is increasingly expanding genome assemblies that are often annotated using hybrid sequencing transcriptomics, leading to improved genome characterization and the identification of novel genes and isoforms in a wide variety of organisms.
Results: We developed an easy-to-use genome-guided transcriptome annotation pipeline that uses assembled transcripts from hybrid sequencing data as input and distinguishes between coding and long non-coding RNAs by integration of several bioinformatic approaches, including gene reconciliation with previous annotations in GTF format. We demonstrated the efficiency of this approach by correctly assembling and annotating all exons from the chicken SCO-spondin gene (containing more than 105 exons), including the identification of missing genes in the chicken reference annotations by homology assignments.
Conclusions: Our method helps to improve the current transcriptome annotation of the chicken brain. Our pipeline, implemented on Anaconda/Nextflow and Docker is an easy-to-use package that can be applied to a broad range of species, tissues, and research areas helping to improve and reconcile current annotations. The code and datasets are publicly available at https://github.com/cfark as/annotate_my_genomes
Thumbnail Image
Download
Name

annotate_my_genomes. an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing.pdf

Size

5.12 MB

Format

Checksum
Transcriptome annotation
Genome Annotation pipeline
SCO-spondin
Hybrid sequencing
Historial de mejoras
Proyecto financiado por: