An Atlas of Bovine Gene Expression Reveals Novel Distinctive Tissue Characteristics and Evidence for Improving Genome Annotation
dataset
posted on 2024-09-29, 05:07authored byAnimal Health, USDA-ARS USMARC
Background: A comprehensive transcriptome survey, or "gene atlas", provides information essential for a complete understanding of the genomic biology of an organism. We present an atlas of RNA abundance for 92 adult, juvenile and fetal cattle tissues and 3 cattle cell lines. Results: The Bovine Gene Atlas was generated from 7.2 million unique digital gene expression tag sequences (300.2 million total raw tag sequences), from which 1.59 million unique tag sequences were identified that mapped to the bovine genome accounting for 85% of the total raw tag abundance. Filtering these tags yielded 87,764 unique tag sequences that unambiguously mapped to 16,517 annotated protein-coding loci in the genome accounting for 45% of the total raw tag abundance. Clustering of tissues based on tag abundance profiles generally confirmed ontology classification based on anatomy. There were 5,429 constitutively expressed loci and 3,445 constitutively expressed unique tag sequences mapping outside annotated gene boundaries that represent a resource for enhancing current gene models. Physical measures such as inferred transcript length or antisense tag abundance identified tissues with atypical transcriptional tag profiles. We report for the first time the tissue specific variation in the proportion of mitochondrial transcriptional tag abundance. The Bovine Gene Atlas can be examined at http://www.agbase.msstate.edu/bovineatlas . Conclusions: The Bovine Gene Atlas is the deepest and broadest transcriptome survey of any livestock genome to date. Commonalities and variation in sense and antisense transcript tag profiles identified in different tissues facilitate the examination of the relationship between gene expression, tissue, and gene function. Overall design: An atlas of mRNA abundance for 92 adult, juvenile and fetal cattle tissues and 3 cattle cell lines.
It is recommended to cite the accession numbers that are assigned to data submissions, e.g. the GenBank, WGS or SRA accession numbers. If individual BioProjects need to be referenced, state that "The data have been deposited with links to BioProject accession number PRJNA126207 in the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/)."