Ag Data Commons
Browse

File(s) stored somewhere else

Please note: Linked content is NOT stored on Ag Data Commons and we can't guarantee its availability, quality, security or accept any liability.

Salmo salar strain:St. John River | breed:North American Atlantic salmon Genome sequencing and assembly

dataset
posted on 2024-06-11, 06:43 authored by USDA/ARS
The North American (N.A.) Atlantic salmon is a sub-species of Atlantic salmon with substantial genomic differences from the European sub-species of the same species. A high quality reference de-novo assembly is needed for the N.A. Atlantic salmon. To accomplish this, we generated 104x genome coverage of PacBio RS-II long-read sequences with a male N.A. Atlantic salmon from the St. John River broodstock of the USDA breeding program in Maine, USA. Contigs were assembled from the long reads using the Canu pipeline. The assembly was further improved with Bionano optical maps and Hi-C proximity ligation sequence data to generate 1,755 scaffolds. The scaffolds were polished using the PacBio raw reads and ~132x Illumina short reads. A genetic map composed of 27 linkage groups was generated from 36K SNP markers and 2,512 fish from 141 full-sib families. The genetic linkage information was used to anchor the scaffolds to 27 chromosomes and to order and orient the scaffold sequences in the chromosome sequences.

Funding

Agricultural Research Service, 8030-31000-005 (In-house USDA funding)

History

Data contact name

BioProject Curation Staff

Publisher

National Center for Biotechnology Information

Temporal Extent Start Date

2022-01-05

Theme

  • Non-geospatial

ISO Topic Category

  • biota

National Agricultural Library Thesaurus terms

genomics; sequence analysis; genome

Pending citation

  • No

Public Access Level

  • Public

Accession Number

PRJNA759632

Preferred dataset citation

It is recommended to cite the accession numbers that are assigned to data submissions, e.g. the GenBank, WGS or SRA accession numbers. If individual BioProjects need to be referenced, state that "The data have been deposited with links to BioProject accession number PRJNA759632 in the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/)."

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC