Ag Data Commons
Browse

Enzymatic exome sequencing and its application to triaged genotyping of multiplexed CRISPR-edited plants

dataset
posted on 2025-07-23, 01:39 authored by Yale University
Whole-genome sequencing has become a common strategy to genotype individual plants of interest. Although a limited number of genomic regions usually need to be surveyed with this strategy, excess sequencing information is almost always generated at an appreciable financial cost for the users. For example, heterochromatic sequences, which can account for more than 80% of the genome size of some plants, are oftentimes not required in these genotyping projects. Therefore, strategies that enrich euchromatic DNA coding for the protein-coding genes prior to sequencing can provide far more useful sequence information and drastically lower sequencing costs. Here, we present the development and application of MsRR-Seq, which relies on the cytosine methylation-sensitive restriction enzyme MspJI to deplete heterochromatic DNA before library construction and sequencing. By applying MsRR-Seq to citrus and maize, we show that protein-coding genes can be enriched in sequencing datasets. We then describe the application of MsRR-Seq to facilitate the identification of complex mutants from populations of citrus plants resulting from multiplex CRISPR/Cas9 editing of four genes. Overall, this work demonstrates an easy and low-cost method to enrich euchromatic DNA in whole-genome sequencing projects, an approach that is especially useful for large plant genomes with an excessively high proportion of methylated repetitive sequences.

Funding

USDA: 2023-70029-41277

History

Data contact name

BioProject Curation Staff

Publisher

National Center for Biotechnology Information

Temporal Extent Start Date

2025-07-03

Theme

  • Non-geospatial

ISO Topic Category

  • biota

National Agricultural Library Thesaurus terms

sequence analysis

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

It is recommended to cite the accession numbers that are assigned to data submissions, e.g. the GenBank, WGS or SRA accession numbers. If individual BioProjects need to be referenced, state that "The data have been deposited with links to BioProject accession number PRJNA1286006 in the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/)."

Accession Number

PRJNA1286006

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC