Ag Data Commons

File(s) stored somewhere else

Please note: Linked content is NOT stored on Ag Data Commons, and we cannot guarantee its availability, quality, or security, or accept any liability for it.

Complete genome sequence determined by the Genolevures Consortium in a comparative genomics study of hemiascomycete yeasts

Posted on 2024-06-11, 05:10. Authored by Genoscope, France; Genolevures Consortium; Institut Pasteur, France.
In a comparative genomics study designed to examine eukaryotic genome evolution, the Genolevures Consortium determined the complete genome sequences of the yeast species Kluyveromyces lactis, Candida glabrata, Yarrowia lipolytica, and Debaryomyces hansenii. These four hemiascomycete yeasts represent a broad evolutionary range within a single eukaryotic phylum. The consortium sequenced the genome of Kluyveromyces lactis strain NRRL Y-1140 at 11.4X coverage using whole genome shotgun (WGS) sequencing and assembled the sequences into chromosomes with no gaps. The consortium annotated the four yeast genome sequences, identifying approximately 24,200 novel genes, and classified the corresponding protein products, together with Saccharomyces cerevisiae proteins, into about 4,700 families. Genomic analysis revealed that the different yeast lineages have evolved through a variety of molecular mechanisms, including tandem gene repeat formation, segmental duplication, a massive genome duplication, and extensive gene loss.


Data contact name

BioProject Curation Staff


National Center for Biotechnology Information

Temporal Extent Start Date



  • Non-geospatial

ISO Topic Category

  • biota

Ag Data Commons Group

  • ARS Culture Collection

National Agricultural Library Thesaurus terms

genomics; sequence analysis; genome

Pending citation

  • No

Public Access Level

  • Public

Accession Number


Preferred dataset citation

It is recommended to cite the accession numbers that are assigned to data submissions, e.g. the GenBank, WGS or SRA accession numbers. If individual BioProjects need to be referenced, state that "The data have been deposited with links to BioProject accession number PRJNA13835 in the NCBI BioProject database ("
