Ag Data Commons
Browse

Data from: Dairy cows inoculated with highly pathogenic avian influenza virus H5N1

Download (6.86 GB)
dataset
posted on 2024-11-14, 20:17 authored by AMY BAKERAMY BAKER, Bailey Arruda, Mitchell Palmer, Paola M. Boggiatto, Kaitlyn Sarlo-Davila, Alexandra Buckley, Giovana Ciacci ZanellaGiovana Ciacci Zanella, Celeste A. Snyder, Tavis AndersonTavis Anderson, CARL HUTTERCARL HUTTER, Thao-Quyen Nguyen, Alexey Markin, Kristina Lantz, Erin Posey, Mia Torchetti, Suelee Robbe-Austerman, Drew R. Magstadt, Patrick J. Gorden

Raw read data from: Baker AL, Arruda B, Palmer MV, Boggiatto P, Davila KS, Buckley A, Zanella GC, Snyder CA, Anderson TK, Hutter CR, Nguyen TQ. Markin, A., Lantz, K., Posey, E.A., Torchetti, M.K., Robbe-Austerman, S., magstadt, D.R., Gorden, P.J. Dairy cows inoculated with highly pathogenic avian influenza virus H5N1. Nature. 2024 Oct 15:1-3.

Additional details on data processing and computational pipelines are provided at: GitHub - flu-crew/dairy-cattle-hpai-experiment: data and code to support Baker et al. HPAI H5N1 in dairy cattle

The Highly Pathogenic Avian Influenza (HPAI) H5N1 genotype B3.13 virus used this study was collated (A/dairy cattle/Texas/24- 531 008749-002/2024: TX/24: NCBI PP755581- PP755588) with other B3.13 strains generated within Nguyen et al. and newly sequenced data collected between April 16 and May 8, 2024. Influenza A virus Ribonucleic acid (RNA) from samples was amplified, then we generated complementary DNA (cDNA) libraries by using the Illumina DNA Sample Preparation Kit, (M) Tagmentation (Illumina, 535 https://www.illumina.com) and either the iSeq or NextSeq Reagent Kit v2 (Illumina) according to manufacturer instructions. We performed reference-guided assembly of genome sequences using IRMA v1.1.5. To analyze the Illumina short read data for 82 samples collected during the experimental challenge, we used the “Flumina” pipeline (https://github.com/flu-crew/Flumina) for processing and analyzing influenza data. The pipeline uses Python v3.10, R v4.4 (R Development Core Team 2024), and SnakeMake to organize programs and script execution. The reads are preprocessed using FASTP, removing adapter contamination, low complexity sequences, and other artifacts. Consensus contigs for phylogenetics were assembled using IRMA v1.1.5. The pipeline maps cleaned reads to the reference cattle strain derived from sequencing the stock inoculum of A/dairy cattle/Texas/24-008749-002/2024 using BWA (bwa index –a bwtsw). High frequency single nucleotide variants (SNV) were called with GATK v.4.4 41 and low frequency SNVs were called with LoFreq. To assess potential SNV phenotypic changes, a database was generated using the Sequence Feature Variant Types tool from the Influenza Research Database for all eight genes. To estimate genome-wide natural selection, we used the program SNPGenie on the VCF files.

Funding

USDA-ARS: 5030-32000-231-000D

Oak Ridge Institute for Science and Education: DE-AC05-06OR23100

Centers for Disease Control and Prevention: 21FED2100395IPD

NIAID CENTERS OF EXCELLENCE FOR INFLUENZA RESEARCH AND RESPONSE (CEIRR)

National Institute of Allergy and Infectious Diseases

Find out more...

History

Data contact name

Anderson, Tavis K

Data contact email

tavis.anderson@usda.gov

Publisher

Ag Data Commons

Intended use

These data include raw Illumina sequence data for influenza A virus strains collected from experimentally infected dairy cattle. Following incorporation into a larger gene sequence dataset they may be used to study the evolutionary history of H5N1 subtype influenza A viruses and can be used to validate the results generated in Baker et al. https://doi.org/10.1038/s41586-024-08166-6

Use limitations

These data are derived from a small animal experiment and should not be used as representative of IAV evolution. The limitations of the study and these data are described in Baker et al. https://doi.org/10.1038/s41586-024-08166-6

Temporal Extent Start Date

2024-03-25

Temporal Extent End Date

2024-09-30

Frequency

  • notPlanned

Theme

  • Non-geospatial

ISO Topic Category

  • biota
  • farming
  • health

National Agricultural Library Thesaurus terms

dairy cows; Influenza A virus; avian influenza; viruses; RNA; complementary DNA; nucleotide sequences; inoculum; genes

ARS National Program Number

  • 103

ARIS Log Number

417441

Pending citation

  • No

Public Access Level

  • Public