Assembly and analysis of the genome of Notholithocarpus densiflorus.
SOD
cobarcoding
stLFR
tanoak
Journal
G3 (Bethesda, Md.)
ISSN: 2160-1836
Titre abrégé: G3 (Bethesda)
Pays: England
ID NLM: 101566598
Informations de publication
Date de publication:
01 Mar 2024
01 Mar 2024
Historique:
received:
04
12
2023
revised:
16
02
2024
accepted:
22
02
2024
medline:
1
3
2024
pubmed:
1
3
2024
entrez:
1
3
2024
Statut:
aheadofprint
Résumé
Tanoak (Notholithocarpus densiflorus) is an evergreen tree in the Fagaceae family found in California and southern Oregon. Historically, tanoak acorns were an important food source for Native American tribes and the bark was used extensively in the leather tanning process. Long considered a disjunct relictual element of the Asian stone oaks (Lithocarpus spp.), phylogenetic analysis has determined that the tanoak is an example of convergent evolution. Tanoaks are deeply divergent from oaks (Quercus) of the Pacific Northwest and comprise a new genus with a single species. These trees are highly susceptible to 'sudden oak death' (SOD), a plant pathogen (Phytophthora ramorum) that has caused widespread mortality of tanoaks. Here, we set out to assemble the genome and perform comparative studies among a number of individuals that demonstrated varying levels of susceptibility to SOD. First, we sequenced and de novo assembled a draft reference genome of N. densiflorus using co-barcoded library processing methods and an MGI DNBSEQ-G400 sequencer. To increase the contiguity of the final assembly, we also sequenced Oxford Nanopore (ONT) long reads to 30X coverage. To our knowledge, the draft genome reported here is one of the more contiguous and complete genomes of a tree species published to date, with a contig N50 of ∼1.2 Mb, a scaffold N50 of ∼2.1 Mb, and a complete gene score of 95.5% through BUSCO analysis. In addition, we sequenced 11 genetically distinct individuals and mapped these onto the draft reference genome enabling the discovery of almost 25 million single nucleotide polymorphisms and ∼4.4 million small insertions and deletions. Finally, using co-barcoded data we were able to generate complete haplotype coverage of all 11 genomes.
Identifiants
pubmed: 38427916
pii: 7617432
doi: 10.1093/g3journal/jkae043
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© The Author(s) 2024. Published by Oxford University Press on behalf of The Genetics Society of America.