
The first mitogenome of Lauraceae (Cinnamomum chekiangense)

2024-03-09 10:19
Plant Diversity, 2024, Issue 1

Changwei Bi, Ning Sun, Fuchuan Han, Kewang Xu, Yong Yang*, David K. Ferguson

a State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Center for Sustainable Forestry in Southern China, Key Laboratory of Tree Genetics and Biotechnology of Educational Department of China, Key Laboratory of Tree Genetics and Silvicultural Sciences of Jiangsu Province, Nanjing Forestry University, Nanjing 210037, China

b College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China

c Research Institute of Subtropical Forestry, Chinese Academy of Forestry, Hangzhou 311400, China

d Co-Innovation Center for Sustainable Forestry in Southern China, College of Life Sciences, Nanjing Forestry University, Nanjing 210037, China

e Department of Paleontology, University of Vienna, Vienna, Austria

Higher plants have three distinct genetic systems: the dominant nuclear genome and the semi-autonomous organelle genomes (plastids and mitochondria). In contrast to the conserved plastid genome (plastome), the plant mitochondrial genome (mitogenome) is characterized by an intriguing “evolutionary paradox”: a remarkably low mutation rate combined with a significantly high rearrangement rate (Palmer and Herbon, 1988; Lai et al., 2022). Because of this extremely low sequence mutation rate but frequent genomic recombination, plant mitochondria are considered an important genetic system for studying the evolution of genome structure and functional content (Wu et al., 2022). They are also an ideal resource for studying the mechanisms by which plant genetic diversity forms and is maintained (Wang et al., 2022). However, plant mitogenomes contain a large number of repetitive sequences and foreign DNA transfers, which fragment mitogenome assemblies and make it difficult to obtain a complete mitogenome. So far, the total number of released plant mitogenomes is less than 5% of the number of released plastomes; most are from eudicots and bryophytes, while the remainder belong to other land plants (e.g., ferns, gymnosperms, magnoliids, and monocots).

Plant mitogenomes have many unique evolutionary features compared with the compact and conservative animal mitogenomes. The most notable difference is size: animal mitogenomes are very limited in size (15-18 kb), whereas plant mitogenomes span a wide range (66 kb-12 Mb). Comparative genomic analyses have revealed that frequent mitogenomic recombination and foreign DNA transfers can integrate large amounts of foreign DNA during evolution, ultimately leading to dramatic variation in the size and structure of plant mitogenomes (Rice et al., 2013; Wu et al., 2022). A plant mitogenome is conventionally depicted as a single master circle (MC), like a plastome or an animal mitogenome. With recent advances in DNA sequencing technology, the growing number of plant mitogenomes suggests that the real structure of the plant mitogenome is far more complicated than a single MC model would represent (Sloan, 2013). In addition to driving variation in mitogenomic structure and size, recombination and foreign DNA transfers also modify the gene and intron contents of the mitogenome. Among land plant mitogenomes, the number of introns, including group I and group II introns, ranges from only 4 in Viscum scurruloideum to 45 in Anthoceros angustus. The content of group II introns is highly conserved in vascular plants, although less so when compared with nonvascular plants (Mower, 2020).

Most of the mitogenomes reported so far are those of angiosperms, which are characterized by a variety of unusual properties, including variable sizes, frequent posttranscriptional modifications, low gene densities, and extensive foreign DNA transfers (Knoop et al., 2011). Investigating the mitogenomes of primitive angiosperms will give more comprehensive insight into the evolutionary patterns of plant mitogenomes. The family Lauraceae is the largest family in the order Laurales of primitive angiosperms and comprises more than 3000 species in ca. 55 genera, which are widely distributed in tropical and subtropical regions (Yang et al., 2022b). Characterizing the genetic information of the Lauraceae is thus of great significance for understanding the evolution, phylogeny, and sustainable utilization of the family. Cinnamomum belongs to the family Lauraceae and is now known to be restricted to the Old World. The genus possesses an unusual combination of morphological characters, e.g., evergreen trees, tripliveined leaves that are opposite or subopposite, paniculate inflorescences with ultimate cymes having strictly opposite lateral flowers, and tepals that are mostly partially persistent in fruit. Trees of the genus are economically important because their chemical components are widely used as spices, traditional Chinese medicines, or essential oils.

Mitogenomic data are uniparentally inherited and have been used for phylogenetic studies in many groups of higher plants. However, it remains unclear whether mitochondrial sequences or genomic data are useful for resolving the phylogeny of the family Lauraceae. No mitogenome data for Lauraceae are available in the public nucleotide database of NCBI, although it is known that the mitogenomes of the family have experienced an unusual and complicated evolutionary history (e.g., Cassytha) (Zhang et al., 2020a, 2022). With the development of sequencing technologies, the PacBio HiFi sequencing method has yielded highly accurate long-read sequencing datasets (Hon et al., 2020), which have great potential in genome assembly and the detection of complex structures. This advance has made it possible to resolve complicated mitogenomes. Here, we assembled the mitogenome of Cinnamomum chekiangense into a single master circle using HiFi sequencing data to provide the first mitogenome reference for the family Lauraceae. We further annotated and characterized the mitogenome and conducted a phylogenetic study to better understand its evolutionary significance.

Using the Revio sequencing platform, we obtained a total of 448,389 HiFi sequencing reads, with maximum and average lengths of 32,397 bp and 14,109 bp, respectively. With these highly accurate long-read sequencing data, the complete mitogenome of C. chekiangense was assembled into a single circular molecule with a total length of 750,457 bp (GenBank accession number: NC_082065) (Fig. 1A), which is close to the average mitogenome size in magnoliids. Mitogenome sizes vary considerably within the magnoliid clade, ranging from 535,805 bp in Hernandia nymphaeifolia to 967,100 bp in Magnolia biondii. The mitogenome structure of magnoliids is also variable: most have been assembled into a single circular molecule (e.g., M. biondii, M. officinalis, M. figo, Liriodendron tulipifera, and H. nymphaeifolia), while some have been assembled into multiple circular molecules (e.g., Saururus chinensis and Machilus pauhoi; Yu et al., 2023).

Fig. 1. (A) Circular mitogenome map of Cinnamomum chekiangense. Asterisks beside genes indicate intron-containing genes. Genes with different functions are depicted using different colors. (B) Schematic representation of the collinearity among five magnoliid mitogenomes. The blue and red dots represent the direct and inverted syntenic regions, respectively. (C) Intron content among 11 seed plant mitogenomes. Blue, intron is absent; yellow, intron is cis-splicing; red, intron is trans-splicing. (D) Characteristics of RNA editing sites across all PCGs in the mitogenome of C. chekiangense. PCGs with different functions are depicted using different colors. (E) The phylogenetic relationships of 24 plant species based on mitochondrial PCGs (left) and complete plastome sequences (right). Funaria hygrometrica and Marchantia paleacea were used as outgroups. Numbers on each branch are bootstrap support values. Colors indicate the group of each species.

Frequent rearrangement is the major driver of plant mitogenome evolution. Using the nucmer program of MUMmer v.3.23 (Kurtz et al., 2004), we compared the mitogenome of C. chekiangense with four other closely related mitogenomes, i.e., those of L. tulipifera, M. biondii, Saururus chinensis, and Schisandra sphenanthera. As clearly illustrated in Fig. 1B, the five mitogenomes exhibit very poor collinearity, with numerous regions lacking homology between them. The highest collinearity (~37%) was found between the mitogenomes of C. chekiangense and L. tulipifera, while the collinearity between C. chekiangense and each of the other species was less than 25% (Table S1). These results suggest that the mitogenome of Lauraceae may have experienced frequent genomic rearrangements during evolution, making it difficult to establish the ancestral mitogenomic structure.
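As a rough illustration of how a collinearity percentage of this kind could be derived from pairwise alignments, the sketch below merges overlapping aligned intervals on one genome and divides the covered length by the genome length. The interval coordinates are invented, the function name is hypothetical, and the exact calculation behind Table S1 is not described in the text, so this is only one plausible reading.

```python
def covered_fraction(intervals, genome_length):
    """Fraction of a genome covered by the union of aligned intervals.

    intervals: iterable of (start, end) pairs, 1-based and inclusive.
    start may exceed end, as happens for inverted matches.
    """
    # Normalise orientation so that start <= end, then sort by start.
    spans = sorted((min(s, e), max(s, e)) for s, e in intervals)
    covered = 0
    current_start, current_end = None, None
    for start, end in spans:
        if current_end is None or start > current_end + 1:
            # Disjoint from the running block: close it and open a new one.
            if current_end is not None:
                covered += current_end - current_start + 1
            current_start, current_end = start, end
        else:
            # Overlapping or adjacent: extend the running block.
            current_end = max(current_end, end)
    if current_end is not None:
        covered += current_end - current_start + 1
    return covered / genome_length


# Toy intervals (not real data): two overlapping direct matches
# and one inverted match on a 1000 bp genome.
toy = [(1, 100), (51, 150), (400, 301)]
print(covered_fraction(toy, 1000))
```

Merging before summing matters because repeats can produce overlapping matches, and simply adding alignment lengths would count those bases more than once.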

Despite the substantial loss or transfer of mitochondrial protein-coding genes (PCGs) to the nucleus during the endosymbiosis of mitochondria, plant mitogenomes still retain some unique PCGs (Zardoya, 2020). The retained PCGs of land plant mitogenomes include 24 core genes (atp1, 4, 6, 8, and 9; ccmB, C, Fc, and Fn; cob; cox1-3; nad1-7, 9, and 4L; mttB; and matR) and 19 variable genes (sdh3 and sdh4; rpl2, 5, 6, 10, and 16; rps1-4, 7, 8, 10-14, and 19). In this study, most of these PCGs were identified in the C. chekiangense mitogenome; only one core PCG (matR) and two variable PCGs (rpl6 and rps8) were lost during evolution (Fig. 1A and Table S2). Almost all of the ancestral PCGs have also been found in other magnoliid mitogenomes (Fig. S1), such as those of M. biondii, S. chinensis, and L. tulipifera. Additionally, the basally diverging groups of angiosperms (ANA: Amborellales, Nymphaeales, and Austrobaileyales) have also retained almost all mitochondrial PCGs. In contrast to the mitogenomes of the magnoliids and the ANA clade, which have retained almost all of the ancestral repertoire, some gymnosperms (e.g., Welwitschia mirabilis), hornworts (e.g., Phaeoceros laevis and Anthoceros angustus), and the hemiparasitic angiosperm Viscum scurruloideum have dispensed with half or more of these PCGs (Mower, 2020).
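The abbreviated gene lists above can be expanded mechanically to check the stated totals. The short sketch below does this in Python; the helper name `expand` is hypothetical, and the count of retained genes refers to distinct gene names only, without regard to copy number.

```python
def expand(prefix, suffixes):
    """Build gene names such as 'atp1' from a prefix and a list of suffixes."""
    return [f"{prefix}{s}" for s in suffixes]


# Core genes as listed in the text.
core = (
    expand("atp", [1, 4, 6, 8, 9])
    + expand("ccm", ["B", "C", "Fc", "Fn"])
    + ["cob"]
    + expand("cox", [1, 2, 3])
    + expand("nad", [1, 2, 3, 4, 5, 6, 7, 9, "4L"])
    + ["mttB", "matR"]
)

# Variable genes as listed in the text.
variable = (
    ["sdh3", "sdh4"]
    + expand("rpl", [2, 5, 6, 10, 16])
    + expand("rps", [1, 2, 3, 4, 7, 8, 10, 11, 12, 13, 14, 19])
)

# Genes reported as lost from the C. chekiangense mitogenome.
lost = {"matR", "rpl6", "rps8"}
retained = [gene for gene in core + variable if gene not in lost]

print(len(core), len(variable), len(retained))
```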

Most eukaryotic mitochondrial introns are of two types, group I and group II, which differ in their splicing mechanisms and secondary structure (Mower, 2020). A total of 19 cis-spliced and 6 trans-spliced introns were identified in 13 PCGs (Fig. 1C). All introns in the C. chekiangense mitogenome belong to group II. Using the naming scheme proposed by Dombrovska and Qiu (2004), each intron was named according to its position relative to the reference gene in the Marchantia polymorpha mitogenome. Comparison of cis- and trans-splicing intron content in seed plant mitogenomes revealed a relatively conservative pattern of intron evolution. Most of the introns were shared among seed plants (Fig. 1C), with the exception of nad1i728, rps3i257, cox2i691, cox2i373, and rps10i235. Among these, rps3i257 was completely lost during the divergence of angiosperms and gymnosperms, whereas nad1i728 was retained in some angiosperms (e.g., Arabidopsis thaliana and Populus tremula). Additionally, nad7i676 was lost from the mitogenome of Nicotiana tabacum but was retained in other seed plants.
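Intron names such as nad1i728 combine a gene name, the letter "i", and a number that is read here as the intron's position in the Marchantia polymorpha reference gene. The minimal sketch below splits such names into their parts and groups them by gene; the regular expression and function name are illustrative assumptions, not part of the published naming scheme.

```python
import re
from collections import defaultdict

# Gene name (shortest possible match), then "i", then a position to the end.
INTRON_NAME = re.compile(r"(.+?)i(\d+)")


def parse_intron(name):
    """Split an intron name like 'nad1i728' into ('nad1', 728)."""
    match = INTRON_NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"not a recognisable intron name: {name!r}")
    return match.group(1), int(match.group(2))


# Intron names mentioned in the text.
names = ["nad1i728", "rps3i257", "cox2i691", "cox2i373", "rps10i235", "nad7i676"]

by_gene = defaultdict(list)
for name in names:
    gene, position = parse_intron(name)
    by_gene[gene].append(position)
for positions in by_gene.values():
    positions.sort()

print(dict(by_gene))
```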

RNA editing is a common phenomenon in plant mitogenomic transcripts and may lead to massive diversity in posttranscriptional gene sequences. The number of RNA editing sites varies substantially across plant lineages. RNA editing is rare in mosses and liverworts (Rüdinger et al., 2009), but it is abundant in lycophytes (Zhang et al., 2020b). In gymnosperms, the number of RNA editing sites varies from as few as 99 in Welwitschia mirabilis to 1405 in Ginkgo biloba (Fan et al., 2019). RNA editing frequency is generally lower in angiosperms, especially in monocots and eudicots. The number of RNA editing sites in angiosperm mitogenomes is typically between 400 and 500 (Bi et al., 2016). In this study, we used GATK (https://github.com/broadinstitute/gatk), Bcftools (Danecek et al., 2021), and REDItools (Picardi and Pesole, 2013) to identify RNA editing sites. The thresholds used to define an RNA editing site were QUAL > 30, depth > 100×, and P value > 0.1. Based on RNA sequencing data, we identified a total of 1119 RNA editing sites in 41 PCGs of the C. chekiangense mitogenome (Fig. 1D and Table S3), which is the highest number reported in angiosperms to date. These results suggest that the decrease in the number of RNA editing sites from liverworts and mosses through gymnosperms to angiosperms may be caused by gene loss. Although we manually checked all RNA editing sites using IGV (Thorvaldsdottir et al., 2012), PCR experiments and Sanger sequencing would still be needed to obtain a more accurate result.
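The three thresholds can be read as a simple conjunctive filter over candidate sites. The sketch below applies them exactly as stated in the text to a few invented records; the `CandidateSite` fields are hypothetical and do not correspond to the actual output format of GATK, Bcftools, or REDItools.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidateSite:
    gene: str
    position: int
    qual: float      # variant quality score
    depth: int       # read depth at the site
    p_value: float   # P value reported for the site


# Thresholds as stated in the text; all comparisons are strict.
MIN_QUAL = 30
MIN_DEPTH = 100
P_CUTOFF = 0.1


def passes(site):
    """Return True if a candidate site meets all three stated thresholds."""
    return (
        site.qual > MIN_QUAL
        and site.depth > MIN_DEPTH
        and site.p_value > P_CUTOFF
    )


# Invented example records, not real data.
candidates = [
    CandidateSite("nad1", 2, 55.0, 240, 0.50),    # meets all thresholds
    CandidateSite("cox1", 10, 30.0, 500, 0.90),   # QUAL is not above 30
    CandidateSite("atp6", 77, 80.0, 100, 0.90),   # depth is not above 100
    CandidateSite("rps3", 301, 99.0, 350, 0.05),  # P value is not above 0.1
]

kept = [site for site in candidates if passes(site)]
print([site.gene for site in kept])
```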

The advent of high-throughput sequencing (HTS) has allowed plant systematists to address long-standing phylogenetic issues at different taxonomic levels. In plant phylogenomic studies, plastomes have been widely used to infer phylogenetic relationships at different taxonomic levels because their genomes are easily assembled (Twyford and Ness, 2016; Yang et al., 2022a). In contrast, mitogenomes have been largely neglected in plant phylogenies because of the difficulty of obtaining complete mitogenomes and their generally low rates of nucleotide substitution (Sloan et al., 2009). Despite extensive studies on the early diversification of the five major lineages in Mesangiospermae (Ceratophyllales, Chloranthales, eudicots, magnoliids, and monocots), their phylogenetic relationships remain elusive. In recent phylogenetic trees inferred from organellar genomes, monocots have been considered to be more closely related to eudicots than to magnoliids (Li et al., 2019, 2021; Xue et al., 2022). However, the phylogenetic relationships inferred from nuclear genes are more chaotic and unstable among the five Mesangiospermae clades (Zhang et al., 2019; Guo et al., 2021; Ma et al., 2021), implying possible hybridization and incomplete lineage sorting in the early history of angiosperms.

To investigate the phylogenetic position of C. chekiangense and the magnoliids relative to the monocots and eudicots, we used whole plastome sequences and 23 conserved mitochondrial PCGs to reconstruct separate maximum likelihood (ML) phylogenetic trees for 24 plant species. The conserved mitochondrial PCGs were extracted and aligned in MAFFT v.7.407 (Katoh and Standley, 2013). The aligned sequences were subsequently concatenated to construct the ML tree in IQ-TREE v.2.0.3 with 1000 bootstrap replicates (Minh et al., 2020). Both phylogenetic trees recovered the magnoliids as sister to the clade comprising monocots and eudicots (Fig. 1E), which is consistent with the APG IV botanical classification system (Angiosperm Phylogeny Group IV et al., 2016). A previous study used a complete set of mitochondrial genes from 18 angiosperms to elucidate the phylogenetic relationships among the five Mesangiospermae clades (Xue et al., 2022). Its results provided valuable information and alternative hypotheses for investigating the early evolution of angiosperms. However, it remains unclear whether mitogenomes can contribute to the phylogeny of the family Lauraceae, as no mitogenomes have previously been published for this family. With the improvement of HTS technology and the development of effective assembly methods for complex plant mitogenomes, it will become possible to investigate large-scale phylogenetic relationships based on mitogenome sequences. In due course, some of the complex phylogenetic issues may be resolved using genomic data from nuclear, plastid, and mitochondrial genomes.
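The concatenation step between alignment and tree inference can be sketched as follows. This is a minimal illustration, assuming each gene has already been aligned and that taxa missing a gene are padded with gaps; the function name, the gap-padding choice, and the toy sequences are assumptions for illustration and are not taken from the authors' pipeline.

```python
def concatenate(alignments, taxa):
    """Join per-gene alignments into one supermatrix.

    alignments: dict mapping gene name -> dict of taxon -> aligned sequence.
    taxa: list of all taxon names to include in the supermatrix.
    Returns (supermatrix, partitions), where partitions records the
    1-based inclusive column range occupied by each gene.
    """
    pieces = {taxon: [] for taxon in taxa}
    partitions = []
    offset = 0
    for gene, aligned in alignments.items():
        lengths = {len(seq) for seq in aligned.values()}
        if len(lengths) != 1:
            raise ValueError(f"{gene}: sequences are not aligned to one length")
        length = lengths.pop()
        for taxon in taxa:
            # Pad with gaps when a taxon lacks this gene.
            pieces[taxon].append(aligned.get(taxon, "-" * length))
        partitions.append((gene, offset + 1, offset + length))
        offset += length
    supermatrix = {taxon: "".join(parts) for taxon, parts in pieces.items()}
    return supermatrix, partitions


# Toy alignments (not real sequences): taxon C lacks atp1, taxon B lacks cob.
toy_alignments = {
    "atp1": {"A": "ATG-", "B": "ATGC"},
    "cob": {"A": "GG", "C": "GA"},
}
matrix, parts = concatenate(toy_alignments, ["A", "B", "C"])
print(matrix)
print(parts)
```

Recording the column range of each gene keeps the option of fitting a separate substitution model per gene later, rather than treating the supermatrix as a single block.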

As the first reported mitogenome in the Lauraceae family, this study provides a valuable reference for mitogenome analysis in Lauraceae. It also provides important insights into RNA editing, mitogenome evolution, and phylogeny in angiosperms.

Data availability

The mitochondrial and plastid genomes supporting this study are available at GenBank under accession numbers NC_082065 and OR360835, respectively. The HiFi and RNA sequencing data of C. chekiangense are deposited in the SRA repository under SRR26158200 and SRR26157632, respectively.

Declaration of competing interest

The authors declare no conflicts of interest.

Author contributions

CB and YY planned and designed the research. CB, NS, and FH analyzed the data and prepared the figures. CB and KX provided the materials and conducted experiments. CB wrote the initial version of the manuscript. YY and DKF revised it and provided comments. All authors read and approved the manuscript.

Acknowledgments

This work was supported by the Natural Science Foundation of Jiangsu Province (BK20220414) and the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province (22KJB220003).

Appendix A. Supplementary data

Supplementary data to this article can be found online at https://doi.org/10.1016/j.pld.2023.11.001.
