中国寄生虫学与寄生虫病杂志, 2020, 38(5): 642-646 doi: 10.12140/j.issn.1000-7423.2020.05.018

综述

高通量组学技术在尘螨研究中的应用进展

刘娜娜,1, 崔玉宝2, 叶林华1, 黄路圣,1,*

1 蚌埠医学院附属泰兴市人民医院,泰州 225400

2 南京医科大学附属无锡市人民医院检验科,无锡 214023

Applications of high-throughput omics technology in research on dust mite

LIU Na-na,1, CUI Yu-bao2, YE Lin-hua1, HUANG Lu-sheng,1,*

1 Taixing People’s Hospital Affiliated to Bengbu Medical College, Taizhou 225400, China

2 Department of Clinical Laboratory, Wuxi People’s Hospital Affiliated to Nanjing Medical University, Wuxi 214023, China

通讯作者: * 黄路圣,E-mail:huanglusheng@sina.cn

责任编辑: 陈勤

收稿日期: 2020-01-9   网络出版日期: 2020-09-10

基金资助: 国家自然科学基金(NSFC81971511); 国家自然科学基金(NSFC31572319); 江苏省第五期“333工程”科研项目(BRA2017216); 江苏省重点研发计划(BE2018627); 无锡市卫计委重大课题(Z201701)

Corresponding authors: * E-mail:huanglusheng@sina.cn

Received: 2020-01-9   Online: 2020-09-10

Fund supported: National Natural Science Foundation of China(NSFC81971511); National Natural Science Foundation of China(NSFC31572319); 333 Project of Jiangsu Province in 2017(BRA2017216); Primary Research & Development Plan of Jiangsu Province(BE2018627); Major Program of Wuxi Health and Family Planning Commission(Z201701)

作者简介 About authors

刘娜娜(1991-),硕士研究生,医师,从事尘螨与过敏性疾病研究。E-mail:876636493@qq.com

摘要

尘螨是全球最常见的吸入性变应原之一。近20年来,尘螨过敏的患病率呈上升趋势。随着后基因时代的到来,生命科学研究进入到系统、全面、动态的探索阶段。高通量组学采用测序技术从基因、蛋白质等层面对样品进行全面系统的分析和数据挖掘,解释生物学过程的内在机制。本文对尘螨基因组、转录组、蛋白质组、微生物组等4个方面的进展进行综述,为进一步深入研究尘螨过敏原与过敏性疾病、开发新的防治策略提供思路。

关键词: 尘螨; 过敏原组学; 基因组; 转录组; 蛋白质组; 微生物组

Abstract

House dust mites are one of the most common inhaled allergens worldwide. Since the last 20 years, the prevalence of dust mite allergy has shown a trend of increase. With the advent of the post-genome age, research in life sciences has entered a new stage of systematic, comprehensive and dynamic exploration. High-throughput omics technology has been applied to comprehensively analyze samples at the gene and/or protein levels by sequencing for data mining, and interpreting intrinsic mechanisms of biological processes. This paper reviews the advance in application of high-throughput omics in the genomics, transcriptomics, proteomics and microbiology of dust mites, providing ideas forfurther research on dust mite allergens and allergic diseases and for the development of new control strategies.

Keywords: House dust mite; Allergenomics; Genome; Transcriptom; Proteom; Microbiome

PDF (6536KB) 元数据 多维度评价 相关文章 导出 EndNote| Ris| Bibtex  收藏本文

本文引用格式

刘娜娜, 崔玉宝, 叶林华, 黄路圣. 高通量组学技术在尘螨研究中的应用进展[J]. 中国寄生虫学与寄生虫病杂志, 2020, 38(5): 642-646 doi:10.12140/j.issn.1000-7423.2020.05.018

LIU Na-na, CUI Yu-bao, YE Lin-hua, HUANG Lu-sheng. Applications of high-throughput omics technology in research on dust mite[J]. Chinese Journal of Parasitology and Parasitic Diseases, 2020, 38(5): 642-646 doi:10.12140/j.issn.1000-7423.2020.05.018

尘螨呈世界性分布,属节肢动物门(Arthropoda)珠形纲(Arachnida)蜱螨亚纲(Acari)疥螨目(Sarcoptiformes)羽螨总科(Analgoidea)无气门股(Astigmatina)麦食螨科(Pyroglyphidae)的一类微小动物,体型微小。狭义上的尘螨系指麦食螨科螨类。常见螨种为屋尘螨(Dermatophagoides pteronyssinus)、粉尘螨(D. farina)及梅氏嗜霉螨(Euroglyphus maynei)。尘螨的分泌物、尸体分解物及其排泄物等都可以引发能诱发Ⅰ型超敏反应如荨麻疹、过敏性鼻炎、哮喘、皮炎等[1,2]。用13种常见的空气过敏原对我国4个地区17个城市的6 304例哮喘和鼻炎患者进行皮肤点刺试验结果显示,粉尘螨阳性率为59.0%, 屋尘螨阳性率为57.6%,热带无爪螨(Blomia tropicalis)阳性率为40.7%[3]。在儿童中,尘螨引起的过敏性哮喘占哮喘患者总数的80%;在成年人中,约50%的哮喘患者对尘螨过敏[4]。尘螨过敏原按其发现的先后顺序依次分为第1组分、第2组分……截止2019年8月,WHO和国际免疫学会联合会(IUIS)授权的过敏原命名委员会网站(http://www.allergen.org/.已公布了麦食螨过敏原39个组分,其中第1、2组分(Der p 1、Der f 1和Der p 2、Der f 2)是主要组分。随着大数据时代的到来,利用高通量技术研究过敏原成为可能。本文通过对尘螨基因组、转录组、蛋白质组与微生物组的研究进展进行综述,在各种螨同源搜索的基础上,鉴定出尘螨过敏原新的组分,为进一步开发组分分辨诊断试剂、免疫治疗新型制剂等奠定基础。

1 高通量组学在尘螨过敏原组学研究中的应用

Yagami等[5]通过应用蛋白质组学技术对乳胶中的潜在过敏原进行识别验证,证实了运用蛋白质组学的方法研究过敏原是一种快速全面分析潜在过敏原的有效方法,并首次将过敏原和蛋白质两个单词进行整合,产生了过敏原组学的概念。过敏原组学通过双向凝胶电泳(two-dimensional gel electrophoresis,2-DE)分离过敏原粗提浸液,然后用患者血清进行免疫印迹,鉴定出IgE结合蛋白点,再经质谱分析鉴定,数据库比对,一次可以鉴定出多个过敏原组分。

过敏原组学具有通量高、覆盖面广和快速灵敏等特点,成为快速、有效、全面分析IgE相互作用蛋白的关键技术。过敏原组学首先从致敏物质中提取蛋白质,然后进行2-DE分离。利用固定化的pH梯度凝胶,根据蛋白的等电点,首先用等电聚焦分离蛋白。根据蛋白质的相对分子质量(Mr),通过SDS-PAGE将其分离。这种双向扩散的蛋白质随后被转移到膜上,并与患有过敏性疾病患者的血清发生反应。然后,特异性地与IgE抗体相互作用的抗原被检测出来。参照免疫印迹的结果,免疫球蛋白相互作用的斑点可以从染色或荧光染色的2-DE中提取出来。

在蛋白质组学分析中,蛋白质的鉴定是通过质谱(mass spectrometric,MS)方法完成的。用胰蛋白酶消化蛋白质并进行MS鉴定主要有两种方法:一是肽质量指纹图谱,二是肽序列分析。肽质量指纹图谱通常采用基质辅助激光解吸-飞行时间质谱仪分析,而肽序列分析采用串联质谱(MS/MS mass spectrometry/mass spectrometry)。蛋白质组学研究中蛋白质分离的标准方法为2-DE。

2 高通量组学技术在尘螨基因组学研究中的应用

2.1 全基因组

基因组测序通常使用PacBio Sequel、Illumina HiSeq 2000及Thermo Ion Torrent等测序技术。基于Illumina数据的Phusion或SOAP de novo组装及基于PaciBio数据的主基因组装等技术在尘螨研究取得较多进展。Chan等[6]对粉尘螨基因组进行高通量测序和从头组装,获得的粉尘螨基因组大小为53.5 Mb,包含516个核基因支架和14.3 kb的线粒体基因组。用核心真核基因定位方法(core eukaryotic genes mapping approach,CEGMA)估算,该基因组的完整性为97.58%[7]。Randall等[8]对屋尘螨进行PacBio和Illumina全基因组测序,重新组装获得的屋尘螨基因组大小为52.5 Mb,包含834个重叠群,N50长376 kb。依据N50测量结果,用CEGMA评估该基因组的完整性为97.7%(2 686/2 748)[9]。该基因组还确认了19个已经公布的过敏原序列,获得12个与粉尘螨过敏原同源序列。通过与粉尘螨全基因组比较,发现12条未报道的过敏原基因序列,即第16、22和24~34组分。比较基因组学分析显示,大多数过敏原基因在真螨目各螨种中的同源性较高。Rider等[10]通过基因大小评估工具Kmer Genie对梅氏嗜霉螨基因组大小进行估算,获得梅氏嗜霉螨的基因组大小为59 Mb,N50为480 nt。该基因组可预测15 000个蛋白。Waldron等[11]通过dipSPAdes 1.0重新组装获得的屋尘螨基因组,大小约为70.76 Mb,包括322个支架,N50长450 436 bp, L50为33个支架,GC含量为30.93%。通过从头开始基因预测法定位了39种已知的螨过敏原的全长基因序列,包括螨类过敏原组分1~11、13~16、18和20~33[6,12]。Liu等[13]通过重新组装获得屋尘螨的基因组,大小为66.8 Mb,包含1 390个重叠群、634个支架,N50长194 kb,N50长80 kb。而用第三代测序组装工具Canu对屋尘螨基因组进行组装获得的屋尘螨基因组大小为68.0~72.5 Mb[14]。因此,Liu等[13]组装的屋尘螨基因组最多占Canu方法组装屋尘螨基因组的98.2%。使用普通通用的单拷贝直系同源测试对Liu等[13]组装的屋尘螨基因组的完整性进行评估,结果表明,该基因组包含了90.4%单拷贝同源基因,其中949个单拷贝同源基因是完整的。该基因组还解析了Der p 1~36 组分。因此,基因组学研究将使研究人员能够更好地对螨虫暴露的总体情况进行分类,有助于更好地理解可耐受蛋白和致敏蛋白。各种螨的基因组数据结果见表1

表1   各种螨的基因组数据

种名拉丁文基因组大小/Mb支架N50支架预测蛋白数基因组NCBI登录号参考文献
粉尘螨Dermatophagoides farina53.5187 kb51516 376GCA_000767015.1[6]
屋尘螨D. pteronyssinus52.5376 kb83419 368-[8]
梅氏嗜霉螨Euroglyphus maynei59480 nt-15 000-[10]
屋尘螨D. pteronyssinus70.76450 kb332-GCA_001901225.2[11]
屋尘螨D. pteronyssinus66.880 kb634-GCA_003076615.1[13]

注:- 无数据

新窗口打开| 下载CSV


2.2 转录组

转录组主要通过RNA测序和从头组装进行分析。转录组测序数据使用Trinity进行拼接,拼接后的转录本使用GeneMark-ES和GlimmerHMM对注释进行细化。Cui等[15]以腐食酪螨(Tyrophagus putrescentiae)转录组预测的蛋白为数据库,从蛋白质组学数据中获得所有匹配的肽段,然后对肽段和蛋白进行鉴定,发现了3种新过敏原,分别为Tyrp28、Tyr p 35、Tyr p 36(WHO/IUIS官网命名)。Bordas-Le等[16]使用454测序技术和Illumina测序技术对粉尘螨和屋尘螨mRNA进行测序、从头组装,分别得到161 429和156 174个重叠群;通过对屋尘螨mRNA和粉尘螨mRNA进行编码序列预测,分别获得143 369和37 585条编码序列;对获得的编码序列进行翻译、注释后分别得到79 890和21 696个基因;通过该粉尘螨和屋尘螨转录组数据预测获得2种新的过敏原,由IUIS命名为Der f 36和Der p 36。Zhou等[17]通过对热带无爪螨进行RNA测序、从头组装,获得转录组的总长度为62 406 675 bp,包含68 658个转录本。根据转录组分析中的功能注释预测、RT-PCR验证发现3个水通道蛋白亚群,分别为水通道蛋白、水甘油通道蛋白和超水通道蛋白。转录组学方法如RNA测序可以揭示信使RNA分子的全部补体,而RNA测序数据的重新组装可以使基因研究不需要基因组序列[18]。转录组学方法的使用为推进对尘螨基因组的深入分析提供了可能。

3 高通量组学技术在尘螨蛋白质组研究中的应用

国内也有运用蛋白质组学技术研究寄生虫的文献[19,20]。目前蛋白质组学技术在尘螨研究中应用,主要通过SDS-PAGE、免疫印迹、液相质谱法(liquid chromatography,LC)、MS、2-DE等技术来实现。An等[21]采用2-DE和免疫印迹相结合的方法,从粉尘螨提取物中分离出17种过敏原,分属12个不同的组,其中8个是首次报道的粉尘满过敏原,分别是Der f 20、Der f 8、Der f 25~30。其中,Der f 25、Der f 28、Der f 29和Der f 30与尘螨过敏患者的血清IgE结合率分别为75.6%(31/41)、68.3%(28/41)、85.6%(35/41)和63.4%(26/41)。Choopong等[22]通过SDS-PAGE、LC-MS/MS并结合数据库检索,从粉尘螨体提取物鉴定出105个蛋白,分属7个功能不同的组,即过敏原、结构成分、酶、酶抑制剂、受体蛋白、转运蛋白和结合或者调节细胞信号蛋白;而通过2DE-免疫印迹检测出63个能与粉尘螨过敏患者血清结合的蛋白,其中3个与IgE结合率> 50%[Der f 10(75%)、乌头酸水合酶(70%)和未知蛋白(55%)],被认为是过敏原主要组分。Bordas-Le等[16]通过LC-MS/MS从粉尘螨和屋尘螨的虫体中分别鉴定出326和201个蛋白质,从其粪便提取物中分别鉴定出80和77个蛋白质;根据序列相似性一次性鉴定出13个未报道过屋尘螨过敏原,且屋尘螨和粉尘螨均含有Mr为23 000的IgE结合蛋白(由官方命名为Der f 36和Der p 36)。蛋白质组学研究技术对尘螨所致疾病的早期诊断、预后判断和疾病的进展监测至关重要。

4 高通量组学技术在尘螨与微生物组关系研究中的应用

Chan等[6]通过对粉尘螨基因组高通量测序、与微生物数据库的比对,分离出100多个生物物种的112 000微生物基因序列。71 000条基因序列来自肠杆菌属,主要是阴沟肠杆菌(Enterobacter cloacae)和肠杆菌(E. hormaechei),共占63.4%;其次为葡萄球菌(Staphylococcus)和埃希菌(Escherichia ),分别占17.8%和4.9%;最少为巴尔通体属(Bartonella),占1.7%。Chan等[6]还通过免疫组化实验证明肠杆菌在粉尘螨肠道中大量存在。

16S RNA测序是研究尘螨微生物菌落的重要手段。Kim等[23]采用PCR扩增粉尘螨定殖菌16S rRNA的V3~V4区域,利用Illumina MiSeq平台进行下一代测序(next-generation sequencing,NGS),测序后通过3种不同的生物学方法对微生物组数据进行分析,发现粉尘螨中的微生物主要是肠球菌(Enterococcus)和巴尔通体属。已确定粉尘螨和屋尘螨中都存在胞内共生菌cardinium[24,25]。cardinium是一类近几年才被关注和发现的共生菌,属于拟杆菌门的一个独立分支,同沃尔巴克氏体一样可以通过细胞质遗传。关于cardinium共生的基因组分析表明,cardinium与宿主之间存在营养上的相互作用,如生物素的生物合成[26,27]。Hubert等[28]发现cardinium在粉尘螨微生物组中有大量的序列,但在韩国的粉尘螨种群中无cardinium的报道[23]。在最近的一项研究中,粉尘螨全基因组鸟枪测序显示,粉尘螨微生物中最丰富的菌群是肠杆菌属[6]。而Valerio等[29]研究发现巴尔通体属是粉尘螨中最丰富的微生物。Erban等[30]通过对腐食酪螨的16S rRNA的V1~V3区域进行扩增、测序、qPCR定量,发现腐食酪螨微生物菌落包括沃尔巴克氏体、cardinium、solitalea类细菌和巴尔通氏类细菌(Blattabacterium-like)。通过对尘螨微生物组菌群的研究,知道尘螨的核心菌群,为研究微生物在尘螨引起疾病中的作用奠定基础。。

5 结语

尘螨在世界上广泛分布,对人类的生活和健康造成严重的影响。目前,对尘螨过敏原及其所致的过敏性疾病的研究越来越多,一些新的过敏原及其亚型正在等待被发现。基于对尘螨基因组、转录组、蛋白质组和微生物组研究,不仅有助于发现新的过敏原,还丰富了已知尘螨过敏原的亚型及其组分的鉴定,为尘螨所致疾病的机制研究提供帮助,为解决尘螨过敏性疾病的早期诊断、免疫治疗和和预防提供资料。

伦理批准和患者知情同意 本文不涉及伦理批准和患者知情同意。

出版授权 作者同意以纸质版和网络版的形式同时出版。

数据和材料的可及性 本文中的所有参考文献如有需要,可与刘娜娜联系。

利益冲突 作者声明无利益冲突。

作者贡献 刘娜娜负责文献收集和文章撰写,崔玉宝、黄路圣、叶林华负责文章修改。

参考文献

Banerjee S, Resch Y, Chen KW, et al.

Der p 11 is a major allergen for house dust mite-allergic patients suffering from atopic dermatitis

[J]. J Invest Dermatol, 2015,135(1):102-109.

DOI:10.1038/jid.2014.271      URL     PMID:24999597      [本文引用: 1]

House dust mites (HDMs) belong to the most potent indoor allergen sources worldwide and are associated with allergic manifestations in the respiratory tract and the skin. Here we studied the importance of the high-molecular-weight group 11 allergen from Dermatophagoides pteronyssinus (Der p 11) in HDM allergy. Sequence analysis showed that Der p 11 has high homology to paramyosins from mites, ticks, and other invertebrates. A synthetic gene coding for Der p 11 was expressed in Escherichia coli and rDer p 11 purified to homogeneity as folded, alpha-helical protein as determined by circular dichroism spectroscopy. Using antibodies raised against rDer p 11 and immunogold electron microscopy, the allergen was localized in the muscle beneath the skin of mite bodies but not in feces. IgE reactivity of rDer p 11 was tested with sera from HDM-allergic patients from Europe and Africa in radioallergosorbent test-based dot-blot assays. Interestingly, we found that Der p 11 is a major allergen for patients suffering from atopic dermatitis (AD), whereas it is only a minor allergen for patients suffering from respiratory forms of HDM allergy. Thus, rDer p 11 might be a useful serological marker allergen for the identification of a subgroup of HDM-allergic patients suffering from HDM-associated AD.

He XM, Shao C, Wei QY.

Progress in sensitized protein components and subcutaneous specific immunotherapy for dust mites

[J]. Int J Pediatrics, 2019(3):198-202. (in Chinese)

[本文引用: 1]

( 何雪梅, 邵婵, 魏庆宇.

尘螨致敏蛋白组份及其皮下特异性免疫治疗的研究进展

[J]. 国际儿科学杂志, 2019,46(3):198-202.)

[本文引用: 1]

Li J, Sun B, Huang Y, et al.

A multicentre study assessing the prevalence of sensitizations in patients with asthma and/or rhinitis in China

[J]. Allergy, 2009,64(7):1083-1092.

DOI:10.1111/j.1398-9995.2009.01967.x      URL     PMID:19210346      [本文引用: 1]

Hui Y, Li L, Qian J, et al.

Efficacy analysis of three-year subcutaneous SQ-standardized specific immunotherapy in house dust mite-allergic children with asthma

[J]. Exp Ther Med, 2014,7(3):630-634.

URL     [本文引用: 1]

Yagami T, Haishima Y, Tsuchiya T, et al.

Proteomic analysis of putative latex allergens

[J]. Int Arch Allergy Immunol, 2004,135(1):3-11.

DOI:10.1159/000080036      URL     PMID:15286439      [本文引用: 1]

BACKGROUND: Extensive analysis of allergenic proteins is generally time-consuming and labor-intensive. Accordingly, a rapid and easy procedure for allergen identification is required. As sequence information on proteins and genes is accumulated in databases, it is becoming easier to identify a candidate protein using proteomic strategies, i.e. two-dimensional gel electrophoresis, site-specific fragmentation, mass spectrometry and then database search. In this study, we evaluated the usefulness of a proteomic strategy for identifying putative allergens through its application to latex proteins. METHODS: Latex proteins were separated with two-dimensional gel electrophoresis, and putative allergens were visualized by IgE immunoblotting using pooled serum from latex-sensitive patients. The IgE-interactive proteins were cut out from the negatively stained two-dimensional gel and subjected to in-gel digestion by trypsin. Then the resulting peptides were analyzed with mass spectrometry. Based on the mass spectrometric data we obtained, the allergen candidates were assigned by a database search. RESULTS: Five previously reported allergens and five new allergen candidates were identified with the proteomic approach without isolating the individual proteins. Less than 1 mg of crude latex protein was sufficient for the entire protocol. Because plural proteins can be processed in parallel, analysis of about 50 IgE-interactive proteins was accomplished within 1 week. CONCLUSIONS: Analysis of putative allergens with proteomic strategies (allergenomics) is a promising avenue for rapid and exhaustive research. The high resolving power of two-dimensional gel electrophoresis is superior to conventional gel electrophoresis. Moreover, the notable sensitivity and speed of mass spectrometry have pronounced advantages over the N-terminal sequencing that has generally been used for protein identification.

Chan TF, Ji KM, Yim AK, et al.

The draft genome, transcriptome, and microbiome of Dermatophagoides farinae reveal a broad spectrum of dust mite allergens

[J]. J Allergy Clin Immunol, 2015,135(2):539-548.

DOI:10.1016/j.jaci.2014.09.031      URL     PMID:25445830      [本文引用: 6]

BACKGROUND: A sequenced house dust mite (HDM) genome would advance our understanding of HDM allergens, a common cause of human allergies. OBJECTIVE: We sought to produce an annotated Dermatophagoides farinae draft genome and develop a combined genomic-transcriptomic-proteomic approach for elucidation of HDM allergens. METHODS: A D farinae draft genome and transcriptome were assembled with high-throughput sequencing, accommodating microbiome sequences. The allergen gene structures were validated by means of Sanger sequencing. The mite's microbiome composition was determined, and the predominant genus was validated immunohistochemically. The allergenicity of a ubiquinol-cytochrome c reductase binding protein homologue was evaluated with immunoblotting, immunosorbent assays, and skin prick tests. RESULTS: The full gene structures of 20 canonical allergens and 7 noncanonical allergen homologues were produced. A novel major allergen, ubiquinol-cytochrome c reductase binding protein-like protein, was found and designated Der f 24. All 40 sera samples from patients with mite allergy had IgE antibodies against rDer f 24. Of 10 patients tested, 5 had positive skin reactions. The predominant bacterial genus among 100 identified species was Enterobacter (63.4%). An intron was found in the 13.8-kDa D farinae bacteriolytic enzyme gene, indicating that it is of HDM origin. The Kyoto Encyclopedia of Genes and Genomes pathway analysis revealed a phototransduction pathway in D farinae, as well as thiamine and amino acid synthesis pathways, which is suggestive of an endosymbiotic relationship between D farinae and its microbiome. CONCLUSION: An HDM genome draft produced from genomic, transcriptomic, and proteomic experiments revealed allergen genes and a diverse endosymbiotic microbiome, providing a tool for further identification and characterization of HDM allergens and development of diagnostics and immunotherapeutic vaccines.

Parra G, Bradnam K, Korf I.

CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes

[J]. Bioinformatics, 2007,23(9):1061-1067.

DOI:10.1093/bioinformatics/btm071      URL     PMID:17332020      [本文引用: 1]

MOTIVATION: The numbers of finished and ongoing genome projects are increasing at a rapid rate, and providing the catalog of genes for these new genomes is a key challenge. Obtaining a set of well-characterized genes is a basic requirement in the initial steps of any genome annotation process. An accurate set of genes is needed in order to learn about species-specific properties, to train gene-finding programs, and to validate automatic predictions. Unfortunately, many new genome projects lack comprehensive experimental data to derive a reliable initial set of genes. RESULTS: In this study, we report a computational method, CEGMA (Core Eukaryotic Genes Mapping Approach), for building a highly reliable set of gene annotations in the absence of experimental data. We define a set of conserved protein families that occur in a wide range of eukaryotes, and present a mapping procedure that accurately identifies their exon-intron structures in a novel genomic sequence. CEGMA includes the use of profile-hidden Markov models to ensure the reliability of the gene structures. Our procedure allows one to build an initial set of reliable gene annotations in potentially any eukaryotic genome, even those in draft stages. AVAILABILITY: Software and data sets are available online at http://korflab.ucdavis.edu/Datasets.

Randall TA, Mullikin JC, Mueller GA.

The draft genome assembly of Dermatophagoides pteronyssinus supports identification of novel allergen isoforms in Dermatophagoides species

[J]. Int Arch Allergy Immunol, 2018,175(3):136-146.

DOI:10.1159/000481989      URL     PMID:29320781      [本文引用: 2]

Miller JR, Koren S, Sutton G.

Assembly algorithms for next-generation sequencing data

[J]. Genomics, 2010,95(6):315-327.

DOI:10.1016/j.ygeno.2010.03.001      URL     PMID:20211242      [本文引用: 1]

The emergence of next-generation sequencing platforms led to resurgence of research in whole-genome shotgun assembly algorithms and software. DNA sequencing data from the Roche 454, Illumina/Solexa, and ABI SOLiD platforms typically present shorter read lengths, higher coverage, and different error profiles compared with Sanger sequencing data. Since 2005, several assembly software packages have been created or revised specifically for de novo assembly of next-generation sequencing data. This review summarizes and compares the published descriptions of packages named SSAKE, SHARCGS, VCAKE, Newbler, Celera Assembler, Euler, Velvet, ABySS, AllPaths, and SOAPdenovo. More generally, it compares the two standard methods known as the de Bruijn graph approach and the overlap/layout/consensus approach to assembly.

Rider SD Jr, Morgan MS, Arlian LG.

Allergen homologs in the Euroglyphus maynei draft genome

[J]. PLoS One, 2017,12(8):e0183535.

DOI:10.1371/journal.pone.0183535      URL     PMID:28829832      [本文引用: 2]

Euroglyphus maynei is a house dust mite commonly found in homes worldwide and is the source of allergens that sensitize and induce allergic reactions in humans. It is the source of species-specific allergens as well as allergens that are cross-reactive with the allergens from house dust mites Dermatophagoides farinae and D. pteronyssinus, and the ectoparasitic scabies mite Sarcoptes scabiei. The genomics, proteomics and molecular biology of E. maynei and its allergens have not been as extensively investigated as those of D. farinae, D. pteronyssinus, and S. scabiei where natural and recombinant allergens from these species have been characterized. Until now, little was known about the genome of E. maynei and it allergens but this information will be important for producing recombinant allergens for diagnostic and therapeutic purposes and for understanding the allergic response mechanism by immune effector cells that mediate the allergic reaction. We sequenced and assembled the 59 Mb E. maynei genome to aid the identification of homologs for known allergenic proteins. The predicted proteome shared orthologs with D. farinae and S. scabiei, and included proteins with homology to more than 30 different groups of allergens. However, the majority of allergen candidates could not be assigned as clear orthologs to known mite allergens. The genomic sequence data, predicted proteome, and allergen homologs identified from E. maynei provide insight into the relationships among astigmatid mites and their allergens, which should allow for the development of improved diagnostics and immunotherapy.

Waldron R, McGowan J, Gordon N, et al.

Draft genome sequence of Dermatophagoides pteronyssinus, the European house dust mite

[J]. Genome Announc, 2017,5(32):e00789-e00717.

DOI:10.1128/genomeA.00789-17      URL     PMID:28798186      [本文引用: 2]

Dermatophagoides pteronyssinus is the European dust mite and a major source of human allergens. Here, we present the first draft genome sequence of the mite, as well as the ab initio gene prediction and functional analyses that will facilitate comparative genomic analyses with other mite species.

Rider SD Jr, Morgan MS, Arlian LG.

Draft genome of the Scabies mite

[J]. Parasit Vectors, 2015,8:585.

DOI:10.1186/s13071-015-1198-2      URL     PMID:26555130      [本文引用: 1]

BACKGROUND: The disease scabies, caused by the ectoparasitic mite, Sarcoptes scabiei, causes significant morbidity in humans and other mammals worldwide. However, there is limited data available regarding the molecular basis of host specificity and host-parasite interactions. Therefore, we sought to produce a draft genome for S. scabiei and use this to identify molecular markers that will be useful for phylogenetic population studies and to identify candidate protein-coding genes that are critical to the unique biology of the parasite. METHODS: S. scabiei var. canis DNA was isolated from living mites and sequenced to ultra-deep coverage using paired-end technology. Sequence reads were assembled into gapped contigs using de Bruijn graph based algorithms. The assembled genome was examined for repetitive elements and gene annotation was performed using ab initio, and homology-based methods. RESULTS: The draft genome assembly was about 56.2 Mb and included a mitochondrial genome contig. The predicted proteome contained 10,644 proteins, ~67 % of which appear to have clear orthologs in other species. The genome also contained more than 140,000 simple sequence repeat loci that may be useful for population-level studies. The mitochondrial genome contained 13 protein coding loci and 20 transfer RNAs. Hundreds of candidate salivary gland protein genes were identified by comparing the scabies mite predicted proteome with sialoproteins and transcripts identified in ticks and other hematophagous arthropods. These include serpins, ferritins, reprolysins, apyrases and new members of the macrophage migration inhibitory factor (MIF) gene family. Numerous other genes coding for salivary proteins, metabolic enzymes, structural proteins, proteins that are potentially immune modulating, and vaccine candidates were identified. The genes encoding cysteine and serine protease paralogs as well as mu-type glutathione S-transferases are represented by gene clusters. S. scabiei possessed homologs for most of the 33 dust mite allergens. CONCLUSION: The draft genome is useful for advancing our understanding of the host-parasite interaction, the biology of the mite and its phylogenetic relationship to other Acari. The identification of antigen-producing genes, candidate immune modulating proteins and pathways, and genes responsible for acaricide resistance offers opportunities for developing new methods for diagnosing, treating and preventing this disease.

Liu XY, Yang KY, Wang MQ, et al.

High-quality assembly of Dermatophagoides pteronyssinus genome and transcriptome reveals a wide range of novel allergens

[J]. J Allergy Clin Immunol, 2018,141(6):2268-2271.

DOI:10.1016/j.jaci.2017.11.038      URL     PMID:29305317      [本文引用: 4]

Koren S, Walenz BP, Berlin K, et al.

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation

[J]. Genome Res, 2017,27(5):722-736.

DOI:10.1101/gr.215087.116      URL     PMID:28298431      [本文引用: 1]

Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new overlapping and assembly algorithms, including an adaptive overlapping strategy based on tf-idf weighted MinHash and a sparse assembly graph construction that avoids collapsing diverged repeats and haplotypes. We demonstrate that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either Pacific Biosciences (PacBio) or Oxford Nanopore technologies and achieves a contig NG50 of >21 Mbp on both human and Drosophila melanogaster PacBio data sets. For assembly structures that cannot be linearly represented, Canu provides graph-based assembly outputs in graphical fragment assembly (GFA) format for analysis or integration with complementary phasing and scaffolding techniques. The combination of such highly resolved assembly graphs with long-range scaffolding information promises the complete and automated assembly of complex genomes.

Cui Y, Yu L, Teng F, et al.

Transcriptomic/proteomic identification of allergens in the mite Tyrophagus putrescentiae

[J]. Allergy, 2016,71(11):1635-1639.

DOI:10.1111/all.12999      URL     PMID:27496383      [本文引用: 1]

While a number of allergens from house dust mites have been described, much remains to be discovered about allergens from storage mites. Here, next-generation sequencing was combined with MS/MS shotgun proteomics to identify proteins, especially potential allergens from Tyrophagus putrescentiae, commonly found in stored food products, especially flour. cDNAs of suspected allergens were cloned and expressed from bacterial cells, and recombinant allergens were tested for binding to IgE in sera from T. putrescentiae-sensitive patients. These analyses identified three previously uncharacterized allergens, Tyr p 28, Tyr p 35, and Tyr p 36, which have been officially assigned by the WHO/IUIS Allergen Nomenclature Sub-committee. Recombinant proteins rTyr p 28, rTyr p 35, and rTyr p 36 bound with 47.1%, 82.4%, and 70.6% of T. putrescentiae-sensitive sera. We provide here a new method to identify allergens by the combination of transcriptomic and proteomic approaches.

Bordas-Le Floch V, Le Mignon M, Bussières L, et al.

A combined transcriptome and proteome analysis extends the allergome of house dust mite Dermatophagoides species

[J]. PLoS One, 2017,12(10):e0185830.

DOI:10.1371/journal.pone.0185830      URL     PMID:28982170      [本文引用: 2]

BACKGROUND: House dust mites (HDMs) such as Dermatophagoides farinae and D. pteronyssinus represent major causes of perennial allergy. HDM proteomes are currently poorly characterized, with information mostly restricted to allergens. As of today, 33 distinct allergen groups have been identified for these 2 mite species, with groups 1 and 2 established as major allergens. Given the multiplicity of IgE-reactive mite proteins, potential additional allergens have likely been overlooked. OBJECTIVE: To perform a comprehensive characterization of the transcriptomes, proteomes and allergomes of D. farinae and D. pteronyssinus in order to identify novel allergens. METHODS: Transcriptomes were analyzed by RNA sequencing and de novo assembly. Comprehensive mass spectrometry-based analyses proteomes were combined with two-dimensional IgE reactivity profiling. RESULTS: Transcripts from D. farinae and D. pteronyssinus were assembled, translated into protein sequences and used to populate derived sequence databases in order to inform immunoproteomic analyses. A total of 527 and 157 proteins were identified by bottom-up MS analyses in aqueous extracts from purified HDM bodies and fecal pellets, respectively. Based on high sequence similarities (>71% identity), we also identified 2 partial and 11 complete putative sequences of currently undisclosed D. pteronyssinus counterparts of D. farinae registered allergens. Immunoprofiling on 2D-gels revealed the presence of unknown 23 kDa IgE reactive proteins in both species. Following expression of non-glycosylated recombinant forms of these molecules, we confirm that these new allergens react with serum IgEs from 42% (8/19) of HDM-allergic individuals. CONCLUSIONS: Using combined transcriptome and immunoproteome approaches, we provide a comprehensive characterization of D. farinae and D. pteronyssinus allergomes. We expanded the known allergen repertoire for D. pteronyssinus and identified two novel HDM allergens, now officially referred by the International Union of Immunological Societies (IUIS) Nomenclature Subcommittee as Der f 36 and Der p 36.

Zhou Y, Li L, Qian J, et al.

Identification of three aquaporin subgroups from Blomia tropicalis by transcriptomics

[J]. Int J Mol Med, 2018,42(6):3551-3561.

DOI:10.3892/ijmm.2018.3877      URL     PMID:30221673      [本文引用: 1]

Aquaporins (AQPs), or water channel proteins, are highly conserved across species. These transmembrane proteins promote water and solute transport across cell membranes. No AQPrelated proteins have been identified in mites to date. The present study used transcriptomics (RNAsequencing) to identify potential AQPs in the mite species Blomia tropicalis. Molecular cloning techniques were then used to obtain the fulllength gene sequences encoding these AQP family members, and bioinformatics analyses were used to categorize them based on similarity to AQPs in other species. This approach led to the identification of 5 putative AQPcoding sequences, known as BlotAQP15 (GenBank accession numbers: KX655540, KX655541, KX655542, KX655543 and KX655544, respectively), which were indexed into all three subgroups, i.e., AQPs, aquaglyceroporins and superAQPs. To the best of our knowledge, these represent the first known AQPs in any mite species. Further studies are required to investigate their functional roles in water transport and their potential as drug targets.

Haas BJ, Papanicolaou A, Yassour M, et al.

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

[J]. Nat Protoc, 2013,8(8):1494-1512.

DOI:10.1038/nprot.2013.084      URL     PMID:23845962      [本文引用: 1]

De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

Zhao QP, Jiang MS.

Proteomics and its application in parasitology

[J]. Chin J Parasitol Parasit Dis, 2006,24(2):136-139. (in Chinese)

[本文引用: 1]

( 赵琴平, 蒋明森.

蛋白质组学及其在寄生虫学研究中的应用

[J]. 中国寄生虫学与寄生虫病杂志, 2006,24(2):136-139.)

[本文引用: 1]

He DG.

Proteomics and its application in parasitology

[J]. Chin Trop Med, 2003,3(4):507-512. (in Chinese)

[本文引用: 1]

( 何东苟.

蛋白质组学研究及其在寄生虫学上的应用

[J]. 中国热带医学, 2003,3(4):507-512.)

[本文引用: 1]

An S, Chen LL, Long CB, et al.

Dermatophagoides farinae allergens diversity identification by proteomics

[J]. Mol Cell Proteomics, 2013,12(7):1818-1828.

DOI:10.1074/mcp.M112.027136      URL     PMID:23481662      [本文引用: 1]

The most important indoor allergens for humans are house dust mites (HDM). Fourteen Dermatophagoides farinae allergens (Der f 1-3, 6, 7, 10, 11, 13-18, and 22) are reported although more than 30 allergens have been estimated in D. farinae. Seventeen allergens belonging to 12 different groups were identified by a procedure of proteomics combined with two-dimensional immunoblotting from D. farina extracts. Their sequences were determined by Edman degradation, mass spectrometry analysis, and cDNA cloning. Their allergenicities were assayed by enzyme-linked immunosorbent assay inhibition tests, immunoblots, basophil activation test, and skin prick tests. Eight of them are the first report as D. farinae allergens. The procedure of using a proteomic approach combined with a purely discovery approach using sera of patients with broad IgE reactivity profiles to mite allergens was an effective method to investigate a more complete repertoire of D. farinae allergens. The identification of eight new D. farinae allergens will be helpful for HDM allergy diagnosis and therapy, especially for patients without response for HDM major allergens. In addition, the current work significantly extendedthe repertoire of D. farinae allergens.

Choopong J, Reamtong O, Sookrung N, et al.

Proteome, allergenome, and novel allergens of house dust mite, Dermatophagoides farinae

[J]. J Proteome Res, 2016,15(2):422-430.

DOI:10.1021/acs.jproteome.5b00663      URL     PMID:26754146      [本文引用: 1]

Dermatophagoides farinae mite is a predominant source of indoor allergens causing high incidence of allergy worldwide. People with different genetic background respond differently to the mite components, and thus the component-resolved diagnosis (CRD) is preferred to the conventional allergy test based on crude mite extract. In this study, proteome and culprit components in the D. farinae whole body extract that sensitized the allergic patients were studied by using SDS-PAGE (1DE) and 2DE-IgE immunoblotting followed by LC-MS/MS and database search for protein identification. From the 1DE, the mite extract revealed 105 proteins that could be classified into seven functionally different groups: allergens, structural components, enzymes, enzyme inhibitor, receptor proteins, transporters, and binding/regulatory/cell signaling proteins. From the 2DE, the mite extract produced 94 spots; 63 were bound by IgE in sera of 20 D. farinae allergic patients. One more protein that was not revealed by the 2DE and protein staining reacted with IgE in 2 allergic patients. Proteins in 40 spots could be identified as 35 different types. Three of them reacted to IgE of >50% of the allergic patients, and hence they are major allergens: tropomyosin or Der f 10 (75%), aconitate hydratase (70%), and one uncharacterized protein (55%). Aconitate hydratase is a novel D. farinae major allergen unraveled in this study. Several mite minor allergens that have never been previously reported are also identified. The data have clinical applications in the component-resolved diagnosis for tailor-designed allergen-specific immunotherapy.

Kim JY, Yi MH, Hwang Y, et al.

16S rRNA profiling of the Dermatophagoides farinae core microbiome: Enterococcus and Bartonella

[J]. Clin Exp Allergy, 2018,48(5):607-610.

DOI:10.1111/cea.13104      URL     PMID:29381238      [本文引用: 2]

Hubert J, Kopecky J, Perotti MA, et al.

Detection and identification of species-specific bacteria associated with synanthropic mites

[J]. Microb Ecol, 2012,63(4):919-928.

DOI:10.1007/s00248-011-9969-6      URL     PMID:22057398      [本文引用: 1]

Internal bacterial communities of synanthropic mites Acarus siro, Dermatophagoides farinae, Lepidoglyphus destructor, and Tyrophagus putrescentiae (Acari: Astigmata) were analyzed by culturing and culture-independent approaches from specimens obtained from laboratory colonies. Homogenates of surface-sterilized mites were used for cultivation on non-selective agar and DNA extraction. Isolated bacteria were identified by sequencing of the 16S rRNA gene. PCR amplified 16S rRNA genes were analyzed by terminal restriction fragment length polymorphism analysis (T-RFLP) and cloning sequencing. Fluorescence in situ hybridization using universal bacterial probes was used for direct bacterial localization. T-RFLP analysis of 16S rRNA gene revealed distinct species-specific bacterial communities. The results were further confirmed by cloning and sequencing (284 clones). L. destructor and D. farinae showed more diverse communities then A. siro and T. putrescentiae. In the cultivated part of the community, the mean CFUs from four mite species ranged from 5.2 x 10(2) to 1.4 x 10(3) per mite. D. farinae had significantly higher CFUs than the other species. Bacteria were located in the digestive and reproductive tract, parenchymatical tissue, and in bacteriocytes. Among the clones, Bartonella-like bacteria occurring in A. siro and T. putresecentiae represented a distinct group related to Bartonellaceae and to Bartonella-like symbionts of ants. The clones of high similarity to Xenorhabdus cabanillasii were found in L. destructor and D. farinae, and one clone related to Photorhabdus temperata in A. siro. Members of Sphingobacteriales cloned from D. farinae and A. siro clustered with the sequences of

Kopecky J, Perotti MA, Nesvorna M, et al.

Cardinium endosymbionts are widespread in synanthropic mite species (Acari: Astigmata)

[J]. J Invertebr Pathol, 2013,112(1):20-23.

DOI:10.1016/j.jip.2012.11.001      URL     PMID:23147105      [本文引用: 1]

'Candidatus Cardinium' is an intracellular endosymbiont or parasite frequently occurring in invertebrates including mites and ticks. In this work we report Cardinium bacteria in Astigmata mites and explore their incidence in synanthropic species. Amplification of a 776 bp bacterial 16S rRNA gene fragment, using specific primers, enabled identification of closely related Cardinium sequences in 13 laboratory-reared populations of mites. In addition, Cardinium sequences were identified in three wild mite populations. Large scale screening of these populations showed 100% prevalence of Cardinium, representing the highest incidence compared to other major Chelicerate groups.

Santos-Garcia D, Rollat-Farnier PA, Beitia F, et al.

The genome of Cardinium cBtQ1 provides insights into genome reduction, symbiont motility, and its settlement in Bemisiatabaci

[J]. Genome Biol Evol, 2014,6(4):1013-1030.

DOI:10.1093/gbe/evu077      URL     PMID:24723729      [本文引用: 1]

Many insects harbor inherited bacterial endosymbionts. Although some of them are not strictly essential and are considered facultative, they can be a key to host survival under specific environmental conditions, such as parasitoid attacks, climate changes, or insecticide pressures. The whitefly Bemisia tabaci is at the top of the list of organisms inflicting agricultural damage and outbreaks, and changes in its distribution may be associated to global warming. In this work, we have sequenced and analyzed the genome of Cardinium cBtQ1, a facultative bacterial endosymbiont of B. tabaci and propose that it belongs to a new taxonomic family, which also includes Candidatus Amoebophilus asiaticus and Cardinium cEper1, endosymbionts of amoeba and wasps, respectively. Reconstruction of their last common ancestors' gene contents revealed an initial massive gene loss from the free-living ancestor. This was followed in Cardinium by smaller losses, associated with settlement in arthropods. Some of these losses, affecting cofactor and amino acid biosynthetic encoding genes, took place in Cardinium cBtQ1 after its divergence from the Cardinium cEper1 lineage and were related to its settlement in the whitefly and its endosymbionts. Furthermore, the Cardinium cBtQ1 genome displays a large proportion of transposable elements, which have recently inactivated genes and produced chromosomal rearrangements. The genome also contains a chromosomal duplication and a multicopy plasmid, which harbors several genes putatively associated with gliding motility, as well as two other genes encoding proteins with potential insecticidal activity. As gene amplification is very rare in endosymbionts, an important function of these genes cannot be ruled out.

Penz T, Schmitz-Esser S, Kelly SE, et al.

Comparative genomics suggests an independent origin of cytoplasmic incompatibility in cardinium hertigii

[J]. PLoS Genet, 2012,8(10):e1003012.

DOI:10.1371/journal.pgen.1003012      URL     PMID:23133394      [本文引用: 1]

Terrestrial arthropods are commonly infected with maternally inherited bacterial symbionts that cause cytoplasmic incompatibility (CI). In CI, the outcome of crosses between symbiont-infected males and uninfected females is reproductive failure, increasing the relative fitness of infected females and leading to spread of the symbiont in the host population. CI symbionts have profound impacts on host genetic structure and ecology and may lead to speciation and the rapid evolution of sex determination systems. Cardinium hertigii, a member of the Bacteroidetes and symbiont of the parasitic wasp Encarsia pergandiella, is the only known bacterium other than the Alphaproteobacteria Wolbachia to cause CI. Here we report the genome sequence of Cardinium hertigii cEper1. Comparison with the genomes of CI-inducing Wolbachia pipientis strains wMel, wRi, and wPip provides a unique opportunity to pinpoint shared proteins mediating host cell interaction, including some candidate proteins for CI that have not previously been investigated. The genome of Cardinium lacks all major biosynthetic pathways but harbors a complete biotin biosynthesis pathway, suggesting a potential role for Cardinium in host nutrition. Cardinium lacks known protein secretion systems but encodes a putative phage-derived secretion system distantly related to the antifeeding prophage of the entomopathogen Serratia entomophila. Lastly, while Cardinium and Wolbachia genomes show only a functional overlap of proteins, they show no evidence of laterally transferred elements that would suggest common ancestry of CI in both lineages. Instead, comparative genomics suggests an independent evolution of CI in Cardinium and Wolbachia and provides a novel context for understanding the mechanistic basis of CI.

Hubert J, Nesvorna M, Kopecky J, et al.

Population and culture age influence the microbiome profiles of house dust mites

[J]. Microb Ecol, 2019,77(4):1048-1066.

URL     PMID:30465068      [本文引用: 1]

Valerio CR, Murray P, Arlian LG, et al.

Bacterial 16S ribosomal DNA in house dust mite cultures

[J]. J Allergy Clin Immunol, 2005,116(6):1296-1300.

DOI:10.1016/j.jaci.2005.09.046      URL     PMID:16337462      [本文引用: 1]

Erban T, Ledvinka O, Nesvorna M, et al.

Experimental manipulation shows a greater influence of population than dietary perturbation on the microbiome of Tyrophagus putrescentiae

[J]. Appl Environ Microbiol, 2017,83(9):e00128-e00117.

URL     PMID:28235879      [本文引用: 1]

/