国家微生物科学数据中心——国家微生物资源大数据体系建设

范国梅, 孙清岚, 张荐辕, 吴林寰, 马俊才*
中国科学院微生物研究所,北京 100101

摘 要:

国家微生物科学数据中心成立于2019 年,以中国科学院微生物研究所作为依托单位。中心数据资源总量超过6 PB,数据记录数超过52 亿条,数据内容完整覆盖微生物资源、微生物及交叉技术方法、研究过程及工程、微生物组学、微生物技术以及微生物文献、专利、专家、成果等微生物研究的全生命周期。国家微生物科学数据中心通过建设一系列重点数据库构建系统全面的国家微生物大数据体系,涉及全球微生物菌种分类及研究领域、病原微生物研究领域、微生物组研究方面及真菌研究领域,为全球微生物学相关的工作者提供信息服务和交流平台。在新冠疫情期间,中心开发新型冠状病毒国家科技资源服务系统,第一时间建立了全球科学数据发布及共享平台。研发的新型冠状病毒变异评估和预警系统(New Coronavirus Variation Evaluation and Early Warning System, VarEPS),是全球首个对SARS-CoV-2 基因组已知变异及虚拟变异进行多维度风险评估和预警的系统。中心以世界微生物数据中心(World Data Center for Microorganisms, WDCM) 为平台,倡导全球微生物菌种保藏目录(Global Catalogue of Microorganisms, GCM),发起全球微生物模式菌株基因组和微生物组测序合作计划(Global Microbial Type Strain Genome and Microbiome Sequencing Project, GCM 2.0),有效促进了全球微生物资源的共享利用。

通讯作者:马俊才, Email:ma@im.ac.cn

National Microbiology Science Data Center: construction of a national big data system for microbial resources
FAN Guo-Mei, SUN Qing-Lan, ZHANG Jian-Yuan, WU Lin-Huan, MA Jun-Cai*
Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China

Abstract:

The National Microbiology Science Data Center (NMDC) was established in 2019 and is hosted by the Institute of Microbiology, Chinese Academy of Sciences. NMDC holds more than 6 PB of data and more than 5.2 billion data records, covering the whole life cycle of microbial research: microbial resources, microbial and interdisciplinary technical methods, research processes and engineering, microbial omics, microbial technology, and microbial literature, patents, experts, and achievements. Through a series of key databases, NMDC is building a systematic and comprehensive national microbial big data system that spans global microbial strain classification and research, pathogenic microorganisms, microbiomes, and fungi, and it provides information services and a communication platform for microbiology researchers around the world. During the COVID-19 epidemic, NMDC developed the Novel Coronavirus National Science and Technology Resource Service System and promptly established a global platform for releasing and sharing scientific data. It also developed the New Coronavirus Variation Evaluation and Early Warning System (VarEPS), the first system in the world to provide multidimensional risk assessment and early warning for both known and virtual variants of the SARS-CoV-2 genome. Using the World Data Center for Microorganisms (WDCM) as a platform, NMDC has advocated the Global Catalogue of Microorganisms (GCM) and launched the Global Microbial Type Strain Genome and Microbiome Sequencing Project (GCM 2.0), which have effectively promoted the sharing and utilization of global microbial resources.

Corresponding author: MA Jun-Cai, Email: ma@im.ac.cn
