基于Hadoop研发新药文献检索系统 |
投稿时间:2016-02-25 点此下载全文 |
引用本文:梁永浩,陆涛,赵鸿萍.基于Hadoop研发新药文献检索系统[J].医学信息学杂志,2016,37(5):73-78 |
摘要点击次数: |
全文下载次数: |
|
|
中文摘要:为提高新药文献检索的效率,研发基于Hadoop的分布式新药文献检索系统。系统包括全文检索和化学结构式检索两大部分,其中全文检索基于关键技术Lucene和Hadoop实现;化学结构式检索则使用Hbase存储结构式的SMILES码和连接表,基于图同构算法VF2对结构式进行匹配。 |
中文关键词:Hadoop Lucene 分布式检索 结构式检索 |
|
Developing New Medicine Literature Retrieval System Based on Hadoop |
|
|
Abstract:In order to improve the efficiency of retrieving literature on new medicine, a distributed new medicine literature retrieval system is developed based on Hadoop. This system contains two parts: full-text retrieval and chemical structural formula retrieval. The former is implemented based on the key technologies of Lucene and Hadoop. The latter uses Hbase to store structured SMILES and connection tables and matches the structural formula based on graph isomorphism algorithm VF2. |
keywords:Hadoop Lucene Distributed retrieval Structure retrieval |
查看全文 查看/发表评论 下载PDF阅读器 |