In order to improve the efficiency of retrieving literature on new medicine, a distributed new medicine literature retrieval system is developed based on Hadoop. This system contains two parts: full-text retrieval and chemical structural formula retrieval. The former is implemented based on the key technologies of Lucene and Hadoop. The latter uses Hbase to store structured SMILES and connection tables and matches the structural formula based on graph isomorphism algorithm VF2.