Abstract:Retrieving the literatures on human genome sequence analysis from PubMed published from 200111 to 2011511, extracts bibliographic information and carries out co-word analysis, high frequency subject headings are extracted, word matrix, co-occurrence matrix, co-word clustering are formulated. It clarifies that data mining is a good way to reflect development status and research hotspots, so as to provide valuable information to researchers.