Abstract: Taking the Hospital Information System (HIS) as the basis of study, this paper analyzes the methods and process for extracting clinical research data from the perspective of data sources, according to the classification of clinical medical texts. Natural Language Processing (NLP) and database retrieval techniques are used to extract and mine medical texts; the approach achieves good results and supports clinical research.