Abstract: This paper analyzes the core idea and basic steps of the k-means algorithm and, drawing on existing methods that determine initial text cluster centers from frequent word sets, proposes a practical institution clustering method that meets the needs of large-scale institution processing applications. It describes in detail the generation of institution cluster centers, the choice of similarity algorithm, and the number of iterations. Experimental results and practical application show that the method performs well.