由于人的重名现象,人名检索的结果往往是同名的不同人物实体相关网页的混合。重名消解是根据上下文采区分同名的不同人物实体的过程。本文提出了基于相关社区的重名消解方法,采用改进的Espresso算法进行相关社区发现。将每个网页发现的社区应用到两阶段重名消解算法中,并且在WePS-2测试集上进行试验。实验结果表明了该方法的有效性。
Person's names are so ambiguous that the results of searching for a person's name are usually a mixture of pages about namesakes. Person's name disambiguation is a course of distinguishing different person's entities with the same name. The method of person's name disambiguation based on the relevant community was proposed and the modi- fied Espresso algorithm was used to find relevant community for each Web page. The enlarged name sets were applied in the two-stage person's name disambiguation algorithm, and then the algorithm was tested it on the WePS-2 test data- set. The experimental results show the effectiveness of our method.