A Knowledge Representation Based User-Driven Ontology Summarization Method

Yuehang DING  Hongtao YU  Jianpeng ZHANG  Huanruo LI  Yunjie GU  

IEICE TRANSACTIONS on Information and Systems   Vol.E102-D   No.9   pp.1870-1873
Publication Date: 2019/09/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2019EDL8069
Type of Manuscript: LETTER
Category: Data Engineering, Web Information Systems
ontology,  ontology summarization,  semantic distance,  K-means++,  

Full Text: PDF(375.3KB)>>
Buy this Article

As the superstructure of knowledge graph, ontology has been widely applied in knowledge engineering. However, it becomes increasingly difficult to be practiced and comprehended due to the growing data size and complexity of schemas. Hence, ontology summarization surfaced to enhance the comprehension and application of ontology. Existing summarization methods mainly focus on ontology's topology without taking semantic information into consideration, while human understand information based on semantics. Thus, we proposed a novel algorithm to integrate semantic information and topological information, which enables ontology to be more understandable. In our work, semantic and topological information are represented by concept vectors, a set of high-dimensional vectors. Distances between concept vectors represent concepts' similarity and we selected important concepts following these two criteria: 1) the distances from important concepts to normal concepts should be as short as possible, which indicates that important concepts could summarize normal concepts well; 2) the distances from an important concept to the others should be as long as possible which ensures that important concepts are not similar to each other. K-means++ is adopted to select important concepts. Lastly, we performed extensive evaluations to compare our algorithm with existing ones. The evaluations prove that our approach performs better than the others in most of the cases.