DODDLE II: A Domain Ontology Development Environment Using a MRD and Text Corpus

Masaki KUREMATSU  Takamasa IWADE  Naomi NAKAYA  Takahira YAMAGUCHI  

IEICE TRANSACTIONS on Information and Systems   Vol.E87-D   No.4   pp.908-916
Publication Date: 2004/04/01
Online ISSN: 
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Section on Knowledge-Based Software Engineering)
Category: Knowledge Engineering and Robotics
ontology,  concept-hierarchy,  concept-relationship,  MRD,  text corpus,  co-occurrence,  association rule,  

Full Text: PDF>>
Buy this Article

In this paper, we describe how to exploit a machine-readable dictionary (MRD) and domain-specific text corpus in supporting the construction of domain ontologies that specify taxonomic and non-taxonomic relationships among given domain concepts. In building taxonomic relationships (hierarchical structure) of domain concepts, some hierarchical structure can be extracted from a MRD with marked subtrees that may be modified by a domain expert, using matching result analysis and trimmed result analysis. In building non-taxonomic relationships (specification templates) of domain concepts, we construct concept specification templates that come from pairs of concepts extracted from text corpus, using WordSpace and an association rule algorithm. A domain expert modifies taxonomic and non-taxonomic relationships later. Through case studies with "the Contracts for the International Sales of Goods (CISG)" and "XML Common Business Library (xCBL)", we make sure that our system can work to support the process of constructing domain ontologies with a MRD and text corpus.