Logic-based remodeling of the Digital Anatomist Foundational Model. (pdf)

Article PDF cannot be displayed. You can download it here:

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1479995/pdf/

Logic-based remodeling of the Digital Anatomist Foundational Model.

Logic-based Remodeling of the Digital Anatomist Foundational Model Rainer Beck Stefan Schulz Freiburg University Hospital, Department of Medical Informatics (http://www.imbi.uni-freiburg.de/medinf) Abstract This paper describes a development cycle for the engineering of large knowledge bases: A graphical tool is used for editing and the content is transformed into a logic-based representation language. This representation is used to check the consistency of the knowledge base as well as to facilitate the reviewing process. Showing the usefulness of this approach, aspects of the Digital Anatomist Foundational Model will be transformed into a Description Logics representation. We introduce a special modeling technique to account for the representation of the complex part/whole relationships in the biomedical domain. INTRODUCTION As more and more facts are gathered in the domain of life sciences, so do biomedical concept systems grow. The emphasis in most of these systems is on the comprehensiveness of coverage; only a few formalize concept representation. The UMLS [15] is a specific example since it merges vocabularies which differ in terms of conceptualization, resulting in poorly defined semantics. This deficiency can be largely attributed to the different contexts of its source vocabularies. However, there now is an increased trend towards strict semantics, resulting in concept systems capable of supporting formal reasoning [8, 14]. Physiology, pathology, molecular biology and other subdomains are characterizable by the change they inflict on underlying physical structure. Therefore formal representation of the structural sciences (like anatomy, structural biology) is particularly important. So it is to be expected that an elaborated, common structural model would facilitate the conceptualization and representation of all biomedical domains. Moreover this constitutes a critical requirement for a principled representation of knowledge about diagnostic and therapeutic procedures. As a representation of anatomical structure, the Digital Anatomist Foundational Model (FM) [11] is outstanding in respect to coverage and formal strictness: It describes canonical anatomical knowledge using rather precise semantics. The design of the model is guided by the very tight formal Aristotelian principles of genus and differentiae [5] (most prominently imple- mented in the upper level ontology). Taxonomic and partonomic hierarchies are strictly separated. Because of its coverage (more than 67,000 concepts) and its design principles, the FM presents an ideal candidate to be represented in a logic-based language. Whereas the taxonomic structure of the FM may be already interpreted in terms of a formal ontology, the semantics of the relations part-of and has-part needs to be further refined. This is particularly desirable since various fundamental properties of biological entities (like contains, has-function) interact with part / whole relationships between concepts. Therefore, the expressive representation of partonomy is of paramount importance. As a consequence of the comprehensiveness of the FM, ameliorating the part/whole relationships proves to be a complex and challenging task: Small changes of the partonomy may escalate throughout the hierarchy and lead to arbitrarily complex results. In this paper, we propose a development cycle suitable for the engineering of large knowledge bases obeying our principles of “single point of edit” and “minimum manual intervention”. Furthermore, we apply this method to the Foundational Model and show how to transform its taxonomy and partonomy into a representation language with formal semantics based on description logics (DL) [1]. We use a distinctive modeling pattern to address the specific needs for representing part/whole relations. DEVELOPMENT CYCLE The engineering of large knowledge bases on code level can be a daunting if not impossible task. To support the knowledge engineer in the process, we propose the following cycle (cf. Figure 1): 1. Concept Creation or Change In order to effect all modifications in a precise manner, the knowledge engineer must be able to identify the context he or she is working in: Which concepts are defined? Which relations hold between them? This process can be facilitated if there exists a comprehensive graphical representation including search and navigation facilities sufficient for the task. However, significant advantage will be lost if parts of the editing are left to be made outside this AMIA 2003 Symposium Proceedings − Page 71 ! ! ! Figure 1: Development Cycle environment because of lack of support. The “single point of edit” provided by the environment should not be compromised. We consider the kind of environment described a necessity for working with large knowledge bases. 2. Consistency Check After changing or adding to the knowledge represented, consistency checks should be performed: Semantic integrity is easily lost in large knowledge bases, especially when role propagation 1 along several axes is involved. Therefore, we propose at least semi-automatical checks by an inference engine which can provide a status report. Thus, inconsistencies as well as terminological cycles will be identified and can be corrected by iterating the first step. It is also highly desirable to integrate these mechanisms into the editing environment for automatic consistency checks. 3. Adequacy Check For good reasons, implicit knowledge as well as complete role propagation are usually not fully accessible in a knowledge editing tool. As these inferences are prerequisites for checking the adequacy of the representation entered, they have to be made accessible to the knowledge engineer: Any inference engine used should provide a suitable display interface to allow the analysis of all concepts and the accuracy of their representation by the knowledge engineer. Should the representation be incomplete or inadequate, steps one and two might be repeated until the results are satisfactory. 4. Completeness Check The criteria of adequacy being satisfied, the check for completeness may take place in which the knowledge engineer compares the coverage in the representation with the domain to be described. The display and browsing tools required for checking adequacy may be used for this purpose. Eventually, the process results in a valid and comprehen1 Inheritance sive knowledge base representing the domain of interest. If the transition between the steps described is well automated, it will be possible to handle large and very large knowledge bases with “minimum manual intervention”. The danger of losing track of the modifications can be minimized and their respective effects on the representation can be controlled at the same time. The merging of all steps into an editing tool would be the ultimate solution. However, considering the complexity involved with reasoning, online processing and feedback seems rather unlike (...truncated)