A novel paradigm for cell and molecule interaction ontology: from the CMM model to IMGT-ONTOLOGY
Pappalardo et al. Immunome Research 2010, 6:1
http://www.immunome-research.com/content/6/1/1
IMMUNOME RESEARCH
RESEARCH
Open Access
A novel paradigm for cell and molecule
interaction ontology: from the CMM model to
IMGT-ONTOLOGY
Francesco Pappalardo1,2*, Marie-Paule Lefranc3, Pier-Luigi Lollini4, Santo Motta2
Abstract
Background: Biology is moving fast toward the virtuous circle of other disciplines: from data to quantitative
modeling and back to data. Models are usually developed by mathematicians, physicists, and computer scientists
to translate qualitative or semi-quantitative biological knowledge into a quantitative approach. To eliminate
semantic confusion between biology and other disciplines, it is necessary to have a list of the most important and
frequently used concepts coherently defined.
Results: We propose a novel paradigm for generating new concepts for an ontology, starting from model rather
than developing a database. We apply that approach to generate concepts for cell and molecule interaction
starting from an agent based model. This effort provides a solid infrastructure that is useful to overcome the
semantic ambiguities that arise between biologists and mathematicians, physicists, and computer scientists, when
they interact in a multidisciplinary field.
Conclusions: This effort represents the first attempt at linking molecule ontology with cell ontology, in IMGTONTOLOGY, the well established ontology in immunogenetics and immunoinformatics, and a paradigm for life
science biology. With the increasing use of models in biology and medicine, the need to link different levels, from
molecules to cells to tissues and organs, is increasingly important.
Introduction
Biology is a knowledge-based discipline. Many predictions and interpretations of biological data are made by
comparing the data against existing knowledge. Traditionally, the knowledge base in biology has resided
within the heads of experienced scientists who have
devoted much study and became experts in their particular domain. This approach worked well in the past,
when considerable effort was needed to tease a few new
data out of biological experiments. However, this situation is changing rapidly, and biology is moving fast
toward the virtuous circle of other disciplines: from data
to quantitative modeling and back to data. Models are
usually developed by mathematicians, physicists, and
computer scientists to translate qualitative or semiquantitative biological knowledge into a quantitative
approach [1].
* Correspondence:
1
Institute for Computing Applications ‘M. Picone’, National Research Council
(CNR), Rome, Italy
To eliminate semantic confusion between biology and
other disciplines, it is necessary to have a list of the
most important and frequently used concepts coherently
defined so that involved people could use such a set of
definitions to create new models and software, to provide an exact, semantic specification of the concepts
used in an existing schema and to curate and annotate
existing database entries consistently. We notice here
that it is important to understand that semantic ambiguities also can arise between human experts. However, in
the course of a conversation usually enough background
knowledge and context is available so that semantic
ambiguities are most often faster resolved than even
consciously recognized. This is possible because of our
intelligent capabilities which computers, programs and
databases, at least for the near future, fall yet short of.
An ontology describes basic concepts in a domain and
defines relations among them. Basic building blocks of
ontology design include concepts and their instances;
properties of each concept describing various features
© 2010 Pappalardo et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative
Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and
reproduction in any medium, provided the original work is properly cited.
Pappalardo et al. Immunome Research 2010, 6:1
http://www.immunome-research.com/content/6/1/1
and attributes of the concept (slots, sometimes called
roles or properties); restrictions on slots (facets, sometimes called role restrictions). An ontology provides a
common vocabulary for researchers who need to share
information in the domain and allows to build knowledge databases. Ontologies are widely used in biology
and medicine and several important ontology systems
have been established. They contribute to a precise and
exhaustive way to access bio-information and define
concepts in a precise and rigorous way [2-9]. Interestingly, despite or because of the complexity of the
immune response, IMGT-ONTOLOGY, the first ontology for immunogenetics and immunoinformatics, is also
conceptually one of the more advanced biological ontologies [2-6], on which has been built IMGT®, the international ImMunoGeneTics information system® http://
www.imgt.org[10].
Other important efforts are underway to link models
at different scales by means of markup languages (i.e.
XML). CellML project is one of this http://www.cellml.
org. The CellML language is an open standard based on
the XML markup language. CellML is being developed
by the Auckland Bioengineering Institute at the University of Auckland and affiliated research groups. The
purpose of CellML is to store and exchange computerbased mathematical models. CellML allows scientists to
share models even if they are using different modeling
tools. It also enables them to reuse components from
one model in another, thus accelerating model
development.
Whereas usually ontologies led to knowledge databases, in what follows, we adopted another approach in
which concepts for an ontology of cell and molecule
interaction were generated starting from an agent based
model (ABM), the Catania Mouse Model (CMM for
short) and its computer implementation, the SimTriplex
simulator [11,12]. SimTriplex simulates the immune system response elicited by the Triplex vaccine [13,14]
against mammary carcinoma. This effort provides a
solid infrastructure that is useful to overcome the
semantic ambiguities that arise between biologists and
mathematicians, physicists, and computer scientists,
when they interact in such a multidisciplinary field.
The development of ontologies for molecular and cellular biology information, and the sharing of those
ontologies within the bioinformatics community, are
central problems in bioinformatics. If the bioinformatics
community is to share ontologies effectively, ontologies
must be exchanged in a form that uses standardized
syntax and semantics. For this reason, while the initial
motivation of our study was to present an ontology for
the CMM, the paradigm we show here has wider applications as it bridges the molecule ontology with cell
ontology (Figure 1). This is achieved by defining, at the
Page 2 of 11
same time, interactions (...truncated)