[seek-kr-sms] Re: Thoughts on GUIDs - ontologies
Nico M. Franz
franz at nceas.ucsb.edu
Wed Jun 2 09:56:50 PDT 2004
Hi Shawn:
that's a nice little paper for an outsider to look at. I'm writing this
mainly to give the "ontologists" in SEEK a bit of an idea about biological
systematics. It's tempting to think about how taxonomic concepts ought to
look like so that computers can understand a lot about them.
Though I must say I started stumbling already on page 1, when the
authors started talking about more, or less rigid essences.
I suppose you can look at most comprehensive and internally consistent
biological classifications as ontologies. In the Linnean system, these tend
to be hierarchical (ranks), and higher taxa subsume lower ones. A genus
includes (necessarily) at least one species. The hierarchy is based on
perceived similarities and differences in traits. These can range from DNA
base pairs to bird songs. So far so good.
Systematists could observe all kinds of similarities, e.g. whether an
ant was collected by Shawn Bowers on the roof on the SDSC (check: yes), or
not (check: no). What they strive to observe specifically to figure out the
natural (evolutionary, phylogenetic) relationships among organisms, is
called special similarity, or homology. Homology is similarity due to
common ancestry. It's usually (i.e. in systematics) applied to traits fixed
among species and at higher levels, not among populations whose trait
frequencies still vary due to continuous interbreeding. Example of a good
homology: the wings of bats and birds are homologous AS tetrapod (these are
land vertebrates) fore arms.
Now, it seems to me that the essence/rigidity notions are (not yet?)
sophisticated enough to describe what systematists do. First of all,
homologies are inherently non-rigid. Nearly all traits vary somewhat among
individuals pertaining to a species. They'll quite often vary considerably
among species pertaining to a genus that is nevertheless said to have a
particular trait. This is a feature of traits like wings or feathers being
brought forth by thousands of DNA bases and being controlled by multiple
genes, all of which are potentially subject to modification. So the
"essential non-rigidity" (pun intended) goes all the way down. We have lots
of reasons to believe it's real. At a larger time scale, we can talk of
birds being dinosaurs and the birds' feathers being homologous to the
dinosaurs' reptile-like scales. Meaning that there's a historical,
evolutionary identity that one might reasonably propose, in spite of the
fact that there has been so much transformation that a lot of the DNA
responsible for producing bird feathers no longer looks like that which
once brought forth dinosaur scales. So the "is homologous to" can be a
fairly sophisticated way of saying "is a". And scientists can have
alternative, almost equally plausible solutions to this, backed up by
varying amounts of relevant yet conflicting observations.
Which leads me to the second point. Even though the properties of a
taxon, in the evolutionary (homologous) sense, are in some way necessary,
they're not necessarily obvious at the moment that taxon is first named and
classified. That can be the case regardless of whether one gets the
classification right or wrong. So one could correctly recognize and name
what ultimately turns out to be a valid genus of ants, yet get most of the
features that make it a natural evolutionary entity wrong. Or just not
mention them in a 1758 Latin publication. In modern days, many systematists
probably consider the naming/classifying business as a final, almost
trivial step. The discussions really revolve around what's the right/best
kind of evidence that a taxon is natural or not. For example: flies have a
set of highly modified hind wings (called halteres) and so do the fore
wings (!) of an enigmatic lineage of parasitic insects (Strepsiptera in
Italian). Whether these are homologous or not depends on whether flies and
these parasites are one or two independent lineages. Lots of genes are
being sequenced to figure this out, and the morphology is being reanalyzed
too. There's still no convincing solution. People also try to see whether
developmentally it's possible to have a mutation that produces halteres on
another body segment (fore to hind wings). Etc. Once it's all said and
done, those parasites may truly have "halteres" (not just things that look
like them), and also be classified in a new way. So essences can be
necessary (must be there always) and contingent (we won't see them until
very late) at the same time.
I wrote this partly because so far I've had this hunch that CS/AI
ontologists (what's a proper name? OntoClean sounds like a mouthwash, I'm
sure it'll be synonymized soon) draw most of their examples from
classification where humans STIPULATE properties. In systematics, *we*
slowly try to work them out. Essences in taxonomy are ontologically soft
(due to evolution), and in any case hard to figure out. Yet they're the
backbone for building the hierarchies and naming taxa. I don't think that's
impossible to incorporate in ontologies, though *I* haven't seen it yet.
Has it been done? One way might be to represent properties
probabilistically. "Is a, with a chance of 80%, according to study X of
person Y." Maybe this makes no sense (yet)...
BTW, various of the other topics dealt with in the paper come up in
systematics too. And philosophers of science have wondered for 50 years
whether species are "classes or individuals". The really seem to be both.
Cheers,
Nico
At 12:38 PM 5/25/2004 -0700, Shawn Bowers wrote:
>Beach, James H wrote:
snip
>>If the key itself has information then you will inevitably run into a
>>situation where the key will need to be changed because something about
>>the information represented by the key value has changed or is in doubt
>>or is a matter of interpretation, (thus losing the temporal uniqueness of
>>the GUID).
>
>Again, then the information used as the key isn't really "identifying"
>information, and you have a problem anyway.
>
>There is a very interesting article that people may want to read
>concerning properties of things and classification, including identity and
>unity, that may be relevant to what taxon is trying to accomplish with
>concepts.
>
>The paper can be found here, and was published in the Communications of
>the ACM in 2002. There are longer, more detailed versions available, but
>this is a good primer.
>
>http://www.loa-cnr.it/Papers/CACM2002.pdf
>
>snip 2
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mercury.nceas.ucsb.edu/ecoinformatics/pipermail/seek-kr-sms/attachments/20040602/0608e8fb/attachment.htm
More information about the Seek-kr-sms
mailing list