[SEEK-Taxon] Thoughts on GUIDs

Nico M. Franz franz at nceas.ucsb.edu
Thu May 27 13:27:14 PDT 2004


Hi Rich, et al.:

    I find this discussion quite productive! The more I think about the 
concepts "definitions" and concept relations, the less I'm convinced it's 
something *we* ought to invest too much emotion in. Meaning: we need to 
look at practice and get a grasp of the RANGE of notions of stability and 
change perceived by taxonomists within and between taxonomic treatments. 
Then try to produce a system that can support (most of, at least) that 
range. Then let taxonomists play with it.

    Some will define "sameness" in terms of "subsumed constituents" (I 
believe that's an ostensive definition; "pointing at" something). Others 
may consider things like diagnoses of (evolving) properties as a more 
useful criterion for sameness (intentional, though the actual extension is 
subject to argument). Then there's already nomenclatural synonymy 
(referring to the identity of types) which does some but not a lot of 
cognitive work. Each way to look at it comes with strengths and weaknesses. 
Humans usually pick and choose what's best for their own understanding, 
which may be different from what a machine can understand. But a machine 
CAN spit back "congruent (=) in terms on properties [acc. to specialists 
X], yet overlapping (><) in terms of constituents [list of things now 
excluded yet then included; and vice-versa]".

    When computers handle this kind of information to make calls about 
concept synonymy, they may perform more reliably with ostensive 
definitions, especially at lower taxonomic levels. To capture what humans 
sense about the stability of meanings of names, particularly for 
higher-level taxa, properties connected to a small sample of exemplars 
(i.e. a sophisticated way to do typification) will probably take over.

    I believe I'm not just making this up, but rather, this seems to be 
what the German moss list authors - my personal and unfortunately very 
anonymous concept champs - ended up doing. They started with the 
constituent approach, and then abandoned it each time it "felt" wrong based 
on what they knew about properties. They never went beyond genus, I ASSUME 
because things started getting very awkward (not for computers, though!).

    So you're right, Bill, there are multiple ways to read an "=" sign. I 
think we need to make those alternatives explicit and available side by 
side, rather than put all our own money on just one of them. Apparently, 
that's not what practitioners (should) do.

Let's keep this going!

Cheers,

Nico



At 09:27 AM 5/27/2004 -1000, Richard Pyle wrote:
> > guid() is the function that, given a concept returns its guid. Whether
> > it comes from the concept instance document or the algorithm that
> > assigns it ought to be immaterial.
> >
> > S here was meant to be the relation "isTheSameConceptAs". So in natural
> > language, that expression would be the requirement that
> >
> >    c1 is the same concept as c2 if, and only if, their guids are identical
>
>I don't mean to sound like Bill Clinton in trying to define the word
>"is"....but in this case, I think it's important to define what we really
>mean by "same concept as".  From how you describe it, I see "same concept
>as" in the very strictest sense.  For example, if I have a dataset that
>reports Species X from Locality Y, that dataset would need to assign
>"Species X" to a particular concept GUID (of which there may be many for
>"Species X"), and then that concept GUID is matched indentically to some
>universal metadataset for the GUID, as well as to other datasets that
>reference the same GUID.
>
>This is a very different "same concept as" function of "S" than would be
>another core component of the SEEK model, which is to make the statement:
>
>"Concept GUID 1234 is congruent with concept GUID 4567"
>
>They maintain their separate GUID values because they are *potentially*
>different (that is, that congruency is an interpretation, rather than an
>objective statement).
>
> >   If c3 is a synonym of c2 and c2 is a synonym of c1, then c3 is a
> > synonym of c1
>
>I take it that your use of "is a synonym of" is different from "is the same
>concept as" (i.e., that c1, c2, and c3 would necessarily have different
>GUIDs)?
>
>Aloha,
>Rich
>_______________________________________________
>seek-taxon mailing list
>seek-taxon at ecoinformatics.org
>http://www.ecoinformatics.org/mailman/listinfo/seek-taxon




More information about the Seek-taxon mailing list