[seek-kr-sms] OBOE clarifications and questions

Joshua Madin madin at nceas.ucsb.edu
Wed Jun 14 15:59:33 PDT 2006


Hi Ferdinando and all,

I've inserted a few comments below --

On Jun 14, 2006, at 9:14 AM, Ferdinando Villa wrote:

> Hi Matt,
>
> a few quick additions to Josh response to clarify your points 4)  
> and 5). I'm
> in a rush so I hope this is readable:
>
> The contextualization relationship (observation -|>> observation)  
> relates a
> "main" observation to others that are not related to the semantics  
> of the
> main observable, but are essential to define the semantics of how  
> it was
> observed. So in your relation R1 we annotate four observations in  
> which
> Biomass is contextualized by Subplot, Nitrogen and Phosporus.  
> Subplot, in
> turn, is contextualized by Plot. Each observation is annotated as a  
> separate
> instance of oboe:Observation, linked through the contextualization
> relationship.
>
I apologize.  I annotated Matt's data sets incorrectly, and I've  
attached alternative versions to this email (Matt_examples_2.pdf).   
The first annotation is what I had yesterday, the first alternative  
is how I think it should be, and the second alternative is what I  
think Ferdinando is suggesting above.  I consider the addition of  
fertilizer a "characteristic" of the subplot (first alt), not an  
observation.  So, plot provides context for subplot, the subplot has  
various characteristics (e.g., amount of added nitrogen), and subplot  
provides context for biomass (of some biological entity).

> The different contexts make R1, R2 an R3 classify as different  
> classes, so
> they would not combine.
>
> This is obviously at a very abstract level, which is intentional.  
> To get to
> the numbers, each observation must define a "value space" as you  
> called it.
> This is not in the current OBOE although I put it in a revised  
> version that
> Josh sent around (as "Observation space"), not yet vetted by the  
> group. The
> value space is the set of all possible values generated by an  
> observation of
> a certain class. So for things like plots/subplots, the value space  
> of the
> space observation that contextualizes biomass is the enumerated set of
> subplots (seen as individuals of an appropriate ontology), whose  
> semantics
> (external to OBOE) will specify labels, polygons, and may require  
> or allow
> further contextualization in higher-level subdivision (plots etc).

I'm really interested in the idea of a value space.  Could you give a  
more specific example of how it would be used, maybe using Matt's  
data sets.  Is this something that we need?  How is it different from  
what we currently have?

> For
> things like measurements, the value space is abstract and defined  
> in terms
> of an abstract unit (the ideal reference subdivision of the space)  
> and a
> "realized unit" that links the abstract unit to the semantic type  
> of what
> the measurement is (a meter of shore, a second of "quality  
> time" :). For
> things like land use, the space is the set of classes that can be  
> used to
> tag each observation value. OBOE currently has a simplified way to  
> describe
> this that probably works, but I now feel is a bit of a shortcut for  
> the
> explicit definition of the observation space.
>
Right. OBOE currently deals primarily with abstract units (or  
standards), characteristics and entities, which together form a  
realized unit... E.g., meter (abstract unit) of height  
(characteristic) of tree (entity).  Composite or semantic units  
already have a characteristic describing them and therefore only have  
abstracts unit and entities.  E.g., The characteristic ArealDensity  
<-- count (abstract unit) of trees (entity) per meters square  
(abstract unit) of forest (entity).

> The remaining problem is how to define the mapping between a set of
> observation values and the values of the observations that provide its
> context. This is necessary unless we see each data point of the main
> observation as an instance of observation, which however would make  
> the size
> of the annotation potentially huge (dependent on the size of the  
> dataset).
> Anyway this is a representational issue more than a semantic one  
> and I don't
> think it should be in OBOE (I have proposed a separate representation
> ontology that also details data formats and parsing modalities,  
> also not
> vetted by the group yet). While a these mappings are likely to be  
> precisely
> definable with full generality only in higher-order logics, well- 
> behaving
> datasets can be annotated by simply specifying the ordering in  
> which the
> contexts are applied. Of course the specification of these can be  
> complex
> and we need to work on this. But this should not change the  
> semantic power
> of using contextualization in OBOE.
>
I've attached Ferdinando's representation.owl ontology.  When opening  
in Protege 3.2 beta, you'll need to point to the local file  
"Observation.owl" which I sent several days ago.
> Does this answer some of your question, or confuse things further?
>
> cheers
> f
>
> --
> Ferdinando Villa, Associate Research Professor, Ecoinformatics
> Ecoinformatics Collaboratory, Gund Inst. for Ecol. Economics and  
> Dept. of
> Botany
> University of Vermont           http://ecoinformatics.uvm.edu
>
>> -----Original Message-----
>> From: seek-kr-sms-bounces at ecoinformatics.org
>> [mailto:seek-kr-sms-bounces at ecoinformatics.org] On Behalf Of
>> Matt Jones
>> Sent: Tuesday, June 13, 2006 7:41 PM
>> To: seek-kr-sms at ecoinformatics.org
>> Subject: [seek-kr-sms] OBOE clarifications and questions
>>
>> Josh,
>>
>> Nice job on OBOE.  I've been looking it over and learning a
>> lot, but there a number of specific areas that I don't
>> understand.  Maybe you could clarify?
>>
>> 0) Is OBOE, and are other ontologies, available on the SEEK
>> wiki?  They should be, and probably as links into the CVS
>> tree.  The site review team requested this.  I linked a
>> couple in to the wiki here:
>>    http://seek.ecoinformatics.org/Wiki.jsp?page=KROntologies
>> Can you do the rest?  Maybe Rich's ontologies should be made
>> available under the 'As-is' Formal Ontologies section for
>> reference purposes?
>>
>> 1) how do counts differ from moles?  Isn't the NIST 'amount'
>> the same as the absolute scale in OBOE?
>>
>> 2) how to deal with log units?
>>
>> 3) How to deal with multiple relations with integrity
>> constraints?  For example, a 'site' table, and a 'tree
>> measurement' table that has a foreign key into the site
>> table.  Can we create annotations that refer to attributes in
>> both tables?
>>
>> 4) context doesn't seem to be enough to handle experiments --
>> it captures some information, such as spatial nesting of
>> experimental units, but it doesn't fully capture the
>> dependency information in tuples.  In particular, it seems to
>> me that experimental manipulations are different from spatial
>> nesting. See example below. Can you clarify?
>>
>> I looked at the GCE examples you sent.  They show the
>> 'Experimental Treatment' as the subject of the Measurement
>> for Treatment (with unit 'Name').  Is this list of subjects
>> controlled?  And is 'Experimental Treatment' a special
>> characteristic that should be treated specially?
>> The value of the Measurement is set to 'N'.  Where is the
>> value space for these treatments defined?  And how does one
>> differentiate between manipualted and control values (ie, in
>> the value column in one experiment the values 0, 5, 10 might
>> indicate control, 5g/m^2, and 10
>> g/m^2 treatments) -- are these defined formally somewhere?
>>
>> Here's an example to expand on #4.  The three relations below
>> (R1, R2, and R3) all measure biomass in subplots within
>> plots.  In R1, the subplots are given both a nitrogen and
>> phosphorus addition treatment which affects the
>> interpretation of the biomass measured.  So biomass has some
>> concrete dependency on the nitrogen and phosphorus treatments.
>>   In relation R2, only a phosphorus treatment is added.  R1
>> and R2 could be combined incorrectly by taking the mean by
>> plot,subplot,nitrogen in
>> R1 and then concatenating with R2. Or it could be correctly
>> combined by taking only those observations in which
>> Phosphorous manipulation is 0 and combining that set of
>> records with those from R2.  Likewise, R3 has no
>> manipulations, so should really only be compared against
>> observations in R2 where the nitrogen treatment is 0 and R1
>> where both nitrogen and phosphorous are 0.  These are the
>> semantics that OBOE should capture reagarding the dependecy
>> between the measured value (biomass) and the manipulated
>> treatments and their levels.  Can OBOE do that?
>>
>> Relation R1
>> ----------
>> Plot   Subplot    Nitrogen    Phosphorous    Biomass
>> 1      A          5           3              56
>> 1      B          0           3              87
>> 1      C          5           0              78
>> 1      D          0           0              24
>> 2      A          5           3              58
>> 2      B          0           3              88
>> 2      C          5           0              76
>> 2      D          0           0              26
>>
>> Relation R2
>> -----------
>> Plot   Subplot    Nitrogen     Biomass
>> 1      LR         5            56
>> 1      LL         0            87
>> 1      UR         5            78
>> 1      UL         0            24
>> 2      LR         5            58
>> 2      LL         0            88
>> 2      UR         5            76
>> 2      UL         0            26
>>
>> Relation R3
>> -----------
>> Plot   Subplot    Biomass
>> 1      1          56
>> 1      2          87
>> 2      1          58
>> 2      2          88
>> 3      1          76
>> 3      2          26
>>
>>
>> Thanks for the clarifications,
>>
>> Matt
>>
>> --
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> Matt Jones                                   Ph: 907-789-0496
>> jones at nceas.ucsb.edu                    SIP #: 1-747-626-7082
>> National Center for Ecological Analysis and Synthesis (NCEAS)
>> UC Santa Barbara     http://www.nceas.ucsb.edu/ecoinformatics
>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> _______________________________________________
>> Seek-kr-sms mailing list
>> Seek-kr-sms at ecoinformatics.org
>> http://mercury.nceas.ucsb.edu/ecoinformatics/mailman/listinfo/
>> seek-kr-sms
>>
>
> _______________________________________________
> Seek-kr-sms mailing list
> Seek-kr-sms at ecoinformatics.org
> http://mercury.nceas.ucsb.edu/ecoinformatics/mailman/listinfo/seek- 
> kr-sms
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Matt_examples_2.pdf
Type: application/pdf
Size: 33808 bytes
Desc: not available
Url : http://mercury.nceas.ucsb.edu/ecoinformatics/pipermail/seek-kr-sms/attachments/20060614/c86b216c/Matt_examples_2-0001.pdf
-------------- next part --------------

-------------- next part --------------
A non-text attachment was scrubbed...
Name: Representation.owl
Type: application/octet-stream
Size: 14026 bytes
Desc: not available
Url : http://mercury.nceas.ucsb.edu/ecoinformatics/pipermail/seek-kr-sms/attachments/20060614/c86b216c/Representation-0001.obj


More information about the Seek-kr-sms mailing list