Calendar dates are interval scale YES

David Blankman dblankman at lternet.edu
Thu Oct 31 14:38:25 PST 2002


I participated in only part of the dateTime discussion. At first it 
seemed to me that treating dates as ordinal made sense, but  I think 
that Tim has a valid point: use the guiding prinicipal of useability 
rather than philosopohical correctness.  While it is true that time is 
very problematic when looked at from a deep strucure philosophical 
perspective, it is also true that  people add and subract dates all the 
time..Most of ecology functions in a Newtonian world, free from quantum 
mechanical and relativistic paradoxes and fluidity of time and space.

Many measurements done in ecology have limited precision. A 1-meter plot 
is probably 1-meter plus or minus some centimeters (2 - 3 cm maybe). 
Coverage estimates have even less precision. Why should we be concerned 
about an even smaller lack of precision in dates?

Processing systems can be built to factor in the complex calendar rules. 
EML doesn't have to do that.  

I think that Tim's suggestion of units makes great sense, as long as the 
person documenting a dataset does not imply greater precision than is 
appropriate.


David


Tim Bergsma wrote:

>I'm not on IRC, so if you want to hash this there, call me at
>269-671-2337.
>
>We can't rehash forever, but this is a usability issue of the first
>order.
>
>There are two problems with yesterday's conference call consensus
>regarding datetime:  1) we provide no mechanism for handling durations;
>2) calendar dates are interval scale not ordinal scale.
>
>Regarding durations, one might argue that we provide xs:duration in the
>kludge of the ordinal measurementScale.  But I looked at the
>representation of xs:duration
>(http://www.w3.org/TR/xmlschema-2/#duration), and quite frankly, no one
>has duration data in that format!  EML has to handle data like this:
>
>Watershed	YearOfClearCut	YearsToReforestation
>W3		1887		40
>LittleCreek	1910		35
>JasperRidge	1950		52
>
>-or-
>
>EggMass		DateOfLaying	DaysTillHatching
>DuckPond	5-15-2000	30
>LittleCreek	4-31-2000	18
>GullLake	6-1-2000	16
>
>Recommendation:  we should provide categories in the unitDictionary such
>as nominalYears, nominalDays, nominalMonths, nominalHours, etc. (or
>YearsDuration, DaysDuration, etc) and define them in conventional terms,
>explicitly acknowledging lack of precision.  For instance, a
>nominalMinute is 60 seconds, +/- 1 second. A nominalYear is 365
>nominalDays, +/- 1 day.  xs:gYear is fine for YearOfClearCut, but
>xs:duration will not be adequate for YearsToReforestation.
>
>Regarding scale: I'm convinced that ordinal scales are simply ranked
>categories.  You don't do math on ranked categories, other than to test
>for order relations.  But we do lots of math on CalendarDates, such as
>taking the difference between two dates, or adding a duration to a
>date.  The objection is raised that the duration of sub-units of the
>Calendar are not constant.  True, but we do the math, still the same, so
>it must be an interval scale.  Actually, it is a deeply nested
>concatenation of interval scales of varying domain.  But the scale is
>completely determined, and even naive calculations are valid, albeit
>with qualified precision, while sophisticated calculations are exact.  I
>found one webpage that explicitly assigns calendar dates to interval
>scale: http://www.rattlesnake.com/notions/guttman-scales.html.
>
>So, modeling DateTime etc. under ordinal is wrong.  But if we provide
>DateTime etc. under interval MeasurementScale, what are the units? 
>DateTime does have units (year-month-day-hour-min-sec) , but they are
>concatenated.  The concatenation is a mechanism for traversing the
>nested tree of (arbitrary, often-non periodic) interval scales that
>comprise the calendar.  I think, as someone suggested yesterday, we will
>have to provide a notation for indicating date format, such as
>CCYY-MM-DD or MM-DD-YY, etc.  Applications will need the notation as a
>key for digesting date strings.  We can't expect eml authors to change
>their data to conform to some format. Given the ubiquity of date/time
>data, we either have to enumerate some common formats (unit
>concatenations) or provide a notation for describing formats.
>
>And this just in...Campbell data loggers everywhere are storing dates as
>as a field pair:  Year and DayOfYear.  This just proves that there are
>alternate ways of traversing a nested interval scale.  This is perhaps
>our last opportunity to trap DayOfYear and do something meaningful with
>it.  It is not a duration.  It has exactly the same properties as
>xs:gMonthDay:
>
>"[Definition:]   gMonthDay is a gregorian date that recurs, specifically
>a day of the year such as the third of May. Arbitrary recurring dates
>are not supported by this datatype. The ·value space· of gMonthDay is
>the set of calendar dates, as defined in § 3 of [ISO 8601].
>Specifically, it is a set of one-day long, annually periodic instances."
>
>Solutions welcome.
>
>Tim.
>  
>

-- 
David E. Blankman
Database Integration Developer
Long Term Ecological Research  (LTER) Network Office
University of New Mexico
801 University, SE #104
Albuquerque, NM 87106
Phone 505/272-7346  fax 505/272-7080 






More information about the Eml-dev mailing list