[Fwd: protocol, methods, and project]

Tim Bergsma tbergsma at kbs.msu.edu
Wed Aug 28 14:57:35 PDT 2002


David,

Good points.  I agree with the way you've reclassified my example.

Tim.

> David Blankman wrote:
> 
> Tim et al,
> 
> I understand your issues about where do I put what (in dataset? , in
> project? in dataTable?)
> 
> The following is not meant to imply a need to change the EML model,
> but to comment on some of the issues that Matt and Tim raise. The
> Project module has been a source of confusion from its inception (if I
> remember correctly Project started out at a level equal to that of
> dataset). The confusion stems in part from the fuzziness of the
> concept "Project" and to a certain extent "Dataset".:
>  1. The scope of "project" is variable and indeterminate.
> 
>  2. The fact that project is contained within dataset adds to the
> confusion, since normally we would view a dataset as part of a
> project. ["Contained within" may not be technically correct in XML
> Schema terms but it is the way that most of us would describe it].
> 
>  3. The change in the packaging concept from triples to containment
> makes some things clearer but makes the question of "what goes where"
> more important. .
> 
>  4. For some, perhaps many, of the LTER information managers, the term
> "dataset" is is used to describe what is represented in EML as
> "dataTable", while the term  "project"  is used to describe what is
> represented in EML as "dataset".
> 
> I would suggest that  in Tim's example, "
> 
>  "For instance, at KBS, our
> mainsite layout is a randomized complete block agricultural
> experiment,
> installed in 1986.  That's a project, with a method, and created no
> data
> per se.  All of our main datasets, however, each with their own
> sampling
> techniques (methods/protocols) implicitly rely on the project method."
> 
> This information would logically go in
> Project/designDescription/paragraph (or whatever paragraph becomes).
> Perhaps the designDescription module should also be able to reference
> a resource-level protocol, although that may be possible already
> through the use of references. If I understand the distinction Peter
> makes between protocol and method, then anything at the project level
> would be a protocol.
> 
> Specific methods then belong at the dataTable level. Since "project"
> is optional, dataTable then needs to be able to include protocols as
> well as methods.
> 
> While I agree with Matt on trying to minimize the places where
> elements reside, the needs of site-based programs may be different
> from those of individual ecologists. An LTER site may need/want to
> have a richer set of items at the project level than an individual
> ecologist might need. Over time, I can foresee variations in Morpho
> configurations (or other EML tools) that might give users different
> recommended eml subsets in the same way that Quickbooks does for
> helping businesses choose account configurations that are relevant to
> their business, e.g. service vs retail vs manufacturer.
> 
> David
> 
> Tim Bergsma wrote:
> 
> > Matt, Peter,
> >
> > I'm scrambling to keep up here, but two comments:
> >
> > I think project needs its own method.  In fact, the best reason for
> > declaring a project is to document methods that form the context for
> > multiple datasets.  Advice to the user:  if a method is dataset
> > specific, associate it with dataset.  If datasets share an
> > experimental
> > context, declare a project and document the activities that created
> > the
> > experimental context as project/method.  For instance, at KBS, our
> > mainsite layout is a randomized complete block agricultural
> > experiment,
> > installed in 1986.  That's a project, with a method, and created no
> > data
> > per se.  All of our main datasets, however, each with their own
> > sampling
> > techniques (methods/protocols) implicitly rely on the project
> > method.
> >
> > I have no objection to simplifying method by dropping software,
> > instrumentation, etc and just making it a textType.  It solves some
> > of
> > the problems I mentioned in my last email.  I have hesitated to
> > suggest
> > this, because I don't know the history of why they are there.
> >
> > regards,
> >
> > Tim.
> >
> > -------- Original Message --------
> > Subject: protocol, methods, and project
> > Date: Wed, 28 Aug 2002 09:35:59 -0800
> > From: Matt Jones <jones at nceas.ucsb.edu>
> > To: Peter McCartney <peter.mccartney at asu.edu>
> > CC: "Eml-Dev (E-mail)" <eml-dev at ecoinformatics.org>
> > References: <11232E890694F74893375BC7AD88A62DAAB20E at MAINEX4.ASU.EDU>
> >
> > Hi Peter,
> >
> > So I looked over your protocol and methods changes as well.
> > Excellent
> > start. Some comments...
> >
> > I like the idea of distinguishing between a method that is ad-hoc
> > from a
> > protocol that is a resource-level version of a method.  And I agree
> > that
> > protocol should be modified to import eml-method so that they are
> > identical underneath in structure (except that protocol would have
> > the
> > resource fields).  Will you do that as well?
> >
> > I noticed that you removed the recursion from methodStep, although
> > methodSteps are repeatable.  This has some interaction with the
> > "procedure/step/substep" debate that's been happening in the
> > eml-text
> > thread.  Personally, I think it would be better if EML defined the
> > methodStep as repeatable and recursive, and left
> > "procedure/step/substep" docbook tags out of the eml-text module.
> > Or I
> > could be persuaded to leave it out of eml-method and put it in
> > eml-text.
> >   What I think would be bad is putting it in both places.  That
> > would
> > just make it confusing to end users who are trying to document their
> > procedures.
> >
> > I also was a bit confused about the inclusion of "protocol" in
> > method.
> > If a user has a protocol to describe, should they use the methodStep
> > block under method, or under method/protocol?  How do they decide?
> >
> > The single most frequent question from our EML/Morpho users in our
> > seminars, and the one that frustrates users to no end, is: "I have
> > piece
> > of information, in which of these X places should i put it in eml?".
> > Basically, they have a very difficult time distinguishing the intent
> > in
> > eml of reused element trees.  For example, they might say "data was
> > collected in 1999", and want to know if they should put that piece
> > of
> > information in dataset/coverage or dataset/dataTable/coverage.
> > Legitimate question, subtle distinction.  There are hundreds of
> > these
> > subtleties in EML, which add up to massive confusion to users.
> >
> > So how does this relate to protocol?  By sprinkling
> > "protocol/method"
> > elements throughout eml, users will have lots of choices of where to
> > put
> > things, and almost no understanding of why things should go in one
> > place
> > over another.  In general, if we can, I would like to see us come up
> > with a solid rationale and use case for each use of "method" in eml.
> > If
> > we can avoid it, I'd prefer to not have two or more locations for
> > information that can be interpreted identically (e.g., method in
> > dataset
> > and project).  Soo....can we consolidate the method references
> > somewhat?
> >   I'm happy to move it to dataset if that's where people want to see
> > it,
> > but I propose that we should not have it in both dataset and
> > project.
> >
> > I'm also a little confused about the whole method structure itself
> > -- it
> > seems overly complex for what it does.  This is not information that
> > we
> > intend to machine process at this point.  So why are instrumentation
> > and
> > software broken out from the description?  Can't methodStep just be
> > a
> > structured text block, maybe called "method"?  Is protocol needed,
> > or
> > can it be referecend under citation if it is a published protocol?
> > Is
> > qualityCOntrol so different from methodStep in its machine
> > processing
> > that it needs to be a separate structure?  Can't we just include
> > quality
> > control methods as a methodStep (aka method in my earlier terms)?
> >
> > Finally, in project, some of the cardinality rules don't make sense.
> > designDescription is 0 to many, and contains a child choice that is
> > 1 to
> > many, which contains an optional citation.  I think it should be:
> > designDescription is 0 to 1, anc contains a choice that is 1 to
> > many,
> > where both elements in the choice are 1 to 1.  An analogous argument
> > could be applied to studyAreaDescription.  This approach allows
> > these
> > elements to be optional, but if they are present, requires that they
> > have at least one child for content.  In general having a child of a
> > choice be optional is unneeded.
> >
> > Well, that's probably enough of a tome for now.  Thanks for your
> > excellent effort on this...
> >
> > Matt
> >
> > Peter McCartney wrote:
> >
> >
> >> 2) my proposed changes to address the LTER IM complaints about
> >> protocol/project: these involved defining a new module called
> >> eml-methods which includes elements for quality control, sampling
> >> description and methodStep(s). MethodSteps includes pointers to
> >> related
> >> sofware, datsets, and instruments and a description which any
> >> combination of text, eml-protocol, or eml-citation.
> >>
> >> methods is imported optionally into dataset, entityBase and
> >> attribute
> >> and any previous links to protocol is removed.
> >>
> >> This solution leaves eml-protocol to be used only for generic
> >> protocols
> >> that we wish to publish as resources. There is still a flaw here
> >> in that
> >> protocol also defines a methodStep elment that differs from the
> >> one in
> >> method. so the name should be changed or perhaps protocol should
> >> just
> >> import eml-method so that all it does is add the resourceBase
> >> elements.
> >>
> >>
> >>
> >
> >
> 
> --
> David E. Blankman
> Database Integration Developer
> Long Term Ecological Research Network Office
> University of New Mexico
> 801 University, SE #104
> Albuquerque, NM 87106
> (505) 272-7346 / (505) 272-7080 FAX

-- 
Tim Bergsma
LTER Information Manager
W.K. Kellogg Biological Station
Michigan State University
Hickory Corners, MI   49060
616/671-2337
tbergsma at kbs.msu.edu
http://lter.kbs.msu.edu



More information about the Eml-dev mailing list