EML mapping question.

Peter McCartney peter.mccartney at asu.edu
Tue Nov 19 09:33:55 PST 2002


Hmmm...it was my recollection that there was references were contained under
project which would have been the appropariate place to associate a
reference that described the project that produced the datsaet. We never had
anything for the purpose of citing references that made use of a previously
published dataset.  Maintining a citation index for inverse lookups is a lot
of work - we recently decided not to attempted it at CAP, hoping instead
that users would learn to provide bibliographic citations of datsets the
same way they cite literature now. one way to encourage that is to include
in our displays of metadata a "proper citation" guide. eg - "cite this
dataset as - "Hayden, B. and J. Porter, 2002, Small Mammal Body Weight for
VCR. VCR LTER, http://vcr.edu/datacatalog?id=23." 

I am suprised at the lack of a citation element directly under project, but
it just seems to me to be yet again an example that EML is NOT well suited
as a complete information management structure. Its a format for describing
resources. with an independent database to manage the interrlationships, its
possible for me to build a web display that shows related literature - the
info just isnt built into EML. 

The triple structure was far superior for managing this information and
extending it. Thats why I proposed in sevilleta to adopt a hybrid approach
where triples were used to manage data within an archive and a set of rules
existed for merging files together into a single, validatable xml file for
query and exchange. Thats how our sql management system is organized now. 

Peter McCartney (peter.mccartney at asu.edu)
Center for Environmental Studies
Arizona State University
480-965-6791 

> -----Original Message-----
> From: James W Brunt [mailto:jbrunt at lternet.edu]
> Sent: Tuesday, November 19, 2002 10:06 AM
> To: Christopher Jones
> Cc: David Blankman; Eml-Dev (E-mail)
> Subject: Re: EML mapping question.
> 
> 
> It seems that some information is getting lost with this 
> solution - in 
> the absence of a triple to define the relationship between 
> the citation 
> and the dataset I don't see how the information "this 
> citation is for a 
> paper that cites or contains analysis of these data" is recorded. 
> Enlighten me.
> 
> James
> 
> Christopher Jones wrote:
> 
> > David,
> >
> > Here's my take, and I welcome any corrections if I've done any poor 
> > interpreting of EML constructs...
> >
> > In short, you need eml-literature for your citation.
> >
> > Here's the babbling answer:
> >
> > Both eml-dataset and eml-literature are considered top 
> level resources 
> > (as are eml-software and eml-protocol).  They extend the 
> eml-resource 
> > schema by providing additional content that is dataset or 
> literature 
> > (citation) specific.
> >
> > In packages marked up with EML2.0.0beta6 and earlier, a literature 
> > document was "related" to a dataset document through the <triple> 
> > construct.  However, with further development of EML, the community 
> > has decided to replace the RDF-like triples by adopting the 
> XMLSchema 
> > approach to relating XML trees to each other by  referencing 'id' 
> > attributes within a *single document* that uniquely 
> identify a tree of 
> > elements.  This is  now done through the <additionalMetadata> tag, 
> > with it's <describes> child tag.
> >
> > To mark up a single EML package (i.e. <eml> ...metadata and/or data 
> > ... </eml> that includes both a <dataset> tree and a 
> <citation> tree, 
> > it would look something like the following (notice the id's and the 
> > namespace declarations, they are critical for validation):
> >
> > <?xml version="1.0"?>
> > <eml:eml
> >    packageId="eml.1.1" system="knb"
> >    xmlns:eml="eml://ecoinformatics.org/eml-2.0.0rc3"
> >    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> >    xmlns:ds="eml://ecoinformatics.org/dataset-2.0.0rc3"
> >    xmlns:stmml="http://www.xml-cml.org/schema/stmml"
> >    
> xsi:schemaLocation="eml://ecoinformatics.org/eml-2.0.0rc3 eml.xsd">
> >
> > <dataset id="1555">
> >
> >  <!-- all the required eml-dataset elements go in here ... -->
> >
> > </dataset>
> > <additionalMetadata>
> >  <describes>
> >    1555
> >  </describes>
> >  <cit:citation id="345" scope="document"
> >            xmlns:cit="eml://ecoinformatics.org/literature-2.0.0rc3"
> >            xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> >            
> > xsi:schemaLocation="eml://ecoinformatics.org/literature-2.0.0rc3
> >            eml-literature.xsd">
> >
> >    <!-- all the required eml-literature elements go in here ... -->
> >
> >  </cit:citation>
> > </additionalMetadata>
> > </eml:eml>
> >
> > The semantic nature of the relationship between the two trees isn't 
> > explicitly stated, as was the intention of the whole <triple> 
> > approach, but I think that it can be inferred by the namespace 
> > declaration for the <citation> tag.  This lets processing 
> systems know 
> > to treat the tree as a citation, and render it as such in 
> some GUI app.
> >
> > I've checked a sample document into CVS (in the eml/test directory) 
> > called eml-datasetWithCitation.xml that is a validating 
> sample of what 
> > I think you may be asking for here.
> >
> > Cheers,
> >
> > Chris
> >
> > David Blankman wrote:
> >
> >> Below is a  metadata document from the McMurdo LTER site. In the 
> >> document they have a section called "CITATION".
> >> * CITATIONS: *Spaulding, Sarah A, Diane M. McKnight, 
> Richard L. Smith 
> >> and Richard Dufford. 1994. Phytoplankton population dynamics in 
> >> perennially ice-covered Lake Fryxell, Antarctica. Journal 
> of Plankton 
> >> Research. Vol.16 no.5 pp.527-541.
> >>
> > _________________________________________________________________
> > christopher jones     cjones at lifesci.ucsb.edu      (805) 893-5144
> > marine science institute  university of california, santa barbara
> > _________________________________________________________________
> >
> >
> > _______________________________________________
> > eml-dev mailing list
> > eml-dev at ecoinformatics.org
> > http://www.ecoinformatics.org/mailman/listinfo/eml-dev
> 
> 
> -- 
> James W. Brunt
> Associate Director for Information Management
> Long Term Ecological Research Network Office
> Department of Biology
> University of New Mexico
> Albuquerque, NM 87131-1091
> 505 272 7085
> jbrunt at lternet.edu
> 
> 
> _______________________________________________
> eml-dev mailing list
> eml-dev at ecoinformatics.org
> http://www.ecoinformatics.org/mailman/listinfo/eml-dev
> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mercury.nceas.ucsb.edu/ecoinformatics/pipermail/eml-dev/attachments/20021119/27339dac/attachment.htm


More information about the Eml-dev mailing list