[eml-dev] [Re: Proposed revision to eml-literature module...]

Margaret O'Brien mob at icess.ucsb.edu
Mon Sep 12 14:26:04 PDT 2005


Sorry to be so late in chiming in. I was on vacation during the second 
half of August, and, since, have been mostly engaged in various local 
emergencies. But one advantage is that I have gotten to read the entire 
thread at once.  Getting the historical perspective has definitely been 
educational, particularly from my POV as an implementer.  BTW, thanks 
for the approval to join this group (I'm particularly excited about the 
salary and benefits that James mentioned).

It seems that generally, Matt, James and Peter agree that creating a 
bibliography schema that imports eml-literature is an acceptable 
solution for maintaining lists of citations, and that a bibliography 
could be considered a resource of a project, individual, or group, so 
EML can be used as it's container.  At first I thought that this was a 
fairly simple addition, but now I see that it carries some implications 
for list-making that could be more precedent-setting.  When I first took 
a stab at moving a bibliography into EML, the choice between handling a) 
hundreds of xml docs plus a results-set or b) a single doc seemed pretty 
clear.  Up until last week, it had been my experience that displaying 
one document from metacat (regardless of length) was much faster than 
waiting for a results-set. However, Chris Jones has installed metacat 
1.6?rc in Santa Barbara, and it's at least10x faster at returning query 
results, so that argument is gone. Eventually, we'll have a programmatic 
method of managing the bibliography, and it wont really matter whether 
there is one document produced or hundreds. Also in the long run, it may 
be more difficult to filter a long xml document based on content, that 
it is to query separate documents and get back a resultset.

So now, I'm starting to think that a more generic <eml-list> that could 
contain any number of any of the basic modules could serve several 
purposes -- you could create a bibliography (either all or a subset), or 
use it to group datasets, protocols and the resulting papers. Possibly, 
using either a harvest-list or resultset accomplishes this already, but 
these both seem to have been designed for other purposes.

This brings up a question I've had about eml-dev's intended use of the 
citation module: Was it intended that the including a citation tree 
within a dataset be used as a mechanism for attaching the appropriate 
datasets to a published paper? Datasets are allowed to contain the 
citation tree in several places, but not at a very high-level.  Some 
journals are beginning to ask that their authors also publish datasets. 
It's possible that the idea of using eml and metacat for this could be 
way off-base -- maybe this isnt the right place for linking a dataset 
and a paper, or maybe it's the responsibility of the publisher, not the 
lter-site.

I've played with one way of attaching datasets to publications (from 
within the citation).  Also, there were some other items in Mark's 
schema that need some feedback from this group. I'd rather bring these 
up in a second email, since this one is already long.  And possibly, 
these are insignigicant in light of the higher level list issue and 
should be put off.

Margaret






More information about the Eml-dev mailing list