FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004

Duane Costa dcosta at lternet.edu
Thu Aug 26 17:17:42 PDT 2004


Matt,

Thanks for looking at this. 

Yes, I'd like to work on the bug -- I think I'd learn a lot about the EML
parser in the process. Let me talk to James and Mark about it and then I'll
get back to you.

Duane

> -----Original Message-----
> From: Matt Jones [mailto:jones at nceas.ucsb.edu]
> Sent: Thursday, August 26, 2004 4:46 PM
> To: Duane Costa
> Cc: eml-dev at ecoinformatics.org; 'Corinna Gries'
> Subject: Re: FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT
> 2004
> 
> Hi Duane,
> 
> I think its a bug in the EMLParser -- it appears to be ignoring system,
> when in fact it should do as Corinna suggests make sure that all IDs
> within a system are unique.  Want to fix this bug?
> 
> Matt
> 
> Duane Costa wrote:
> > Could anyone comment as to whether the EML error reported by Metacat
> below
> > is a genuine EML error versus a bug in Metacat or the EML validator
> program?
> > The issue is whether the id value for <dataset> must be unique from the
> id
> > value for <creator>.
> >
> > Thanks,
> > Duane
> >
> > -----Original Message-----
> > From: Corinna Gries [mailto:corinna at asu.edu]
> > Sent: Thursday, August 26, 2004 3:48 PM
> > To: dcosta at lternet.edu
> > Subject: RE: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
> >
> > Hi Duane,
> >
> > I am trying to fix these problems with our eml files. Some are easy
> > because they are actual errors in our files, but there is one where I
> > wonder if the ID checking is right. I understood IDs should be unique
> > within the system, that is for example:
> >
> > <dataset id="30" system="ces_dataset"> ... Is different from
> > <creator id="30" system="ces_party"> ....
> >
> > However, your harvester complains that they are the same:
> >
> > ************************************************************************
> > *****
> > *
> > * METACAT HARVESTER REPORT: Wed Aug 25 11:00:36 MDT 2004
> > *
> > * A TOTAL OF 22 ERRORS WERE DETECTED.
> > * Please see the log entries below for additonal details.
> > *
> > ************************************************************************
> > *****
> > ************************************************************************
> > *****
> > *
> > * harvestLogID:         5549
> > * harvestDate:          Wed Aug 25 11:00:36 MDT 2004
> > * status:               1
> > * message:
> > * harvestOperationCode: InsertDocError
> > * description:          Error inserting EML document to Metacat
> > * detailLogID:          383
> > * errorMessage:         MetacatException: <?xml version="1.0"?>
> > <error>
> > Error running xpath expression:
> > //dateTimeDomain|//nonNumericDomain|//numericDomain|//access|//attribute
> > List|//constraint|//coverage|//temporalCoverage|//geographicCoverage|//t
> > axonomicCoverage|/dataset|/eml/dataset|//dataSource|//dataTable|//otherE
> > ntity|//citation|//address|//conferenceLocation|//party|//originator|//c
> > reator|//contact|//publisher|//editor|//recipient|//performer|//institut
> > ion|//metadataProvider|//associatedParty|//personnel|//physical|//connec
> > tionDefinition|//distribution|//researchProject|//project|//relatedProje
> > ct|//software|//spatialRaster|//spatialReference|//spatialVector|//store
> > dProcedure|//view|//protocol|//additionalMetadata : Error in xml
> > document.  This EML document is not valid because the id 30 occurs more
> > than once.  IDs must be unique. </error>
> >
> > * scope:                ces_dataset
> > * identifier:           30
> > * revision:             1
> > * documentType:         eml://ecoinformatics.org/eml-2.0.0
> > * documentURL:
> > http://seinet.asu.edu/DataCatalog/getXanthoriaRecord.jsp?source=ces_data
> > set_mohave&id=30
> > *
> > ************************************************************************
> > *****
> >
> > What do you think?
> >
> > Corinna
> >
> > _______________________________________________
> > eml-dev mailing list
> > eml-dev at ecoinformatics.org
> > http://www.ecoinformatics.org/mailman/listinfo/eml-dev
> 
> --
> -------------------------------------------------------------------
> Matt Jones                                     jones at nceas.ucsb.edu
> http://www.nceas.ucsb.edu/    Fax: 425-920-2439    Ph: 907-789-0496
> National Center for Ecological Analysis and Synthesis (NCEAS)
> University of California Santa Barbara
> Interested in ecological informatics? http://www.ecoinformatics.org
> -------------------------------------------------------------------




More information about the Eml-dev mailing list