FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004

Matt Jones jones at nceas.ucsb.edu
Thu Aug 26 16:45:54 PDT 2004


Hi Duane,

I think its a bug in the EMLParser -- it appears to be ignoring system, 
when in fact it should do as Corinna suggests make sure that all IDs 
within a system are unique.  Want to fix this bug?

Matt

Duane Costa wrote:
> Could anyone comment as to whether the EML error reported by Metacat below
> is a genuine EML error versus a bug in Metacat or the EML validator program?
> The issue is whether the id value for <dataset> must be unique from the id
> value for <creator>.
> 
> Thanks,
> Duane
> 
> -----Original Message-----
> From: Corinna Gries [mailto:corinna at asu.edu] 
> Sent: Thursday, August 26, 2004 3:48 PM
> To: dcosta at lternet.edu
> Subject: RE: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
> 
> Hi Duane,
> 
> I am trying to fix these problems with our eml files. Some are easy
> because they are actual errors in our files, but there is one where I
> wonder if the ID checking is right. I understood IDs should be unique
> within the system, that is for example:
> 
> <dataset id="30" system="ces_dataset"> ... Is different from
> <creator id="30" system="ces_party"> ....
> 
> However, your harvester complains that they are the same:
> 
> ************************************************************************
> *****
> *
> * METACAT HARVESTER REPORT: Wed Aug 25 11:00:36 MDT 2004
> *
> * A TOTAL OF 22 ERRORS WERE DETECTED.
> * Please see the log entries below for additonal details.
> *
> ************************************************************************
> *****
> ************************************************************************
> *****
> *
> * harvestLogID:         5549
> * harvestDate:          Wed Aug 25 11:00:36 MDT 2004
> * status:               1
> * message:              
> * harvestOperationCode: InsertDocError
> * description:          Error inserting EML document to Metacat
> * detailLogID:          383
> * errorMessage:         MetacatException: <?xml version="1.0"?>
> <error>
> Error running xpath expression:
> //dateTimeDomain|//nonNumericDomain|//numericDomain|//access|//attribute
> List|//constraint|//coverage|//temporalCoverage|//geographicCoverage|//t
> axonomicCoverage|/dataset|/eml/dataset|//dataSource|//dataTable|//otherE
> ntity|//citation|//address|//conferenceLocation|//party|//originator|//c
> reator|//contact|//publisher|//editor|//recipient|//performer|//institut
> ion|//metadataProvider|//associatedParty|//personnel|//physical|//connec
> tionDefinition|//distribution|//researchProject|//project|//relatedProje
> ct|//software|//spatialRaster|//spatialReference|//spatialVector|//store
> dProcedure|//view|//protocol|//additionalMetadata : Error in xml
> document.  This EML document is not valid because the id 30 occurs more
> than once.  IDs must be unique. </error>
> 
> * scope:                ces_dataset
> * identifier:           30
> * revision:             1
> * documentType:         eml://ecoinformatics.org/eml-2.0.0
> * documentURL:
> http://seinet.asu.edu/DataCatalog/getXanthoriaRecord.jsp?source=ces_data
> set_mohave&id=30
> *
> ************************************************************************
> *****
> 
> What do you think?
> 
> Corinna
> 
> _______________________________________________
> eml-dev mailing list
> eml-dev at ecoinformatics.org
> http://www.ecoinformatics.org/mailman/listinfo/eml-dev

-- 
-------------------------------------------------------------------
Matt Jones                                     jones at nceas.ucsb.edu
http://www.nceas.ucsb.edu/    Fax: 425-920-2439    Ph: 907-789-0496
National Center for Ecological Analysis and Synthesis (NCEAS)
University of California Santa Barbara
Interested in ecological informatics? http://www.ecoinformatics.org
-------------------------------------------------------------------



More information about the Eml-dev mailing list