FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
Matt Jones
jones at nceas.ucsb.edu
Thu Aug 26 16:45:54 PDT 2004
Hi Duane,
I think it's a bug in the EMLParser -- it appears to be ignoring the system
attribute, when in fact it should do as Corinna suggests and make sure that all
IDs within a system are unique. Want to fix this bug?
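
To make the intended rule concrete, here is a minimal sketch of a uniqueness check keyed on the (system, id) pair rather than on id alone. It is not the EMLParser code; the function name duplicate_ids and the choice to treat a missing system attribute as an empty string are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def duplicate_ids(eml_text):
    """Return the (system, id) pairs that occur more than once.

    Hypothetical helper: elements without a system attribute are
    grouped under the empty string, which is an assumption here.
    """
    root = ET.fromstring(eml_text)
    counts = Counter(
        (elem.get("system", ""), elem.get("id"))
        for elem in root.iter()
        if elem.get("id") is not None
    )
    return sorted(pair for pair, n in counts.items() if n > 1)


# Corinna's case plus one deliberate clash inside ces_party.
sample = """<eml>
  <dataset id="30" system="ces_dataset">
    <creator id="30" system="ces_party"/>
    <contact id="30" system="ces_party"/>
  </dataset>
</eml>"""

print(duplicate_ids(sample))  # [('ces_party', '30')]
```

With this rule the dataset and creator in Corinna's example do not collide, because they sit in different systems; only the two ces_party elements sharing id 30 are reported.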
Matt
Duane Costa wrote:
> Could anyone comment as to whether the EML error reported by Metacat below
> is a genuine EML error versus a bug in Metacat or the EML validator program?
> The issue is whether the id value for <dataset> must be unique from the id
> value for <creator>.
>
> Thanks,
> Duane
>
> -----Original Message-----
> From: Corinna Gries [mailto:corinna at asu.edu]
> Sent: Thursday, August 26, 2004 3:48 PM
> To: dcosta at lternet.edu
> Subject: RE: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
>
> Hi Duane,
>
> I am trying to fix these problems with our eml files. Some are easy
> because they are actual errors in our files, but there is one where I
> wonder if the ID checking is right. I understood IDs should be unique
> within the system, that is for example:
>
> <dataset id="30" system="ces_dataset"> ... Is different from
> <creator id="30" system="ces_party"> ....
>
> However, your harvester complains that they are the same:
>
> ************************************************************************
> *****
> *
> * METACAT HARVESTER REPORT: Wed Aug 25 11:00:36 MDT 2004
> *
> * A TOTAL OF 22 ERRORS WERE DETECTED.
> * Please see the log entries below for additonal details.
> *
> ************************************************************************
> *****
> ************************************************************************
> *****
> *
> * harvestLogID: 5549
> * harvestDate: Wed Aug 25 11:00:36 MDT 2004
> * status: 1
> * message:
> * harvestOperationCode: InsertDocError
> * description: Error inserting EML document to Metacat
> * detailLogID: 383
> * errorMessage: MetacatException: <?xml version="1.0"?>
> <error>
> Error running xpath expression:
> //dateTimeDomain|//nonNumericDomain|//numericDomain|//access|//attribute
> List|//constraint|//coverage|//temporalCoverage|//geographicCoverage|//t
> axonomicCoverage|/dataset|/eml/dataset|//dataSource|//dataTable|//otherE
> ntity|//citation|//address|//conferenceLocation|//party|//originator|//c
> reator|//contact|//publisher|//editor|//recipient|//performer|//institut
> ion|//metadataProvider|//associatedParty|//personnel|//physical|//connec
> tionDefinition|//distribution|//researchProject|//project|//relatedProje
> ct|//software|//spatialRaster|//spatialReference|//spatialVector|//store
> dProcedure|//view|//protocol|//additionalMetadata : Error in xml
> document. This EML document is not valid because the id 30 occurs more
> than once. IDs must be unique. </error>
>
> * scope: ces_dataset
> * identifier: 30
> * revision: 1
> * documentType: eml://ecoinformatics.org/eml-2.0.0
> * documentURL:
> http://seinet.asu.edu/DataCatalog/getXanthoriaRecord.jsp?source=ces_data
> set_mohave&id=30
> *
> ************************************************************************
> *****
>
> What do you think?
>
> Corinna
>
> _______________________________________________
> eml-dev mailing list
> eml-dev at ecoinformatics.org
> http://www.ecoinformatics.org/mailman/listinfo/eml-dev
--
-------------------------------------------------------------------
Matt Jones jones at nceas.ucsb.edu
http://www.nceas.ucsb.edu/ Fax: 425-920-2439 Ph: 907-789-0496
National Center for Ecological Analysis and Synthesis (NCEAS)
University of California Santa Barbara
Interested in ecological informatics? http://www.ecoinformatics.org
-------------------------------------------------------------------