FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
Duane Costa
dcosta at lternet.edu
Thu Aug 26 17:17:42 PDT 2004
Matt,
Thanks for looking at this.
Yes, I'd like to work on the bug -- I think I'd learn a lot about the EML
parser in the process. Let me talk to James and Mark about it and then I'll
get back to you.
Duane
> -----Original Message-----
> From: Matt Jones [mailto:jones at nceas.ucsb.edu]
> Sent: Thursday, August 26, 2004 4:46 PM
> To: Duane Costa
> Cc: eml-dev at ecoinformatics.org; 'Corinna Gries'
> Subject: Re: FW: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT
> 2004
>
> Hi Duane,
>
> I think its a bug in the EMLParser -- it appears to be ignoring system,
> when in fact it should do as Corinna suggests make sure that all IDs
> within a system are unique. Want to fix this bug?
>
> Matt
>
> Duane Costa wrote:
> > Could anyone comment as to whether the EML error reported by Metacat
> below
> > is a genuine EML error versus a bug in Metacat or the EML validator
> program?
> > The issue is whether the id value for <dataset> must be unique from the
> id
> > value for <creator>.
> >
> > Thanks,
> > Duane
> >
> > -----Original Message-----
> > From: Corinna Gries [mailto:corinna at asu.edu]
> > Sent: Thursday, August 26, 2004 3:48 PM
> > To: dcosta at lternet.edu
> > Subject: RE: Report from Metacat Harvester: Wed Aug 25 11:00:36 MDT 2004
> >
> > Hi Duane,
> >
> > I am trying to fix these problems with our eml files. Some are easy
> > because they are actual errors in our files, but there is one where I
> > wonder if the ID checking is right. I understood IDs should be unique
> > within the system, that is for example:
> >
> > <dataset id="30" system="ces_dataset"> ... Is different from
> > <creator id="30" system="ces_party"> ....
> >
> > However, your harvester complains that they are the same:
> >
> > ************************************************************************
> > *****
> > *
> > * METACAT HARVESTER REPORT: Wed Aug 25 11:00:36 MDT 2004
> > *
> > * A TOTAL OF 22 ERRORS WERE DETECTED.
> > * Please see the log entries below for additonal details.
> > *
> > ************************************************************************
> > *****
> > ************************************************************************
> > *****
> > *
> > * harvestLogID: 5549
> > * harvestDate: Wed Aug 25 11:00:36 MDT 2004
> > * status: 1
> > * message:
> > * harvestOperationCode: InsertDocError
> > * description: Error inserting EML document to Metacat
> > * detailLogID: 383
> > * errorMessage: MetacatException: <?xml version="1.0"?>
> > <error>
> > Error running xpath expression:
> > //dateTimeDomain|//nonNumericDomain|//numericDomain|//access|//attribute
> > List|//constraint|//coverage|//temporalCoverage|//geographicCoverage|//t
> > axonomicCoverage|/dataset|/eml/dataset|//dataSource|//dataTable|//otherE
> > ntity|//citation|//address|//conferenceLocation|//party|//originator|//c
> > reator|//contact|//publisher|//editor|//recipient|//performer|//institut
> > ion|//metadataProvider|//associatedParty|//personnel|//physical|//connec
> > tionDefinition|//distribution|//researchProject|//project|//relatedProje
> > ct|//software|//spatialRaster|//spatialReference|//spatialVector|//store
> > dProcedure|//view|//protocol|//additionalMetadata : Error in xml
> > document. This EML document is not valid because the id 30 occurs more
> > than once. IDs must be unique. </error>
> >
> > * scope: ces_dataset
> > * identifier: 30
> > * revision: 1
> > * documentType: eml://ecoinformatics.org/eml-2.0.0
> > * documentURL:
> > http://seinet.asu.edu/DataCatalog/getXanthoriaRecord.jsp?source=ces_data
> > set_mohave&id=30
> > *
> > ************************************************************************
> > *****
> >
> > What do you think?
> >
> > Corinna
> >
> > _______________________________________________
> > eml-dev mailing list
> > eml-dev at ecoinformatics.org
> > http://www.ecoinformatics.org/mailman/listinfo/eml-dev
>
> --
> -------------------------------------------------------------------
> Matt Jones jones at nceas.ucsb.edu
> http://www.nceas.ucsb.edu/ Fax: 425-920-2439 Ph: 907-789-0496
> National Center for Ecological Analysis and Synthesis (NCEAS)
> University of California Santa Barbara
> Interested in ecological informatics? http://www.ecoinformatics.org
> -------------------------------------------------------------------
More information about the Eml-dev
mailing list