EML for Mere Mortals

Chad Berkley berkley at nceas.ucsb.edu
Tue Jun 10 15:16:40 PDT 2003


Hello David,

I've been meaning to write a review of your document for a month or so 
but I've been on vacation and at various meetings.  I apologize for the 
delay...but better late than never.  I'm going to start out with what I 
see as being wrong with the document then try to give you some 
suggestions on how it should be rewritten.

If I understand correctly, this document is supposed to be a high level 
look at EML that a scientist or data manager could use to fill out EML 
fields and to understand the granularity and purpose behind those 
fields.  I have to say that "EML for Mere Mortals" as it currently 
exists falls short of this goal.

If we are truly trying to give the scientist a top level overview, then 
why is there so much discussion of XML structure and syntax?  I think 
this document could be written without mentioning the acronym "XML" a 
single time.  The sentence "...EML has been designed as a form of 
XML..." is completely wrong and I think this is the wrong thesis 
statement to base this document on.  EML is a content model that happens 
to be implemented in XML.  It could be implemented in any number of 
other markup languages or binary formats.  The content model itself (not 
the implementation) is truly what should be the center of the discussion 
in this document.  Possibly as an appendix, some implementation details 
on how to write an EML document marked up in XML would be useful, but it 
does not belong in the body of this document.

As far as your discussion of the content model goes, I do not think 
referring to it as a "Taxonomy" is at all intuitive or correct.  In 
fact, it's downright confusing because of the actual sub-module of EML 
that deals with biological taxonomy.  It took me several re-reads to 
figure out exactly what you were trying to do with the taxonomy analogy 
(and I'm still not completely sure I understand).  I just don't think 
it's leading in an easy-to-understand direction.

The premise behind the the conversations between Pat, emlMaven and Eco 
Informatics, while well intentioned, frankly made me queasy.  I think 
this device has been over used and not well executed in this document. 
The names of the characters themselves insult me. I think the purpose of 
this device (to give some real world problems and use EML to solve them) 
is a sound one, however I think it would be better to have a set of 
scenarios maybe with multiple implementations and multiple outcomes than 
to have three imaginary characters with less-than-normal names 
conversing.  The "Metalog" device also fails in the same respect.

What I would like to see come out of this document is a generalized map 
of the content model cross-linked to different types of ecological 
research projects. It could have sections like "Spatially referenced 
time series research project" and show what parts of EML would be used 
to correctly document this type of project.  There may be some overlap 
in the sections, but I think that is ok because it will show how many 
parts of EML are generalized for multiple purposes.  I don't think using 
the XML Spy images of the schema is very useful for someone not used to 
it.  It may be painful for you, but I think that some other program 
(Illustrator or the like) should be used to create a nice form view of 
the EML document that is being created for the given scenario.  Ths form 
view does not have to show the entire document (or even document 
segment) in one image, it could be broken up in a visually pleasing way.

I think that the extraneous cheese should be removed from the document. 
  It is distracting to the reader and really just adds more white noise 
though which the reader must sort.  I think a softened professional tone 
should be used throughout.  Not as professional as an IBM AS-400 manual 
but not as soft as Winnie the Pooh either.  Something like the tone in 
an Oreilly book would do nicely.

In a document like this, it is extremely important not to give the 
reader the idea that he is being talked down to.  No one likes to be 
talked down to.  For this reason, I would recomend changing the title to 
something less patronizing.  The tone of the introductory paragraphs 
could also use some work in this area.

A few more tidbits....The XML tags should also be removed.  I don't 
think they are appropriate in this document.  Also, the phrase "computer 
literate" should never be used.  It is a cliche` with basically no 
meaning.  I would also like to see an acknowledgements section as many 
people have worked on EML and deserve to be recognized (NSF should 
probably also be recognized as the grantor...ask Matt or James about 
that).  Also, I don't think that ripping NSF with one-liners in an NSF 
funded grant is a super good idea.

Hope this helps.  I'm not trying to be overly-critical just calling it 
like I see it.  Let me know if you want me to clarify anything that I've 
said.

chad

-- 
-----------------------
Chad Berkley
National Center for
Ecological Analysis
and Synthesis (NCEAS)
berkley at nceas.ucsb.edu
-----------------------




More information about the Eml-dev mailing list