[eml-dev] EML 2.0.2 changes to text leaf nodes

Christopher Jones cjones at msi.ucsb.edu
Fri Mar 21 07:49:16 PDT 2008


Margaret,

About searching txt:TextType nodes:

On Mar 20, 2008, at Mar20---6:15:43 PM, Margaret O'Brien wrote:
> First, one comment on Chris's original question about drawbacks to  
> mixed content instead of xs:string: I recall one possible drawback  
> in searches. If someone searches on the string "uptake rates for  
> Alnus tenuifolia" the search will fail if <title> is text instead of  
> a string because of the embedded tags. One solution is an optional  
> <complexTitle> which is txt:TextType, in addition to <title>. The  
> search on the simple <title> returns the correct doc, and the title  
> template needs to look for the complexTitle first. Apologies for not  
> remembering who to credit for this - I think it was a verbal exchange.


To me, this seems like a database implementation detail, rather than a  
schema issue.  For instance, metacat models the DOM, and so places  
child nodes as unique records from their parents.  Other XML databases  
do this differently.  I think it is the onus of the database to  
correctly interpret data typing, and search appropriately (or at least  
create indices based on data types so that they are searched correctly).

Cheers,
Chris
_________________________________________________________________
christopher jones       cjones at msi.ucsb.edu      (805) 680-5946
marine science institute  university of california, santa barbara
_________________________________________________________________






More information about the Eml-dev mailing list