[SEEK-Taxon] FW: [TAXACOM] Text Extraction Again (from Taxonomic e-text)

Beach, James H beach at ku.edu
Sat Jan 24 11:31:25 PST 2004


FYI, an interesting service from the Ubio project which finds
taxonomic names on web pages and then makes HTML links to their taxon
database. 


 
--------------------------------
James H. Beach
Biodiversity Research Center
University of Kansas
1345 Jayhawk Boulevard
Lawrence, KS 66045, USA
Tel: 785 864-4645, Fax: 785 864-5335
Televideocon: (H.323): 129.237.201.102


-----Original Message-----
From: David J Patterson [mailto:paddy at mail.usyd.edu.au] 
Sent: 23 January, 2004 7:40 PM
To: Beach, James H
Subject: Re: [TAXACOM] Text Extraction Again (from Taxonomic e-text)

Jim

I am not sure how close what we have been doing comes to meeting some or
all of your needs:

http://129.78.177.112/baypaul/microscope/talks/frontonia/frontonia_left.htm

Click on the upper left corner to open up a demo web page, select the
URL, go back to the original page, click on linkin, and insert the URL of
the web page into the linkin URL cell.  This will not only mark up the
demo page but will also produce a list of the taxa.
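
For illustration only, the general idea of marking up names and listing
the taxa found can be sketched as below. This is not the linkin code. It
is a minimal sketch that assumes plain-text input, a tiny hypothetical
name list (KNOWN_TAXA), and a placeholder lookup address (LOOKUP_URL);
a real service would match against a large taxon database and would
need to handle HTML input properly.

```python
import re
from html import escape
from urllib.parse import quote

# Hypothetical name list; a real service would query a taxon database.
KNOWN_TAXA = ["Frontonia", "Paramecium caudatum", "Paramecium"]
# Placeholder address, not a real endpoint.
LOOKUP_URL = "http://example.org/taxon?name="


def link_taxa(text, taxa=KNOWN_TAXA, base_url=LOOKUP_URL):
    """Return (marked-up text, sorted list of the taxa found in it)."""
    # Try longer names first so a binomial wins over its bare genus.
    ordered = sorted(taxa, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(name) for name in ordered) + r")\b"
    )
    found = set()

    def replace(match):
        name = match.group(1)
        found.add(name)
        # Wrap the matched name in a link to the (assumed) lookup page.
        return '<a href="{}{}">{}</a>'.format(
            base_url, quote(name), escape(name)
        )

    return pattern.sub(replace, text), sorted(found)


marked, taxa = link_taxa("Frontonia feeds near Paramecium caudatum.")
print(marked)
print(taxa)
```

Matching is exact and case-sensitive here, so misspellings,
abbreviated genera (e.g. "P. caudatum"), and names missing from the
list are not found.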

Dave Remsen has something which performs similar tasks.  I will ask him
to respond to you.

Linkin is just an illustration of the concept, and we need to take a
number of steps before it would be a really useful tool.

Hope all is well with you.

Paddy


Quoting "Beach, James H" <beach at KU.EDU>:

> Does anyone have information on recent attempts to use text extraction
> software on taxonomic e-texts and databases for the purposes of 
> extracting taxonomic names or other taxon attribute data?
> 
> I recall there was an Australian project 2-3 years ago that had some 
> success extracting names and character data for the purpose of 
> automating diagnostic key construction.
> 
> We are interested in the possibility of using data extraction 
> techniques to populate prototype taxon concept databases we are 
> building for our semantic web "SEEK" Project.
> http://seek.ecoinformatics.org.
> 
> Any pointers would be appreciated.  Many thanks,
> 
> Jim B.


--
David J. Patterson
School of Biological Sciences
University of Sydney
NSW 2006 AUSTRALIA

phone + 61 2 9351 2438
fax   + 61 2 9351 4119





