[SEEK-Taxon] some concerns about the mammal data uploaded by Trevor

xianhual@email.unc.edu xianhual at email.unc.edu
Mon Feb 21 07:39:10 PST 2005


Hi:

I went through the mammal data Trevor created, and I thank him for the great
job he did before moving. I found a few things I am not sure about.

First, different pages of the same publication have been treated as
different publications. For example:

<Publication id="MSW_PUB5473"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:396.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5474"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:397.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5432"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:399.</PublicationSimple> </Publication>


I wonder whether it would be better to treat them as one publication, 'Ann.
Mag. Nat. Hist., ser. 8, 10', with different page microreferences: 396, 397
and 399 respectively. This may be closer to the way TCS was designed to be used.
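
A rough sketch of how such a split could be done in Python, assuming the page
always follows the last colon at the end of the citation string. The function
names and the pattern are only illustrations, not part of TCS or of any SEEK
tool, and citations that do not fit the pattern are left whole:

```python
import re
from collections import defaultdict

# Assumed pattern (hypothetical): text before the last colon is the
# publication, the trailing digits are the page microreference.
_PAGE_RE = re.compile(r"^(?P<pub>.+):(?P<page>\d+)\.?$")


def split_microreference(citation):
    """Return (publication, page); page is None if the pattern does not fit."""
    text = " ".join(citation.split())  # collapse line breaks and extra spaces
    match = _PAGE_RE.match(text)
    if match is None:
        return text, None
    return match.group("pub"), match.group("page")


def group_by_publication(records):
    """Map each publication string to its list of (original id, page) pairs."""
    grouped = defaultdict(list)
    for pub_id, citation in records:
        pub, page = split_microreference(citation)
        grouped[pub].append((pub_id, page))
    return dict(grouped)


records = [
    ("MSW_PUB5473", "Ann. Mag. Nat. Hist., ser.\n8, 10:396."),
    ("MSW_PUB5474", "Ann. Mag. Nat. Hist., ser.\n8, 10:397."),
    ("MSW_PUB5432", "Ann. Mag. Nat. Hist., ser.\n8, 10:399."),
]
grouped = group_by_publication(records)
for pub, refs in grouped.items():
    print(pub, refs)
```

With the three records above, all ids end up under the single publication
'Ann. Mag. Nat. Hist., ser. 8, 10'. A citation with a trailing year, such as
'7(4):92 [1872].', does not match this simple pattern and would need its own
rule.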

Additionally, there are some duplicate publications. For example:

example 1:

<Publication id="MSW_PUB5479"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5480"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>

example 2:

<Publication id="MSW_PUB5481"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
<Publication id="MSW_PUB5482"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
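
Duplicates like these could probably be listed automatically. Below is a
minimal sketch in Python, assuming the Publication elements carry no XML
namespace and sit under a single root element (the 'Publications' wrapper is
made up for this example; the real file layout may differ):

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Sample data copied from the examples above; the <Publications> root
# element is a hypothetical wrapper added so the snippet parses on its own.
SAMPLE = """<Publications>
<Publication id="MSW_PUB5479"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5480"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5481"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
<Publication id="MSW_PUB5482"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
</Publications>"""


def find_duplicate_publications(xml_text):
    """Return {citation text: [ids]} for citations used by more than one id."""
    root = ET.fromstring(xml_text)
    ids_by_citation = defaultdict(list)
    for pub in root.iter("Publication"):
        simple = pub.find("PublicationSimple")
        if simple is None or simple.text is None:
            continue  # skip records without a simple citation string
        # Collapse line breaks and repeated spaces before comparing.
        key = " ".join(simple.text.split())
        ids_by_citation[key].append(pub.get("id"))
    return {c: ids for c, ids in ids_by_citation.items() if len(ids) > 1}


duplicates = find_duplicate_publications(SAMPLE)
for citation, ids in duplicates.items():
    print(ids, citation)
```

This only catches citations that are identical after whitespace is
normalized; near-duplicates with different punctuation or abbreviations would
still need a manual check.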


Before we import the data into the SEEK database for further processing, we
should re-check the data and make it as clean as possible.


Xianhua


