[SEEK-Taxon] some concerns about the mammal data uploaded by Trevor
xianhual at email.unc.edu
Mon Feb 21 07:39:10 PST 2005
Hi:
I went through the mammal data Trevor created, and I thank him for the great
job he did before moving. I found a few things I am not sure about.
First, different pages of the same publication have been treated as
different publications. For example:
<Publication id="MSW_PUB5473"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:396.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5474"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:397.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5432"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:399.</PublicationSimple> </Publication>
I wonder whether it would be better to treat them as one publication, 'Ann.
Mag. Nat. Hist., ser. 8, 10', with different page microreferences (396, 397
and 399 respectively). This may be closer to how TCS was designed to be used.
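As a rough sketch of this idea (not a tested import routine), the page number
could be split off the citation string and the entries grouped by what is
left. The function names are hypothetical, and the sketch assumes the page
reference is whatever follows the last colon and consists only of digits;
citations that do not fit that pattern, such as ones ending in a bracketed
year, are returned unchanged with no microreference.

```python
import re
from collections import defaultdict

# Assumption: a citation ends in ":<pages>" with an optional trailing period.
_PAGE_PATTERN = re.compile(r"^(?P<pub>.*):(?P<pages>\d+(?:-\d+)?)\.?$")


def split_microreference(citation):
    """Return (publication, pages); pages is None if no page part is found."""
    # Collapse line breaks and repeated spaces left over from the XML layout.
    text = " ".join(citation.split())
    match = _PAGE_PATTERN.match(text)
    if match is None:
        return text, None
    return match.group("pub"), match.group("pages")


def group_by_publication(citations):
    """Map each publication string to the list of its page microreferences."""
    groups = defaultdict(list)
    for citation in citations:
        publication, pages = split_microreference(citation)
        groups[publication].append(pages)
    return dict(groups)


citations = [
    "Ann. Mag. Nat. Hist., ser.\n8, 10:396.",
    "Ann. Mag. Nat. Hist., ser.\n8, 10:397.",
    "Ann. Mag. Nat. Hist., ser.\n8, 10:399.",
]
grouped = group_by_publication(citations)
```

With the three entries above, `grouped` holds a single publication key with
the three page numbers attached to it.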
Additionally, there are some duplicate entries among the publications. For
example:
example 1:
<Publication id="MSW_PUB5479"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5480"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
example 2:
<Publication id="MSW_PUB5481"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
<Publication id="MSW_PUB5482"> <PublicationSimple>David, Nouv. Arch. Mus.
Hist. Nat. Paris, Bull. for 1871, 7(4):92 [1872].</PublicationSimple>
</Publication>
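Duplicates like these could be listed automatically before the import. The
following is only a minimal sketch: the function name and the <Publications>
wrapper element are hypothetical, and it assumes the elements carry no XML
namespace, which may not hold for the real TCS file. It groups the ids of
Publication elements whose PublicationSimple text is identical after
whitespace is normalized.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def find_duplicate_publications(xml_text):
    """Return {citation text: [ids]} for citations that occur more than once."""
    root = ET.fromstring(xml_text)
    ids_by_text = defaultdict(list)
    for pub in root.iter("Publication"):
        simple = pub.findtext("PublicationSimple", default="")
        # Normalize whitespace so line wrapping does not hide a duplicate.
        key = " ".join(simple.split())
        ids_by_text[key].append(pub.get("id"))
    return {text: ids for text, ids in ids_by_text.items() if len(ids) > 1}


# Hypothetical wrapper element around entries copied from the data.
SAMPLE = """<Publications>
<Publication id="MSW_PUB5479"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5480"> <PublicationSimple>Ann. Sci. Nat. Zool.
(Paris), ser. 5, 7:375.</PublicationSimple> </Publication>
<Publication id="MSW_PUB5473"> <PublicationSimple>Ann. Mag. Nat. Hist., ser.
8, 10:396.</PublicationSimple> </Publication>
</Publications>"""

duplicates = find_duplicate_publications(SAMPLE)
for text, ids in duplicates.items():
    print(text, "->", ", ".join(ids))
```

Only exact matches are reported; citations that differ in spelling or
punctuation would need a looser comparison.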
Before we import the data into the SEEK database for further processing, we
should re-check the data and make it as clean as possible.
Xianhua