SDD document: Choice of appropriate term for inferring information on object or class trees

TDWG working group: Structure of Descriptive Data (SDD)

Introduction

An important consideration in descriptive data is that data in the object or class tree (specimen, subspecies, species, genus) are related for evolutionary as well as for logical reasons (a genus is defined as a set of species). The process of combining object descriptions to produce a parent description can be manual, as it is common in genus and family descriptions, but it can also be performed by a machine (computer program). The latter may be a dynamic process (as far as I know to be implemented in BioLink) or may generate a static copy of data (as it is done in the CSIRO DELTA programs and in DeltaAccess).

In the discussion of some decisions about the proposed general structure of character and state information we need to refer to this process. Which term shall we use?

Current usage:

In the SDD minutes and other documents so far I have largely adopted the term "collation" with a sprinkle of my old "summarize". Bob Morris thinks that "collation" is inappropriate, meaning "put in order". After looking the term up in a dictionary, I tend to agree. Bob proposes "aggregation".

Available terms

Quotes from Collins English Dictionary:

Quotes from Oxford English Dictionary (OED Online):

Comments

Gregor Hagedorn:

Bryan Heidorn:

Martin Grube:

Bob Morris:

Kevin Thiele:

Guillaume Sauvenay:

Guillaume Rousse:

Ben Moretti:

Gregor Hagedorn:

Peter Rauch:

Steve Shattuck:

Discussion in Lisbon (TDWG 2003)

The issue was taken up at the meeting in Lisbon. The following proposals were added to the list above:

Martin Pullan:

Bob Morris:

Nico Franz:

Conclusion

A vote which terms are preferred was taken in Lisbon and resulted in the following terminology:

Inference is the preferred head term to refer to the following processes:

As a minority vote Gregor Hagedorn maintained that inferring information from specimen to species is not structurally different than inferring information from species to genus. Consequently the same term should be used. A higher taxon description results from the knowledge about which lower classes are contained and is no independent information or derived at from external principles. Although during the identification process the higher taxon description may operationally be treated as an independent logical expression to determine whether a lower taxon is a member of a higher taxon (e. g. in a key in a Flora), it is in fact not independent. If phylogenetic analysis shows that a genus so far only known to contain leaf-bearing plants also contains a species in which leaves have been reduced, the conclusion is that the genus description must be changed, not that the species belongs to a different taxon. This is fundamentally different from ontological statements or definitions for languages, where if an object does not belong to the agreed definition of, e. g. a chair, than it simply is not a chair.

Request for discussion

Please send your criticism or suggestions to the SDD mailing list or to any of the authors.

Gregor Hagedorn; Vers. 3; 29. October 2003



Return to the SDD starting page.

First published 2003-03-11, last update: 2003-10-29.

Valid XHTML 1.0! Valid CSS1! Viewable With Any Browser