simple biogeography

Download Report

Transcript simple biogeography

Extending the
biogeographical model
Africamuseum
6 (7?) June 2013
Workplan
 Continue with database constructed so far
 Add fields for


Authority (name of the author, date, parentheses)
Classification



Fields for different ranks
Synonymy
Sources
Fields for classification
 Add fields to ‘species’ table for the main ranks:





Kingdom
Phylum (=division)
Class
Order
Family
 Alternative: separate table

Split taxonomy in genus/species, and family and above
 Problem: not properly normalised, not good solution
for long-term management of taxonomic register
Add field for authority
 Add field for authority in the species
table


One text field, not structured further
Too complex to be split up in separate fields
 Optionally, add field for reference to
original description

Ideally as a link to the sources table
Add table sources
 Fields




ID (=primary key, autonumber)
Author
Year
Reference
Relations with sources table
 Add ‘SourcesID’ link in Species table
 Add ‘SourcesID’ link in Distribution table
 Create relations between


Sources and Species table
Sources and distribution table
Add synonymy
 ‘Self-referential’ table


Record in table refers to other record in same table
Needs status of the taxon described in the record
 Create new fields:


AcceptedID
TaxonStatus (optionally link to separate table)
 Create relation:

AcceptedID link to ID of taxon table
Relations diagram - synonyms
One-to-many relationships
 E.g. species in family: each species
belongs to one, and only one family
 Implement by listing the Primary Key of
the ‘one’ side as a field in the table of
the ‘many’ side
 ‘Foreign Key’ points at the right record in
the ‘one’ side of the relationship
One-to-many relationships
(Many) species in
(one) family
(Many) places/localities
in (one) country
Many-to-many relationships?
 Problem: we don’t have a ‘one’ side,
there is no single record to refer to
 Can not be implemented directly in SQL
 Needs extra table


To split up many-to-many in two one-tomany relations
Extra table takes pointers to both sides of
the relationship
Many-to-many relationship
 Example: ‘sources’ versus ‘species’

Source = literature, report, database…
 One source refers to several species
 A species is referred to in many sources
 Create many-to-many table

Split the one many-to-many relationship in
two one-to-many relationships
Many-to-many relationship
Relations diagram - complete
Present structure:
Philosophy
 Structure has to be as simple as possible
 But not any simpler!!
 Present structure is good short-term solution,
but not long-term

Needs better alternative to represent classification
and hierarchy
Hierarchy: flat table
 Every rank in the hierarchy is
represented by a field in the table
 Simplest solution


Easy to create
Easy to query
Hierarchy: flat table
Hierarchy: flat table
 Problems

not normalised!
Not a real problem if a quick-and-dirty solution is
all that is needed
 Difficult to maintain hierarchy in the long term
 ‘Standard’ problems with non-normalised
database

 Possible conflicting information, inefficient storage…

Cfr MASDEA; too simple
Hierarchy: normalised tables
 Every rank is represented by a separate
table
 Not very difficult to write a query to
regenerate flat table
 Every taxon can have additional
information

Extra fields with description…
Hierarchy: cascading tables
Hierarchy: normalised tables
 Avantages


Easy to maintain and query
Normailised, possible to add information at any level of the
hierarchy
 Drawbacks

Ranks are hard-wired on the structure of the database




New rank would require change of the structure of the database
And probably of the user interface, web interface…
Number of tables
Lot of functionality duplicated
Taxonomic reality
 Ranks used depend on the taxonomic group


Botany: mainly infra-specific; zoology: mainly on
higher levels
Many of the ranks are only sparsely used
 Needs for a more flexible system
 Much of the functionality is the same across all
ranks


‘parent’, synonymy
Authority, description…
‘Open Hierarchy’
 Possible to define new ranks without having to
rewrite the structure of the database
 All taxonomic names are stored in a field in a
single table; other fields indicate parent and
rank
 Many-to-one relation: a single parent, several
descendants

Include ID of parent in the record of the
descendant
Open Hierarchy
 Avantages


Completely normalised
Flexible
 Drawbacks

Difficult to query classification


Queries of the type ‘all species of the Echinodermata’…
Solution:



Recursive query
‘calculated field’
program
Synonymy
 Every taxon can have several synonyms;
in principle, only one valid name for any
synonym

Many-to-one relation: one valid name, many
synonymous names
Include ID of the valid name in the record of the
synonymous name
 Other fields for the type of synonymy…

Rest of the taxonomic model
 Ranks should be in a separate table


Information on the level of the rank can be
added
Possibility of extra quality control
Rank of a parent as compared to rank of
descendants
 Rank of siblings should be same

Combine with rest
 Previous work on geography and sources