Metadata education suggestions and materials for:
|
Database principles |
Learning Material | Preparatory topics | Complementary topics | Vocabulary
Conviction
Motivation
Skills
General skills:
Advanced skills:
Knowledge
General:
Advanced
Two basic structures of GIS databases, using DBMS (database management systems):
Types of database systems
The hierarchical, network and relational models all try to deal with the same problem with tabular data: inability to deal with more than one type of object, or with relationships between objects. For instance, a database may need to handle information on aircraft, crew, flights and passengers - four types of records with different attributes, but with relationships between them (e.g. "is booked on" between passenger and flight). This type of organization is referred to as "database normalization". Though it has many advantages, in terms of reducing data redundancy, database normalization also increases the complexity of the database's structure. A data dictionary (which is a subtype of metadata) is critical for interpreting the structure of a normalized database for efficient use.
Types of categorical relationships
Data types associated with attributes: individual attribute records may recorded as integer, real, character, data, or text fields.
Adapted from:
NCGIA CC (original) Unit 43: Database Concepts I NCGIA CC for GIScience, Unit051: Information Organization and Data structure
A generic definition of a data dictionary is a catalogue containing information about map features or attributes. The catalog may define data file and element names, sources, accuracy, date of entry or date of update. Within different metadata content formats, the section corresponding to the data dictionary are referenced different ways:
The Entity and Attribute Information section is well-suited for relational database descriptions. The Feature Catalogue Information section is also suited for relational database descriptions, but in addition it has been developed to accommodate object-oriented database systems. Object-oriented databases and the ISO standard are dealt with in more detail in the following section, advanced database structures
Feature-level metadata
Accuracy is included as one potential component of a data dictionary. Attributes can be defined to keep track of "feature-level metadata": information specific to attributes such as when they were entered, updated, by whom, by what method, quality of method, etc. For instance, road networks often change over time as new roads are built and the data is updated. It is helpful to define attributes for keeping track of which roads were added, when they were added, how they were added, etc. Keeping track of changing features in any dynamic dataset is very helpful in maintaining data quality.
Feature-level metadata is also useful when datasets are comprised of more than one source of information, especially if the sources are of varying data quality. If any concerns about the dataset's quality are brought up in the future, the portions of the dataset whose sources are in question can be easily identified and isolated.
Embedded metadata
Embedded metadata is a form of feature-level metadata. Metadata embedded directly within the data is most frequently designed to describe the accuracy of the elements of the database or the confidance one might place in different elelements of the database.
Kennedy, M. 2000. Embedded metadata - quality control with the dot probability paradigm and ArcQC. Proceedings of the Twentieth Annual ESRI User Conference. View full paper
Components of the Entity and Attribute Information section
There are basically two different methods for describing entities, attributes, and attribute domains within the Entity and Attribute Information section. The first is a textual description, which may be entered under the Overview Description. The Detailed Description is a much more structured method of describing the data dictionary. For simple datasets with only one or two entity types and a limited number of attributes and attribute domains, the Overview section may be sufficient. For more complex databases, it is important to enter information in both the Overview and Detailed Descriptions. The Overview Description should be used as a summary, to explain the structure and relation of different entities; the Detailed Description gives specifics about each of the entity's attributes and attribute domains, including definitions and sources.
In the case of some complex databases, a data dictionary may already have been compiled in a database format. Therefore it is redundant to transfer the information over into the Detailed Description format of the FGDC's content standard. The Overview Description can be used to reference the separate data dictionary, its format, and how to access it.
Some examples of metadata for the Entity and Attribute Information section:
Learning material for this topic is under construction
Case example: difficulties of representing a dynamic segmentation database in the metadata content standard
Object-oriented databases and the ISO standard