Metadata Education Project

Metadata education suggestions and materials for:

Database principles

Learning Material | Preparatory topics | Complementary topics | Vocabulary


Learning Outcomes

Conviction

Motivation

Skills

Knowledge


Preparatory topics:


Complementary topics:


Vocabulary

Vocabulary definitions

General:

Advanced


Material for this topic


Motivation: basic information about GIS databases

Two basic structures of GIS databases, using DBMS (database management systems):

Types of database systems

The hierarchical, network and relational models all try to deal with the same problem with tabular data: inability to deal with more than one type of object, or with relationships between objects. For instance, a database may need to handle information on aircraft, crew, flights and passengers - four types of records with different attributes, but with relationships between them (e.g. "is booked on" between passenger and flight). This type of organization is referred to as "database normalization". Though it has many advantages, in terms of reducing data redundancy, database normalization also increases the complexity of the database's structure. A data dictionary (which is a subtype of metadata) is critical for interpreting the structure of a normalized database for efficient use.

Types of categorical relationships

Data types associated with attributes: individual attribute records may recorded as integer, real, character, data, or text fields.

Adapted from:
NCGIA CC (original) Unit 43: Database Concepts I NCGIA CC for GIScience, Unit051: Information Organization and Data structure


Motivation: data dictionaries in metadata

A generic definition of a data dictionary is a catalogue containing information about map features or attributes. The catalog may define data file and element names, sources, accuracy, date of entry or date of update. Within different metadata content formats, the section corresponding to the data dictionary are referenced different ways:

The Entity and Attribute Information section is well-suited for relational database descriptions. The Feature Catalogue Information section is also suited for relational database descriptions, but in addition it has been developed to accommodate object-oriented database systems. Object-oriented databases and the ISO standard are dealt with in more detail in the following section, advanced database structures

Feature-level metadata

Accuracy is included as one potential component of a data dictionary. Attributes can be defined to keep track of "feature-level metadata": information specific to attributes such as when they were entered, updated, by whom, by what method, quality of method, etc. For instance, road networks often change over time as new roads are built and the data is updated. It is helpful to define attributes for keeping track of which roads were added, when they were added, how they were added, etc. Keeping track of changing features in any dynamic dataset is very helpful in maintaining data quality.
Feature-level metadata is also useful when datasets are comprised of more than one source of information, especially if the sources are of varying data quality. If any concerns about the dataset's quality are brought up in the future, the portions of the dataset whose sources are in question can be easily identified and isolated.

Embedded metadata

Embedded metadata is a form of feature-level metadata. Metadata embedded directly within the data is most frequently designed to describe the accuracy of the elements of the database or the confidance one might place in different elelements of the database.

Kennedy, M. 2000. Embedded metadata - quality control with the dot probability paradigm and ArcQC. Proceedings of the Twentieth Annual ESRI User Conference. View full paper

Components of the Entity and Attribute Information section


Example exercises to demonstrate the importance of metadata


Skills: describing entities, attributes, and attribute domains within the Entity and Attribute Information section

There are basically two different methods for describing entities, attributes, and attribute domains within the Entity and Attribute Information section. The first is a textual description, which may be entered under the Overview Description. The Detailed Description is a much more structured method of describing the data dictionary. For simple datasets with only one or two entity types and a limited number of attributes and attribute domains, the Overview section may be sufficient. For more complex databases, it is important to enter information in both the Overview and Detailed Descriptions. The Overview Description should be used as a summary, to explain the structure and relation of different entities; the Detailed Description gives specifics about each of the entity's attributes and attribute domains, including definitions and sources.

In the case of some complex databases, a data dictionary may already have been compiled in a database format. Therefore it is redundant to transfer the information over into the Detailed Description format of the FGDC's content standard. The Overview Description can be used to reference the separate data dictionary, its format, and how to access it.

Some examples of metadata for the Entity and Attribute Information section:


Knowledge: advanced database structures

Learning material for this topic is under construction

Case example: difficulties of representing a dynamic segmentation database in the metadata content standard

Object-oriented databases and the ISO standard


Back to Course Topics