USGS DEM

Last updated

The USGS DEM standard is a geospatial file format developed by the United States Geological Survey for storing a raster-based digital elevation model. It is an open standard, and is used throughout the world. It has been superseded by the USGS's own SDTS format but the format remains popular due to large numbers of legacy files, self-containment, relatively simple field structure and broad, mature software support.

Contents

DEM Level

A USGS DEM can be classified into one of four levels of quality. This is due to the multiple methods of data collection, and certainty in the data.

DEM levelDEM seriesProduction processComments
Level 17.5-minute
30-minute
Created by autocorrelation or manual profiling from aerial photographsMay be derived from a wide variety of sources using any of a number of USGS approved techniques and instrumentation
Level 2All seriesCreated from digital line graph (DLG) contours or equivalent, from any USGS map series up to 1:100000 scaleThe accuracy and data spacing are intended to support computer applications that analyze hypsographic features to a level of detail similar to manual interpolations of information from printed source maps
Level 37.5-minuteCreated from DLG that have vertically integrated with all categories of hypsography, hydrography, ridge line, break line, drain files, and all vertical and horizontal control networkRequires a system of logic to interpret the data
Level 4All seriesCreated from electronic (non-photogrammetric) imaging sensor systems, either active (e.g., radar or lasar) or passive (received radiant energy)The process includes all additional post-processing required to bring the DEM grid to its final form

Format Structure

The USGS DEM format is a self-contained (single file) set of ASCII-encoded (text) 1024-byte (1024 ASCII chars) blocks that fall into three record categories called A, B, and C. There is no cross-platform ambiguity since line ending control codes are not used, and all data including numbers is represented in readable text form. There is no known binary analogue of the format, although it is common practice to compress the files with gzip.

Floating-point numbers are encoded using Fortran scientific notation, so C/C++ programs need to swap the "D" exponent-indicating character with "E" when parsing (and vice versa when writing).

A record fields hold the origin, type, summary statistics and the measurement systems used by the profiles. The A record appears once as the file's header, the C record also appears once as the trailer, and multiple B records (called profiles) comprise the elevation data. A and C records each fit within one block, but a single B record typically requires multiple blocks. When such block-spanning occurs, data are shifted to start cleanly on each block boundary. A records also come in "old" and "new" flavors, because the USGS added several fields to the A record. One of the key items is the quadrangle, which is a set of four terrestrial coordinates describing the four-sided polygon enclosing the area of interest.

A Record Layout:Incomplete – please contribute—full details are in the Appendix of http://nationalmap.gov/standards/pdf/2DEM0198.PDF
Starting ColEnding ColDescription
0134Descriptive Name of the represented area
150155?
156161?
162167UTM Zone number
529534Unit of resolution of ground grid (0=radian;1=feet;2=metre;3=arc-second)
535540Unit of resolution Elevation (1=feet;2=metre)
546569Easting of the South West corner
570593Northing of the South West corner
594617Easting of the North West corner
618641Northing of the North West corner
642665Easting of the North East corner
666689Northing of the North East corner
690713Easting of the South East corner
714737Northing of the South East corner
738761Minimum elevation found in this file
762786Maximum elevation found in this file
816827Resolution per grid cell East – West
828839Resolution per grid cell North – South
858863Number of columns

B records (profiles) are a variable-length longitudinal column of raster elevations that start at a specified location. They are some multiple of 1024 bytes long and contain a small header summarizing the profile. The elevations are contiguous; breaks or other discontinuities are expressed using "void" elevations of value -32767. Each elevation is described as a six-character readable integer occupying a fixed location in a block. The profile header only appears in the first block, so subsequent blocks hold more elevation values. When reading the DEM file from first byte to last, one reads the profiles as columns from west to east. The elevations within a profile run from south to north. The variable-location and variable-length nature of profiles stems mainly from the use of the UTM (Universal Transverse Mercator) ground reference system. Since measurements within UTM employ fixed distances (e.g., 30 meters between elevation samples), the quadrangle must slightly distort to map such locations onto the spherical Earth. This distortion usually manifests as a rotated square, hence the elevation columns near the east and west edges start more northward and contain fewer samples.

C records contain root-mean squared error (RMSE) quality control data, using ten six-character integer fields.

Related Research Articles

The Au file format is a simple audio file format introduced by Sun Microsystems. The format was common on NeXT systems and on early Web pages. Originally it was headerless, being simply 8-bit μ-law-encoded data at an 8000 Hz sample rate. Hardware from other vendors often used sample rates as high as 8192 Hz, often integer multiples of video clock signal frequencies. Newer files have a header that consists of six unsigned 32-bit words, an optional information chunk and then the data.

Multipurpose Internet Mail Extensions (MIME) is an Internet standard that extends the format of email messages to support text in character sets other than ASCII, as well as attachments of audio, video, images, and application programs. Message bodies may consist of multiple parts, and header information may be specified in non-ASCII character sets. Email messages with MIME formatting are typically transmitted with standard protocols, such as the Simple Mail Transfer Protocol (SMTP), the Post Office Protocol (POP), and the Internet Message Access Protocol (IMAP).

The Resource Interchange File Format (RIFF) is a generic file container format for storing data in tagged chunks. It is primarily used to store multimedia such as sound and video, though it may also be used to store any arbitrary data.

In computing, tar is a computer software utility for collecting many files into one archive file, often referred to as a tarball, for distribution or backup purposes. The name is derived from (t)ape (ar)chive, as it was originally developed to write data to sequential I/O devices with no file system of their own. The archive data sets created by tar contain various file system parameters, such as name, time stamps, ownership, file access permissions, and directory organization. The command line utility was first introduced in the Version 7 Unix in January 1979, replacing the tp program. The file structure to store this information was standardized in POSIX.1-1988 and later POSIX.1-2001, and became a format supported by most modern file archiving systems.

Flexible Image Transport System (FITS) is an open standard defining a digital file format useful for storage, transmission and processing of data: formatted as multi-dimensional arrays, or tables. FITS is the most commonly used digital file format in astronomy. The FITS standard was designed specifically for astronomical data, and includes provisions such as describing photometric and spatial calibration information, together with image origin metadata.

The BMP file format, also known as bitmap image file or device independent bitmap (DIB) file format or simply a bitmap, is a raster graphics image file format used to store bitmap digital images, independently of the display device, especially on Microsoft Windows and OS/2 operating systems.

In computer programming, the term magic number has multiple meanings. It could refer to one or more of the following:

A GIS file format is a standard of encoding geographical information into a computer file. They are created mainly by government mapping agencies or by GIS software developers.

The archiver, also known simply as ar, is a Unix utility that maintains groups of files as a single archive file. Today, ar is generally used only to create and update static library files that the link editor or linker uses and for generating .deb packages for the Debian family; it can be used to create archives for any purpose, but has been largely replaced by tar for purposes other than static libraries. An implementation of ar is included as one of the GNU Binutils.

A comma-separated values (CSV) file is a delimited text file that uses a comma to separate values. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format. A CSV file typically stores tabular data in plain text, in which case each line will have the same number of fields.

df is a standard Unix command used to display the amount of available disk space for file systems on which the invoking user has appropriate read access. df is typically implemented using the statfs or statvfs system calls.

Binary file computer file encoded in binary form

A binary file is a computer file that is not a text file. The term "binary file" is often used as a term meaning "non-text file". Many binary file formats contain parts that can be interpreted as text; for example, some computer document files containing formatted text, such as older Microsoft Word document files, contain the text of the document but also contain formatting information in binary form.

A FourCC is a sequence of four bytes used to uniquely identify data formats. It originated from the OSType or ResType metadata system used in classic Mac OS and was adopted for the Amiga/Electronic Arts Interchange File Format and derivatives. The idea was later reused to identify compressed data types in QuickTime and DirectShow.

Shapefile package of file formats which together define geographic features

The shapefile format is a geospatial vector data format for geographic information system (GIS) software. It is developed and regulated by Esri as a mostly open specification for data interoperability among Esri and other GIS software products. The shapefile format can spatially describe vector features: points, lines, and polygons, representing, for example, water wells, rivers, and lakes. Each item usually has attributes that describe it, such as name or temperature.

The Apple Icon Image format is the icon format used in Apple Inc.'s macOS. It supports icons of 16 × 16, 32 × 32, 48 × 48, 128 × 128, 256 × 256, 512 × 512, and 1024 × 1024 pixels, with both 1- and 8-bit alpha channels and multiple image states. The fixed-size icons can be scaled by the operating system and displayed at any intermediate size.

Disk sector Logical or physical division of storage media

In computer disk storage, a sector is a subdivision of a track on a magnetic disk or optical disc. Each sector stores a fixed amount of user-accessible data, traditionally 512 bytes for hard disk drives (HDDs) and 2048 bytes for CD-ROMs and DVD-ROMs. Newer HDDs use 4096-byte (4 KiB) sectors, which are known as the Advanced Format (AF).

An Esri grid is a raster GIS file format developed by Esri, which has two formats:

  1. A proprietary binary format, also known as an ARC/INFO GRID, ARC GRID and many other variations
  2. A non-proprietary ASCII format, also known as an ARC/INFO ASCII GRID
BSAVE image file format

BSAVE and BLOAD are commands in many varieties of the BASIC programming language. BSAVE copies RAM to a binary file, and BLOAD copies the contents of the file to RAM. The term "BSAVE image" could mean any of various raw image formats of video display controllers, or more generally any file containing the raw contents of a section of memory.

The Esri TIN format is a popular yet proprietary geospatial vector data format for geographic information system (GIS) software for storing elevation data as a triangulated irregular network. It is developed and regulated by Esri. The Esri TIN format can spatially describe elevation information including breaking edge features. Each points and triangle can carry a tag information. A TIN stored in this file format can have any shape, cover multiple regions and contain holes.

The CMS file system is the native file system of IBM's Conversational Monitor System (CMS), a component of VM/370. It was the only file system for CMS until the introduction of the CMS Shared File System with VM/SP.