Distance geometry

Last updated

Distance geometry is the branch of mathematics concerned with characterizing and studying sets of points based only on given values of the distances between pairs of points. [1] [2] [3] More abstractly, it is the study of semimetric spaces and the isometric transformations between them. In this view, it can be considered as a subject within general topology. [4]

Contents

Historically, the first result in distance geometry is Heron's formula in 1st century AD. The modern theory began in 19th century with work by Arthur Cayley, followed by more extensive developments in the 20th century by Karl Menger and others.

Distance geometry problems arise whenever one needs to infer the shape of a configuration of points (relative positions) from the distances between them, such as in biology, [4] sensor networks, [5] surveying, navigation, cartography, and physics.

Introduction and definitions

The concepts of distance geometry will first be explained by describing two particular problems.

Problem of hyperbolic navigation Hyperbolic Navigation.svg
Problem of hyperbolic navigation

First problem: hyperbolic navigation

Consider three ground radio stations A, B, C, whose locations are known. A radio receiver is at an unknown location. The times it takes for a radio signal to travel from the stations to the receiver, , are unknown, but the time differences, and , are known. From them, one knows the distance differences and , from which the position of the receiver can be found.

Second problem: dimension reduction

In data analysis, one is often given a list of data represented as vectors , and one needs to find out whether they lie within a low-dimensional affine subspace. A low-dimensional representation of data has many advantages, such as saving storage space, computation time, and giving better insight into data.

Definitions

Now we formalize some definitions that naturally arise from considering our problems.

Semimetric space

Given a list of points on , , we can arbitrarily specify the distances between pairs of points by a list of , . This defines a semimetric space: a metric space without triangle inequality.

Explicitly, we define a semimetric space as a nonempty set equipped with a semimetric such that, for all ,

  1. Positivity:   if and only if  .
  2. Symmetry: .

Any metric space is a fortiori a semimetric space. In particular, , the -dimensional Euclidean space, is the canonical metric space in distance geometry.

The triangle inequality is omitted in the definition, because we do not want to enforce more constraints on the distances than the mere requirement that they be positive.

In practice, semimetric spaces naturally arise from inaccurate measurements. For example, given three points on a line, with , an inaccurate measurement could give , violating the triangle inequality.

Isometric embedding

Given two semimetric spaces, , an isometric embedding from to is a map that preserves the semimetric, that is, for all , .

For example, given the finite semimetric space defined above, an isometric embedding into is defined by points , such that for all .

Affine independence

Given the points , they are defined to be affinely independent, iff they cannot fit inside a single -dimensional affine subspace of , for any , iff the - simplex they span, , has positive -volume, that is, .

In general, when , they are affinely independent, since a generic n-simplex is nondegenerate. For example, 3 points in the plane, in general, are not collinear, because the triangle they span does not degenerate into a line segment. Similarly, 4 points in space, in general, are not coplanar, because the tetrahedron they span does not degenerate into a flat triangle.

When , they must be affinely dependent. This can be seen by noting that any -simplex that can fit inside must be "flat".

Cayley–Menger determinants

Cayley–Menger determinants, named after Arthur Cayley and Karl Menger, are determinants of matrices of distances between sets of points.

Let be n + 1 points in a semimetric space, their Cayley–Menger determinant is defined by

If , then they make up the vertices of a possibly degenerate n-simplex in . It can be shown that [6] the n-dimensional volume of the simplex satisfies

Note that, for the case of , we have , meaning the "0-dimensional volume" of a 0-simplex is 1, that is, there is 1 point in a 0-simplex.

are affinely independent iff , that is, . Thus Cayley–Menger determinants give a computational way to prove affine independence.

If , then the points must be affinely dependent, thus . Cayley's 1841 paper studied the special case of , that is, any five points in 3-dimensional space must have .

History

The first result in distance geometry is Heron's formula, from 1st century AD, which gives the area of a triangle from the distances between its 3 vertices. Brahmagupta's formula, from 7th century AD, generalizes it to cyclic quadrilaterals. Tartaglia, from 16th century AD, generalized it to give the volume of tetrahedron from the distances between its 4 vertices.

The modern theory of distance geometry began with Arthur Cayley and Karl Menger. [7] Cayley published the Cayley determinant in 1841, [8] which is a special case of the general Cayley–Menger determinant. Menger proved in 1928 a characterization theorem of all semimetric spaces that are isometrically embeddable in the n-dimensional Euclidean space . [9] [10] In 1931, Menger used distance relations to give an axiomatic treatment of Euclidean geometry. [11]

Leonard Blumenthal's book [12] gives a general overview for distance geometry at the graduate level, a large part of which is treated in English for the first time when it was published.

Menger characterization theorem

Menger proved the following characterization theorem of semimetric spaces: [2]

A semimetric space is isometrically embeddable in the -dimensional Euclidean space , but not in for any , if and only if:

  1. contains an -point subset that is isometric with an affinely independent -point subset of ;
  2. any -point subset , obtained by adding any two additional points of to , is congruent to an -point subset of .

A proof of this theorem in a slightly weakened form (for metric spaces instead of semimetric spaces) is in. [13]

Characterization via Cayley–Menger determinants

The following results are proved in Blumethal's book. [12]

Embedding n + 1 points in the real numbers

Given a semimetric space , with , and , , an isometric embedding of into is defined by , such that for all .

Again, one asks whether such an isometric embedding exists for .

A necessary condition is easy to see: for all , let be the k-simplex formed by , then

The converse also holds. That is, if for all ,

then such an embedding exists.

Further, such embedding is unique up to isometry in . That is, given any two isometric embeddings defined by , and , there exists a (not necessarily unique) isometry , such that for all . Such is unique if and only if , that is, are affinely independent.

Embedding n + 2 and n + 3 points

If points can be embedded in as , then other than the conditions above, an additional necessary condition is that the -simplex formed by , must have no -dimensional volume. That is, .

The converse also holds. That is, if for all ,

and

then such an embedding exists.

For embedding points in , the necessary and sufficient conditions are similar:

  1. For all , ;

Embedding arbitrarily many points

The case turns out to be sufficient in general.

In general, given a semimetric space , it can be isometrically embedded in if and only if there exists , such that, for all , , and for any ,

And such embedding is unique up to isometry in .

Further, if , then it cannot be isometrically embedded in any . And such embedding is unique up to unique isometry in .

Thus, Cayley–Menger determinants give a concrete way to calculate whether a semimetric space can be embedded in , for some finite , and if so, what is the minimal .

Applications

There are many applications of distance geometry. [3]

In telecommunication networks such as GPS, the positions of some sensors are known (which are called anchors) and some of the distances between sensors are also known: the problem is to identify the positions for all sensors. [5] Hyperbolic navigation is one pre-GPS technology that uses distance geometry for locating ships based on the time it takes for signals to reach anchors.

There are many applications in chemistry. [4] [12] Techniques such as NMR can measure distances between pairs of atoms of a given molecule, and the problem is to infer the 3-dimensional shape of the molecule from those distances.

Some software packages for applications are:

See also

Related Research Articles

<span class="mw-page-title-main">Euclidean space</span> Fundamental space of geometry

Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are Euclidean spaces of any positive integer dimension n, which are called Euclidean n-spaces when one wants to specify their dimension. For n equal to one or two, they are commonly called respectively Euclidean lines and Euclidean planes. The qualifier "Euclidean" is used to distinguish Euclidean spaces from other spaces that were later considered in physics and modern mathematics.

<span class="mw-page-title-main">Metric space</span> Mathematical space with a notion of distance

In mathematics, a metric space is a set together with a notion of distance between its elements, usually called points. The distance is measured by a function called a metric or distance function. Metric spaces are the most general setting for studying many of the concepts of mathematical analysis and geometry.

In commutative algebra, the prime spectrum of a ring R is the set of all prime ideals of R, and is usually denoted by ; in algebraic geometry it is simultaneously a topological space equipped with the sheaf of rings .

<span class="mw-page-title-main">Simplex</span> Multi-dimensional generalization of triangle

In geometry, a simplex is a generalization of the notion of a triangle or tetrahedron to arbitrary dimensions. The simplex is so-named because it represents the simplest possible polytope in any given dimension. For example,

In mathematics, a quadric or quadric surface (quadric hypersurface in higher dimensions), is a generalization of conic sections (ellipses, parabolas, and hyperbolas). It is a hypersurface (of dimension D) in a (D + 1)-dimensional space, and it is defined as the zero set of an irreducible polynomial of degree two in D + 1 variables; for example, D = 1 in the case of conic sections. When the defining polynomial is not absolutely irreducible, the zero set is generally not considered a quadric, although it is often called a degenerate quadric or a reducible quadric.

<span class="mw-page-title-main">Algebraic variety</span> Mathematical object studied in the field of algebraic geometry

Algebraic varieties are the central objects of study in algebraic geometry, a sub-field of mathematics. Classically, an algebraic variety is defined as the set of solutions of a system of polynomial equations over the real or complex numbers. Modern definitions generalize this concept in several different ways, while attempting to preserve the geometric intuition behind the original definition.

<span class="mw-page-title-main">Isometry</span> Distance-preserving mathematical transformation

In mathematics, an isometry is a distance-preserving transformation between metric spaces, usually assumed to be bijective. The word isometry is derived from the Ancient Greek: ἴσος isos meaning "equal", and μέτρον metron meaning "measure".

<span class="mw-page-title-main">Projective variety</span>

In algebraic geometry, a projective variety over an algebraically closed field k is a subset of some projective n-space over k that is the zero-locus of some finite family of homogeneous polynomials of n + 1 variables with coefficients in k, that generate a prime ideal, the defining ideal of the variety. Equivalently, an algebraic variety is projective if it can be embedded as a Zariski closed subvariety of .

This is a glossary of some terms used in Riemannian geometry and metric geometry — it doesn't cover the terminology of differential topology.

In statistics, compositional data are quantitative descriptions of the parts of some whole, conveying relative information. Mathematically, compositional data is represented by points on a simplex. Measurements involving probabilities, proportions, percentages, and ppm can all be thought of as compositional data.

In mathematics, a hyperbolic metric space is a metric space satisfying certain metric relations between points. The definition, introduced by Mikhael Gromov, generalizes the metric properties of classical hyperbolic geometry and of trees. Hyperbolicity is a large-scale property, and is very useful to the study of certain infinite groups called Gromov-hyperbolic groups.

<span class="mw-page-title-main">Real coordinate space</span> Space formed by the n-tuples of real numbers

In mathematics, the real coordinate space of dimension n, denoted Rn or , is the set of the n-tuples of real numbers, that is the set of all sequences of n real numbers. Special cases are called the real lineR1 and the real coordinate planeR2. With component-wise addition and scalar multiplication, it is a real vector space, and its elements are called coordinate vectors.

In functional analysis and related areas of mathematics, a sequence space is a vector space whose elements are infinite sequences of real or complex numbers. Equivalently, it is a function space whose elements are functions from the natural numbers to the field K of real or complex numbers. The set of all such functions is naturally identified with the set of all possible infinite sequences with elements in K, and can be turned into a vector space under the operations of pointwise addition of functions and pointwise scalar multiplication. All sequence spaces are linear subspaces of this space. Sequence spaces are typically equipped with a norm, or at least the structure of a topological vector space.

In algebraic geometry, Proj is a construction analogous to the spectrum-of-a-ring construction of affine schemes, which produces objects with the typical properties of projective spaces and projective varieties. The construction, while not functorial, is a fundamental tool in scheme theory.

In linear algebra, geometry, and trigonometry, the Cayley–Menger determinant is a formula for the content, i.e. the higher-dimensional volume, of a -dimensional simplex in terms of the squares of all of the distances between pairs of its vertices. The determinant is named after Arthur Cayley and Karl Menger.

<span class="mw-page-title-main">De Gua's theorem</span> Three-dimensional analog of the Pythagorean theorem

In mathematics, De Gua's theorem is a three-dimensional analog of the Pythagorean theorem named after Jean Paul de Gua de Malves. It states that if a tetrahedron has a right-angle corner, then the square of the area of the face opposite the right-angle corner is the sum of the squares of the areas of the other three faces:

In mathematics, a Euclidean distance matrix is an n×n matrix representing the spacing of a set of n points in Euclidean space. For points in k-dimensional space k, the elements of their Euclidean distance matrix A are given by squares of distances between them. That is

<span class="mw-page-title-main">Quasi-isometry</span>

In mathematics, a quasi-isometry is a function between two metric spaces that respects large-scale geometry of these spaces and ignores their small-scale details. Two metric spaces are quasi-isometric if there exists a quasi-isometry between them. The property of being quasi-isometric behaves like an equivalence relation on the class of metric spaces.

In mathematics, and especially affine differential geometry, the affine focal set of a smooth submanifold M embedded in a smooth manifold N is the caustic generated by the affine normal lines. It can be realised as the bifurcation set of a certain family of functions. The bifurcation set is the set of parameter values of the family which yield functions with degenerate singularities. This is not the same as the bifurcation diagram in dynamical systems.

In metric geometry, asymptotic dimension of a metric space is a large-scale analog of Lebesgue covering dimension. The notion of asymptotic dimension was introduced by Mikhail Gromov in his 1993 monograph Asymptotic invariants of infinite groups in the context of geometric group theory, as a quasi-isometry invariant of finitely generated groups. As shown by Guoliang Yu, finitely generated groups of finite homotopy type with finite asymptotic dimension satisfy the Novikov conjecture. Asymptotic dimension has important applications in geometric analysis and index theory.

References

  1. Yemini, Y. (1978). "The positioning problem — a draft of an intermediate summary". Conference on Distributed Sensor Networks, Pittsburgh.
  2. 1 2 Liberti, Leo; Lavor, Carlile; MacUlan, Nelson; Mucherino, Antonio (2014). "Euclidean Distance Geometry and Applications". SIAM Review. 56: 3–69. arXiv: 1205.0349 . doi:10.1137/120875909. S2CID   15472897.
  3. 1 2 Mucherino, A.; Lavor, C.; Liberti, L.; Maculan, N. (2013). Distance Geometry: Theory, Methods and Applications.
  4. 1 2 3 Crippen, G.M.; Havel, T.F. (1988). Distance Geometry and Molecular Conformation. John Wiley & Sons.
  5. 1 2 Biswas, P.; Lian, T.; Wang, T.; Ye, Y. (2006). "Semidefinite programming based algorithms for sensor network localization". ACM Transactions on Sensor Networks. 2 (2): 188–220. doi:10.1145/1149283.1149286. S2CID   8002168.
  6. "Simplex Volumes and the Cayley–Menger Determinant". www.mathpages.com. Archived from the original on 16 May 2019. Retrieved 2019-06-08.
  7. Liberti, Leo; Lavor, Carlile (2016). "Six mathematical gems from the history of distance geometry". International Transactions in Operational Research. 23 (5): 897–920. arXiv: 1502.02816 . doi:10.1111/itor.12170. ISSN   1475-3995. S2CID   17299562.
  8. Cayley, Arthur (1841). "On a theorem in the geometry of position". Cambridge Mathematical Journal. 2: 267–271.
  9. Menger, Karl (1928-12-01). "Untersuchungen über allgemeine Metrik". Mathematische Annalen (in German). 100 (1): 75–163. doi:10.1007/BF01448840. ISSN   1432-1807. S2CID   179178149.
  10. Blumenthal, L. M.; Gillam, B. E. (1943). "Distribution of Points in n-Space". The American Mathematical Monthly. 50 (3): 181. doi:10.2307/2302400. JSTOR   2302400.
  11. Menger, Karl (1931). "New Foundation of Euclidean Geometry". American Journal of Mathematics. 53 (4): 721–745. doi:10.2307/2371222. ISSN   0002-9327. JSTOR   2371222.
  12. 1 2 3 Blumenthal, Leonard M. (1953). Theory and Applications of Distance Geometry . Oxford University Press. (2nd edition, Chelsea: 1970)
  13. Bowers, John C.; Bowers, Philip L. (2017-12-13). "A Menger Redux: Embedding Metric Spaces Isometrically in Euclidean Space". The American Mathematical Monthly. 124 (7): 621. doi:10.4169/amer.math.monthly.124.7.621. S2CID   50040864.