Gabor transform

Last updated November 27, 2023

The Gabor transform, named after Dennis Gabor, is a special case of the short-time Fourier transform. It is used to determine the sinusoidal frequency and phase content of local sections of a signal as it changes over time. The function to be transformed is first multiplied by a Gaussian function, which acts as a window function, and the product is then Fourier transformed to obtain the time-frequency analysis.^[1] The window gives higher weight to the part of the signal near the time being analyzed. The Gabor transform of a signal x(t) is defined by:

G_{x}(\tau ,\omega )=\int _{-\infty }^{\infty }x(t)e^{-\pi (t-\tau )^{2}}e^{-j\omega t}\,dt
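The definition $G_{x}(\tau ,\omega )=\int _{-\infty }^{\infty }x(t)e^{-\pi (t-\tau )^{2}}e^{-j\omega t}\,dt$ can be approximated numerically. The following is a minimal sketch, not a reference implementation: the function name `gabor_transform`, the truncation of the integral to `half_width` around τ, and the step `dt` are all assumptions made for illustration.

```python
import cmath
import math


def gabor_transform(x, tau, omega, half_width=4.0, dt=0.005):
    """Approximate G_x(tau, omega) with a Riemann sum.

    Assumption: the Gaussian window exp(-pi (t - tau)^2) is negligible for
    |t - tau| > half_width, so the infinite integral is truncated there.
    """
    n = int(round(2 * half_width / dt))
    total = 0j
    for i in range(n + 1):
        t = tau - half_width + i * dt
        window = math.exp(-math.pi * (t - tau) ** 2)
        total += x(t) * window * cmath.exp(-1j * omega * t)
    return total * dt


def one_hz_cosine(t):
    return math.cos(2 * math.pi * t)


# A 1 Hz cosine has its energy at omega = +/- 2*pi rad/s.
on_peak = abs(gabor_transform(one_hz_cosine, tau=0.0, omega=2 * math.pi))
off_peak = abs(gabor_transform(one_hz_cosine, tau=0.0, omega=6 * math.pi))
print(on_peak, off_peak)
```

The magnitude at ω = 2π should come out close to 0.5, while the magnitude at ω = 6π (3 Hz) should be close to zero, which is the local frequency content the transform is meant to expose.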

Inverse Gabor transform

The Gabor transform is invertible. Because it is over-complete, the original signal can be recovered in a variety of ways. For example, the "unwindowing" approach can be used for any $\tau _{0}\in (-\infty ,\infty )$ :

x(t)=e^{\pi (t-\tau _{0})^{2}}{\frac {1}{2\pi }}\int _{-\infty }^{\infty }G_{x}(\tau _{0},\omega )e^{j\omega t}\,d\omega

Alternatively, all of the time components can be combined:

x(t)=\int _{-\infty }^{\infty }{\frac {1}{2\pi }}\int _{-\infty }^{\infty }G_{x}(\tau ,\omega )e^{j\omega t}\,d\omega \,d\tau

Properties of the Gabor transform

The Gabor transform has many properties like those of the Fourier transform. These properties are listed in the following tables.

	Signal	Gabor transform	Remarks
	$x(t)\,$	$G_{x}(\tau ,\omega )=\int _{-\infty }^{\infty }x(t)e^{-\pi (t-\tau )^{2}}e^{-j\omega t}\,dt$
1	$a\cdot x(t)+b\cdot y(t)\,$	$a\cdot G_{x}(\tau ,\omega )+b\cdot G_{y}(\tau ,\omega )\,$	Linearity property
2	$x(t-t_{0})\,$	$G_{x}(\tau -t_{0},\omega )e^{-j\omega t_{0}}\,$	Shifting property
3	$x(t)e^{j\omega _{0}t}\,$	$G_{x}(\tau ,\omega -\omega _{0})\,$	Modulation property
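The shifting and modulation properties can be verified numerically. The sketch below assumes a Gaussian test pulse and arbitrary values for $t_{0}$, $\omega _{0}$, τ and ω; the helper `gabor` is a hypothetical Riemann-sum approximation of the defining integral, not a library routine.

```python
import cmath
import math


def gabor(x, tau, omega, half_width=4.0, dt=0.01):
    """Riemann-sum approximation of G_x(tau, omega)."""
    n = int(round(2 * half_width / dt))
    total = 0j
    for i in range(n + 1):
        t = tau - half_width + i * dt
        total += x(t) * math.exp(-math.pi * (t - tau) ** 2) * cmath.exp(-1j * omega * t)
    return total * dt


def pulse(t):
    return math.exp(-t * t)


t0, omega0 = 0.5, 3.0      # shift and modulation frequency (arbitrary)
tau, omega = 0.2, 1.5      # point at which the transform is evaluated

# Shifting: x(t - t0)  <->  G_x(tau - t0, omega) * exp(-j omega t0)
shifted = gabor(lambda t: pulse(t - t0), tau, omega)
shift_rule = gabor(pulse, tau - t0, omega) * cmath.exp(-1j * omega * t0)

# Modulation: x(t) exp(j omega0 t)  <->  G_x(tau, omega - omega0)
modulated = gabor(lambda t: pulse(t) * cmath.exp(1j * omega0 * t), tau, omega)
modulation_rule = gabor(pulse, tau, omega - omega0)

print(abs(shifted - shift_rule), abs(modulated - modulation_rule))
```

Both differences should be at the level of floating-point rounding.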

	Property	Remarks
1	$\int _{-\infty }^{\infty }\left|G_{x}(\tau ,\omega )\right|^{2}\,d\omega =2\pi \int _{-\infty }^{\infty }\left|x(t)\right|^{2}e^{-2\pi (t-\tau )^{2}}\,dt\approx 2\pi \int _{\tau -1.9143}^{\tau +1.9143}\left|x(t)\right|^{2}e^{-2\pi (t-\tau )^{2}}\,dt$	Power integration property
2	$\int _{-\infty }^{\infty }\int _{-\infty }^{\infty }G_{x}(\tau ,\omega )G_{y}^{*}(\tau ,\omega )\,d\omega \,d\tau ={\sqrt {2}}\,\pi \int _{-\infty }^{\infty }x(t)y^{*}(t)\,dt$	Energy sum property
3	If $x(t)=0$ for $t>t_{0}$, then for $\tau >t_{0}$: $\int _{-\infty }^{\infty }\left|G_{x}(\tau ,\omega )\right|^{2}\,d\omega <e^{-2\pi (\tau -t_{0})^{2}}\int _{-\infty }^{\infty }\left|G_{x}(t_{0},\omega )\right|^{2}\,d\omega$	Power decay property
4	$\int _{-\infty }^{\infty }G_{x}(\tau ,\omega )e^{j\omega t}\,d\omega =2\pi e^{-\pi (t-\tau )^{2}}x(t)$	Recovery property; at $t=0$ the right-hand side is $2\pi e^{-\pi \tau ^{2}}x(0)$
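The table does not say where the integration limits τ ± 1.9143 come from. One plausible reading, offered here as an assumption rather than as the source's stated rationale, is that the Gaussian window $e^{-\pi a^{2}}$ has dropped to about $10^{-5}$ at $|a|=1.9143$. The arithmetic can be checked directly:

```python
import math

cutoff = 1.9143  # half-width of the truncated integral in the table

# Value of the Gaussian window exp(-pi a^2) at the cutoff.
window_at_cutoff = math.exp(-math.pi * cutoff ** 2)

# Half-width at which the window equals exactly 1e-5 (assumed threshold):
# exp(-pi a^2) = 1e-5  =>  a = sqrt(5 ln(10) / pi)
threshold_half_width = math.sqrt(5 * math.log(10) / math.pi)

print(window_at_cutoff)       # close to 1e-5
print(threshold_half_width)   # close to 1.9143
```

Under this reading, the contribution of the signal outside the truncated range is weighted by less than $10^{-5}$ in the transform itself, and by less than $10^{-10}$ in the squared window that appears in the power integration property.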

Application and example

The main application of the Gabor transform is time–frequency analysis. Take the following function as an example. The input signal has a 1 Hz frequency component when t ≤ 0 and a 2 Hz frequency component when t > 0:

x(t)={\begin{cases}\cos(2\pi t)&{\text{for }}t\leq 0,\\\cos(4\pi t)&{\text{for }}t>0.\end{cases}}

If the total available bandwidth is 5 Hz, the frequency bands not occupied by x(t) are wasted. Applying the Gabor transform for time–frequency analysis reveals which bands are free, so that they can be used for other applications and bandwidth is saved. The accompanying figure shows the input signal x(t) and the output of the Gabor transform. As expected, the frequency distribution separates into two parts, one for t ≤ 0 and the other for t > 0. The white part is the frequency band occupied by x(t) and the black part is unused. Note that at each point in time there is both a negative (upper white part) and a positive (lower white part) frequency component.
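The separation into a 1 Hz part and a 2 Hz part can be reproduced numerically for the piecewise cosine x(t) defined above. The sketch below is illustrative only: the helper names, the frequency grid from 0 to 5 Hz in 0.05 Hz steps, and the two analysis times τ = −2 and τ = 2 are assumptions, and only non-negative frequencies are scanned.

```python
import cmath
import math


def x(t):
    """Example signal: 1 Hz for t <= 0, 2 Hz for t > 0."""
    return math.cos(2 * math.pi * t) if t <= 0 else math.cos(4 * math.pi * t)


def gabor_magnitude(tau, freq_hz, half_width=4.0, dt=0.01):
    """|G_x(tau, omega)| at omega = 2*pi*freq_hz, via a Riemann sum."""
    omega = 2 * math.pi * freq_hz
    n = int(round(2 * half_width / dt))
    total = 0j
    for i in range(n + 1):
        t = tau - half_width + i * dt
        total += x(t) * math.exp(-math.pi * (t - tau) ** 2) * cmath.exp(-1j * omega * t)
    return abs(total * dt)


def dominant_frequency(tau, max_hz=5.0, step_hz=0.05):
    """Frequency (Hz) on the grid where the magnitude is largest."""
    freqs = [i * step_hz for i in range(int(round(max_hz / step_hz)) + 1)]
    return max(freqs, key=lambda f: gabor_magnitude(tau, f))


peak_before = dominant_frequency(tau=-2.0)
peak_after = dominant_frequency(tau=2.0)
print(peak_before, peak_after)
```

The two printed values should be close to 1.0 and 2.0 Hz respectively, matching the two occupied bands described in the text.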

Discrete Gabor transform

A discrete version of the Gabor representation

y(t)=\sum _{m=-\infty }^{\infty }\sum _{n=-\infty }^{\infty }C_{nm}\cdot g_{nm}(t)

with $g_{nm}(t)=s(t-m\tau _{0})\cdot e^{j\Omega nt}$

can be derived by discretizing the Gabor basis functions in these equations. The continuous parameter t is replaced by the discrete time k, and the summation limits in the Gabor representation, which are now finite, have to be taken into account. In this way, the sampled signal y(k) is split into M time frames of length N. According to $\Omega \leq {\tfrac {2\pi }{\tau _{0}}}$, the factor Ω for critical sampling is $\Omega ={\tfrac {2\pi }{N}}$.

Similar to the DFT (discrete Fourier transform), the frequency domain is split into N discrete partitions. An inverse transformation of these N spectral partitions then yields the N values y(k) of one time window, which consists of N sample values. With M time windows of N sample values each, the signal y(k) contains $K=N\cdot M$ sample values, which gives the discrete Gabor representation:

y(k)=\sum _{m=0}^{M-1}\sum _{n=0}^{N-1}C_{nm}\cdot g_{nm}(k)

with $g_{nm}(k)=s(k-mN)\cdot e^{j\Omega nk}$

According to the equation above, the number of coefficients $C_{nm}$, which is $N\cdot M$, equals the number of sample values K of the signal.

For over-sampling, $\Omega$ is set to $\Omega ={\tfrac {2\pi }{N^{\prime }}}<{\tfrac {2\pi }{N}}$ with N′ > N, which results in N′ > N summation coefficients in the second sum of the discrete Gabor representation. In this case, the number of Gabor coefficients is $M\cdot N^{\prime }>K$. Hence more coefficients than sample values are available, and the representation is redundant.
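The critically sampled representation can be illustrated with a small example. The sketch below makes a simplifying assumption that is not in the text: the window s(k) is rectangular (1 for 0 ≤ k < N, 0 otherwise) rather than Gaussian, so that the atoms $g_{nm}$ are orthogonal and the coefficients reduce to a per-frame DFT divided by N. With a Gaussian window, computing the coefficients would require a separate analysis (dual) window. All function names are hypothetical.

```python
import cmath
import math


def rect_window(k, n_len):
    """Assumed window s(k): 1 inside one frame, 0 elsewhere."""
    return 1.0 if 0 <= k < n_len else 0.0


def gabor_atom(n, m, k, n_len):
    """g_nm(k) = s(k - m N) * exp(j Omega n k) with Omega = 2 pi / N."""
    omega = 2 * math.pi / n_len
    return rect_window(k - m * n_len, n_len) * cmath.exp(1j * omega * n * k)


def analyze(y, n_len):
    """Coefficients C[n][m] for the rectangular window (per-frame DFT / N)."""
    frames = len(y) // n_len
    return [[sum(y[k] * gabor_atom(n, m, k, n_len).conjugate()
                 for k in range(m * n_len, (m + 1) * n_len)) / n_len
             for m in range(frames)]
            for n in range(n_len)]


def synthesize(coeffs, n_len, total):
    """y(k) = sum over m and n of C_nm * g_nm(k)."""
    frames = len(coeffs[0])
    return [sum(coeffs[n][m] * gabor_atom(n, m, k, n_len)
                for m in range(frames) for n in range(n_len))
            for k in range(total)]


y = [0.5, -1.0, 2.0, 0.0, 1.5, 3.0, -2.5, 1.0]   # K = 8 samples
N = 4                                            # frame length, so M = 2
coeffs = analyze(y, N)
rebuilt = synthesize(coeffs, N, len(y))
coefficient_count = sum(len(row) for row in coeffs)
print(coefficient_count, max(abs(a - b) for a, b in zip(rebuilt, y)))
```

The coefficient count equals K = N · M = 8, and the reconstruction error should be at rounding level, which is the critical-sampling case described above.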

Scaled Gabor transform

As with the short-time Fourier transform, the resolution in the time and frequency domains can be adjusted by choosing a different window width. For the Gabor transform, this is done by introducing a parameter $\sigma$ that scales the width of the Gaussian window.

The scaled (normalized) Gaussian window is given by:

W_{\text{gaussian}}(t)=e^{-\sigma \pi t^{2}}

So the scaled Gabor transform can be written as:

G_{x}(t,f)={\sqrt[{4}]{\sigma }}\int _{-\infty }^{\infty }e^{-\sigma \pi (\tau -t)^{2}}e^{-j2\pi f\tau }x(\tau )\,d\tau

With a large $\sigma$ , the window function is narrow, giving higher resolution in the time domain but lower resolution in the frequency domain. Conversely, a small $\sigma$ gives a wide window, with higher resolution in the frequency domain but lower resolution in the time domain.
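The trade-off can be made concrete by measuring the width of the window $e^{-\sigma \pi t^{2}}$ and of its Fourier transform, which is proportional to $e^{-\pi f^{2}/\sigma }$. The sketch below uses the full width at half maximum (FWHM) as the width measure; that choice, and the function names, are assumptions made for illustration.

```python
import math


def time_fwhm(sigma):
    """FWHM of exp(-sigma * pi * t^2) in time."""
    return 2 * math.sqrt(math.log(2) / (math.pi * sigma))


def freq_fwhm(sigma):
    """FWHM of exp(-pi * f^2 / sigma), the window's spectrum, in frequency."""
    return 2 * math.sqrt(sigma * math.log(2) / math.pi)


for sigma in (0.25, 1.0, 4.0):
    # The product of the two widths does not depend on sigma.
    print(sigma, time_fwhm(sigma), freq_fwhm(sigma),
          time_fwhm(sigma) * freq_fwhm(sigma))
```

Quadrupling σ halves the time width and doubles the frequency width, while their product stays fixed at $4\ln 2/\pi$, so resolution gained in one domain is lost in the other.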

Time-causal analogue of the Gabor transform

When processing temporal signals in real time, data from the future cannot be accessed, which is a problem for Gabor functions because the Gaussian window extends symmetrically into the future. A time-causal analogue of the Gabor filter has been developed^[2] by replacing the Gaussian kernel in the Gabor function with a time-causal and time-recursive kernel referred to as the time-causal limit kernel. Time-frequency analysis based on the resulting complex-valued extension of the time-causal limit kernel makes it possible to capture essentially the same transformations of a temporal signal as the Gabor function can, corresponding to the Heisenberg group; see ^[2] for further details.

References

[1] E. Sejdić, I. Djurović, J. Jiang, “Time-frequency feature representation using energy concentration: An overview of recent advances,” Digital Signal Processing, vol. 19, no. 1, pp. 153-183, January 2009.
[2] T. Lindeberg, “A time-causal and time-recursive scale-covariant scale-space representation of temporal signals and past time,” Biological Cybernetics, pp. 1–39, 23 January 2023. doi:10.1007/s00422-022-00953-6.

D. Gabor, Theory of Communication, Part 1, J. Inst. of Elect. Eng. Part III, Radio and Communication, vol 93, p. 429 1946 (http://genesis.eecg.toronto.edu/gabor1946.pdf)
Jian-Jiun Ding, Time frequency analysis and wavelet transform class note, the Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan, 2007.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
