Explore the World of Pro Audio

Fourier Analysis Applied to Audio: Time-Frequency Domain

Fourier analysis is a fundamental mathematical tool in signal processing, enabling the decomposition of a time-domain signal into its frequency components. In the context of audio, this analysis is essential for tasks such as compression, filtering, synthesis, and pattern recognition. However, since audio signals are dynamic and vary over time, classical Fourier analysis (based on the Fourier Transform) is insufficient, as it only provides frequency-domain information without considering temporal evolution.

To address this issue, time-frequency analysis techniques are used, which allow studying how the spectral content of an audio signal changes over time. Among these techniques, the most notable are:

Short-Time Fourier Transform (STFT)
Wavelet Transform (WT)
Time-Frequency Energy Distributions (e.g., Spectrogram, Wigner-Ville)

This document explores Fourier analysis applied to audio in detail, with a focus on the STFT and its representation via spectrograms.

Time-Frequency Domain

Mathematical Foundations: From the Fourier Transform to the STFT

Continuous Fourier Transform (FT)

The Fourier Transform (FT) of a continuous signal is defined as:

This representation indicates which frequencies are present in the signal but does not show when they occur. For stationary signals (whose properties do not change over time), the FT is sufficient. However, audio is a non-stationary signal, requiring a different approach.

Short-Time Fourier Transform (STFT)

The STFT introduces a sliding window $w (t)$ that isolates short-time segments of the signal before applying the FT. Mathematically:

Where:

is a window function (e.g., Hann, Hamming, Blackman).
is the time instant being analyzed.
$f$ is the frequency.

The STFT produces a time-frequency representation, which can be visualized using a spectrogram.

Key Parameters in the STFT

The quality and resolution of the STFT depend on:

Window size ( $N$ ):
- Larger windows → Higher frequency resolution, lower time resolution.
- Smaller windows → Higher time resolution, lower frequency resolution (Heisenberg-Gabor Uncertainty Principle).
Overlap:
- To avoid information loss, an overlap of 50%-75% is typically used.
Window type:
- Rectangular: Simple but with high side lobes (spectral leakage).
- Hann/Hamming: Reduces leakage, better for spectral analysis.

Spectrogram: Visualizing the Time-Frequency Domain

A spectrogram is a graphical representation of the STFT magnitude ( $∣ X (t, f) ∣$ ):

X-axis: Time.
Y-axis: Frequency.
Color/Intensity: Energy (dB) or magnitude.

Example features in an audio spectrogram:

Formants: High-energy bands corresponding to vocal tract resonances.
Harmonics: Periodic components in musical instrument signals.

Types of Spectrograms

Amplitude Spectrogram: Displays $∣ X (t, f) ∣$ .
Power Spectrogram: Displays $∣ X (t, f) ∣^{2}$ .
Log-Scale Spectrogram: Useful for audio, as human hearing perceives frequencies logarithmically (e.g., Mel scale).

Applications in Audio Processing

Audio Compression (e.g., MP3, AAC)

The STFT identifies irrelevant components (auditory masking) for selective compression.

Speech and Music Recognition

Extraction of MFCCs (Mel-Frequency Cepstral Coefficients), based on filter banks applied to the spectrogram.

Audio Synthesis and Modification

Time-Stretching & Pitch-Shifting (e.g., Phase Vocoder).
Noise Reduction: Time-frequency filtering (e.g., Wiener Filtering).

Musical Instrument Analysis

Identification of harmonic partials and transients.

Limitations and Alternatives

Limitations of the STFT

Time-frequency trade-off: Cannot achieve high resolution in both domains simultaneously.
Smearing effect: Due to window convolution.

Alternative Methods

Wavelet Transform: Better time resolution for high frequencies.
Spectral Component Analysis (PCA/ICA): For complex signals.
Wigner-Ville Distribution: Higher precision but with cross-term interference.

Conclusions

Fourier analysis in the time-frequency domain (STFT + Spectrogram) is a powerful technique for audio processing, enabling:

Visualization of frequency evolution.
Feature extraction for machine learning.
Sound modification and synthesis.

However, its effectiveness depends on proper parameter selection (window type, overlap, FFT size). For advanced applications, methods like wavelets or neural network-based models (e.g., TF-GAN, Wavenet) can be explored.

References

Oppenheim, A. V., & Schafer, R. W. (2010). Discrete-Time Signal Processing.
Smith, J. O. (2007). Spectral Audio Signal Processing.
Mallat, S. (2008). A Wavelet Tour of Signal Processing.

Explore Other Topics



Acoustic Science

Dive into the principles and theories that define sound and its behavior in different environments.



Audio Technology

Stay updated with the latest advancements and innovations in audio equipment and software.



Sound Design

Explore the art and techniques of creating immersive soundscapes for various media.



Music Production

Learn about the processes and tools used in crafting and producing music.



Environmental Acoustics

Understand how sound interacts with natural and urban environments.



Psychoacoustics

Study how humans perceive and interpret different sounds.



Audio Engineering

Gain insights into the technical aspects of recording and manipulating sound.



Sound History

Discover the evolution of sound technology and its impact on society.

Featured Articles

All
Acoustic Science
Audio Engineering
Audio Technology
Environmental Acoustics
Music Production
Psychoacoustics
Sound Design
Sound History

Fourier Analysis Applied to Audio: Time-Frequency Domain

Explore the World of Pro Audio

Fourier Analysis Applied to Audio: Time-Frequency Domain

Time-Frequency Domain

Mathematical Foundations: From the Fourier Transform to the STFT

Continuous Fourier Transform (FT)

Short-Time Fourier Transform (STFT)

Key Parameters in the STFT

Spectrogram: Visualizing the Time-Frequency Domain

Types of Spectrograms

Applications in Audio Processing

Audio Compression (e.g., MP3, AAC)

Speech and Music Recognition

Audio Synthesis and Modification

Musical Instrument Analysis

Limitations and Alternatives

Limitations of the STFT

Alternative Methods

Conclusions

References

Explore Other Topics

Acoustic Science

Audio Technology

Sound Design

Music Production

Environmental Acoustics

Psychoacoustics

Audio Engineering

Sound History

Featured Articles

Tecnare

Loudspeakers Series

E Series

IBZA Series

V Series

ALIS Series

Array Series

SW Series

KT Series

TANIT Series

CS Series

Amplifiers Series

Digital Processors

Accessories

REINVENTING THE RULES

Tecnare