Explore the World of Pro Audio

Fourier Analysis Applied to Audio: Time-Frequency Domain

Fourier analysis is a fundamental mathematical tool in signal processing, enabling the decomposition of a time-domain signal into its frequency components. In the context of audio, this analysis is essential for tasks such as compression, filtering, synthesis, and pattern recognition. However, since audio signals are dynamic and vary over time, classical Fourier analysis (based on the Fourier Transform) is insufficient, as it only provides frequency-domain information without considering temporal evolution.

To address this issue, time-frequency analysis techniques are used, which allow studying how the spectral content of an audio signal changes over time. Among these techniques, the most notable are:

  • Short-Time Fourier Transform (STFT)

  • Wavelet Transform (WT)

  • Time-Frequency Energy Distributions (e.g., Spectrogram, Wigner-Ville)

This document explores Fourier analysis applied to audio in detail, with a focus on the STFT and its representation via spectrograms.

Time-Frequency Domain

Mathematical Foundations: From the Fourier Transform to the STFT

 

Continuous Fourier Transform (FT)

 

The Fourier Transform (FT) of a continuous signal x(t) is defined as:

Fourier Analysis Applied to Audio: Time-Frequency Domain 1

This representation indicates which frequencies are present in the signal but does not show when they occur. For stationary signals (whose properties do not change over time), the FT is sufficient. However, audio is a non-stationary signal, requiring a different approach.

 

Short-Time Fourier Transform (STFT)

 

The STFT introduces a sliding window w(t) that isolates short-time segments of the signal before applying the FT. Mathematically:

Fourier Analysis Applied to Audio: Time-Frequency Domain 2

Where:

  • w(t) is a window function (e.g., Hann, Hamming, Blackman).

  • t is the time instant being analyzed.

  •  is the frequency.

The STFT produces a time-frequency representation, which can be visualized using a spectrogram.


 

Key Parameters in the STFT

 

The quality and resolution of the STFT depend on:

  • Window size (N):

    • Larger windows → Higher frequency resolution, lower time resolution.

    • Smaller windows → Higher time resolution, lower frequency resolution (Heisenberg-Gabor Uncertainty Principle).

  • Overlap:

    • To avoid information loss, an overlap of 50%-75% is typically used.

  • Window type:

    • Rectangular: Simple but with high side lobes (spectral leakage).

    • Hann/Hamming: Reduces leakage, better for spectral analysis.


 

Spectrogram: Visualizing the Time-Frequency Domain

 

spectrogram is a graphical representation of the STFT magnitude (∣X(t,f)∣):

  • X-axis: Time.

  • Y-axis: Frequency.

  • Color/Intensity: Energy (dB) or magnitude.

Example features in an audio spectrogram:

  • Formants: High-energy bands corresponding to vocal tract resonances.

  • Harmonics: Periodic components in musical instrument signals.

 

Types of Spectrograms

 

  • Amplitude Spectrogram: Displays ∣X(t,f)∣.

  • Power Spectrogram: Displays ∣X(t,f)∣2.

  • Log-Scale Spectrogram: Useful for audio, as human hearing perceives frequencies logarithmically (e.g., Mel scale).


 

Applications in Audio Processing

 

Audio Compression (e.g., MP3, AAC)

 

  • The STFT identifies irrelevant components (auditory masking) for selective compression.

 

Speech and Music Recognition

 

  • Extraction of MFCCs (Mel-Frequency Cepstral Coefficients), based on filter banks applied to the spectrogram.

 

Audio Synthesis and Modification

 

  • Time-Stretching & Pitch-Shifting (e.g., Phase Vocoder).

  • Noise Reduction: Time-frequency filtering (e.g., Wiener Filtering).

 

Musical Instrument Analysis

 

  • Identification of harmonic partials and transients.


 

Limitations and Alternatives

 

Limitations of the STFT

 

  • Time-frequency trade-off: Cannot achieve high resolution in both domains simultaneously.

  • Smearing effect: Due to window convolution.

 

Alternative Methods

 

  • Wavelet Transform: Better time resolution for high frequencies.

  • Spectral Component Analysis (PCA/ICA): For complex signals.

  • Wigner-Ville Distribution: Higher precision but with cross-term interference.


 

Conclusions

 

Fourier analysis in the time-frequency domain (STFT + Spectrogram) is a powerful technique for audio processing, enabling:

  • Visualization of frequency evolution.

  • Feature extraction for machine learning.

  • Sound modification and synthesis.

However, its effectiveness depends on proper parameter selection (window type, overlap, FFT size). For advanced applications, methods like wavelets or neural network-based models (e.g., TF-GAN, Wavenet) can be explored.


References

  • Oppenheim, A. V., & Schafer, R. W. (2010). Discrete-Time Signal Processing.

  • Smith, J. O. (2007). Spectral Audio Signal Processing.

  • Mallat, S. (2008). A Wavelet Tour of Signal Processing.

 
 
 
 
 
 

Explore Other Topics

Acoustic Science

Dive into the principles and theories that define sound and its behavior in different environments.

Audio Technology

Stay updated with the latest advancements and innovations in audio equipment and software.

Sound Design

Explore the art and techniques of creating immersive soundscapes for various media.

Music Production

Learn about the processes and tools used in crafting and producing music.

Environmental Acoustics

Understand how sound interacts with natural and urban environments.

Psychoacoustics

Study how humans perceive and interpret different sounds.

Audio Engineering

Gain insights into the technical aspects of recording and manipulating sound.

Sound History

Discover the evolution of sound technology and its impact on society.

Featured Articles

    Tecnare

    Loudspeakers Series

    E Series

    IBZA Series

    V Series

    ALIS Series

    Array Series

    SW Series

    KT Series

    TANIT Series

    CS Series

    Amplifiers Series

    Digital Processors

    Accessories

    AUDIO

    REINVENTING THE RULES

    Tecnare

    Tecnare Sound Systems horizontal white Logo
    Privacy Overview

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.