SPolDB: an audio dataset for artificial intelligence-based identification of noise pollutants

S. Ganguly, H. Mukherjee, A. Dhar, M. Marciano, K. Roy

Research output: Contribution to journalArticlepeer-review

Abstract

Worldwide technological advancements have introduced machines for various tasks, from simple spice mixing to heavy drilling. Many of these devices produce multifarious sounds which are often over tolerable limits, catering to noise pollution. This rising threat affects biodiversity and human health, with visible impacts like the disappearance of birds and health complications. Artificial Intelligence (AI) has been employed to detect sound pollutants, aiding in soundscape mapping for better urban planning and biodiversity protection. However, data scarcity remains a challenge for training machine learning models. We present SPolDB, a comprehensive sound pollution dataset with 54 classes from indoor and outdoor sources, featuring over 133,000 clips of varying lengths. The dataset was composed using recordings in the natural ambiance as well as sourcing audio clips of real-world scenarios. Baseline results are reported using an established handcrafted feature-based approach (Mel Frequency Cepstral Coefficient + Random Forest/Multi-layered Perceptron) and deep learning approach. The highest balanced accuracy (to mitigate bias introduced by class imbalance) of 82.9% was obtained on the test set (20% data for each class) leveraging a customized lightweight Convolutional Neural Network architecture named XZ-Net along with Mel Spectrogram.

Original languageEnglish (US)
Article number105261
Pages (from-to)10651-10668
JournalInternational Journal of Environmental Science and Technology
Volume22
Issue number11
DOIs
StatePublished - Jul 2025

Keywords

  • Deep learning
  • Handcrafted features
  • Indoor noise
  • Noise pollution
  • Outdoor noise

ASJC Scopus subject areas

  • Environmental Engineering
  • Environmental Chemistry
  • General Agricultural and Biological Sciences

Fingerprint

Dive into the research topics of 'SPolDB: an audio dataset for artificial intelligence-based identification of noise pollutants'. Together they form a unique fingerprint.

Cite this