A Neural Architecture for Acoustic Question Answering

This repository contains the code used in the experiments for the paper (DOI 10.1109/TPAMI.2022.3194311):

@article{AbdelnourEtAl2023PAMI,
  author = 	 {Jérôme Abdelnour and Jean Rouat and Giampiero Salvi},
  title = 	 {NAAQA: A Neural Architecture for Acoustic Question Answering},
  journal = 	 {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year = 	 {2023},
  volume = 	 {45},
  number = 	 {4},
  pages = 	 {4997--5009},
  month = 	 apr,
}

The code implements a neural architecture for the acoustic question answering task defined by the CLEAR dataset. Please consider citing the paper if you find it useful.

Installing requirements (Ubuntu 20.04)

sudo apt install python3.8-venv
sudo apt install libpq-dev libhdf5-dev cython3 python-dev libfreetype6-dev

Optional: to sync automatically with Google Docs (this requires configuring ~/.config/rclone/rclone.conf):

sudo apt install rclone

Downloading the data

... assuming it is downloaded to ../data

Setting up for running

ln -s ../data .
python3 -m venv venv
ln -snf venv/bin/activate activate_venv
source activate_venv
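
The setup steps above can be sanity-checked with a short script. This is a minimal sketch and not part of the repository: `check_setup` is a hypothetical helper name, and it assumes it is run from the repository root after `source activate_venv`. It only verifies that a virtual environment is active and that the `data` and `activate_venv` links exist.

```python
import sys
from pathlib import Path


def check_setup(repo_root="."):
    """Return a dict of basic checks for the setup steps (hypothetical helper)."""
    root = Path(repo_root)
    return {
        # True when the interpreter is running inside a virtual environment
        "venv_active": sys.prefix != getattr(sys, "base_prefix", sys.prefix),
        # True when ./data exists and resolves to a directory (e.g. a symlink to ../data)
        "data_dir_present": (root / "data").is_dir(),
        # True when the convenience link created by `ln -snf` resolves to a file
        "activate_link_present": (root / "activate_venv").exists(),
    }


if __name__ == "__main__":
    for name, ok in check_setup().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

A `MISSING` entry points at the step to repeat, for example re-running `ln -s ../data .` if the data directory is not found.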

Torch 1.5 (older GPUs)

pip install -r requirements.txt

Torch 1.7 (newer GPUs requiring CUDA 11)

pip install -r requirements_torch1.7.txt -f https://download.pytorch.org/whl/torch_stable.html
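
After installing either requirements file, it can help to confirm which PyTorch build ended up in the virtual environment. The sketch below is not taken from the repository: `torch_report` is a hypothetical helper name, and it relies only on `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`. It returns `None` when PyTorch cannot be found.

```python
import importlib.util


def torch_report():
    """Summarize the installed PyTorch build, or return None if torch is absent."""
    if importlib.util.find_spec("torch") is None:
        return None
    import torch  # imported lazily so the script still runs without PyTorch

    return {
        "version": torch.__version__,                 # installed PyTorch version string
        "cuda_build": torch.version.cuda,             # CUDA version of the build, None for CPU-only
        "cuda_available": torch.cuda.is_available(),  # whether a usable GPU is visible
    }


if __name__ == "__main__":
    report = torch_report()
    if report is None:
        print("PyTorch is not installed in this environment")
    else:
        for key, value in report.items():
            print(f"{key}: {value}")
```

If `cuda_available` is false on a machine with a newer GPU, the Torch 1.7 requirements file with the CUDA 11 build is the likelier fit, per the section headings above.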
