Paderbox: A collection of utilities for audio / speech processing

This repository started in late 2014 as an internal development repository for the Communications Engineering Group at Paderborn University, Germany. Over the years it emerged to a collection of IO helper, feature extraction modules and numerous smaller tools adding functionality to Numpy, Pandas, and others.

The main purpose here is to limit code duplication across our other public repositories.

We ensured that most functions/ classes contain Python Docstrings such that automatic tooltips for most functions are supported. It was deliberately decided against a lengthy documentation: most emphasis is put on the Python Docstrings and code readability itself.

Examples

Without going through all functions, we here select two examples which demonstrate why we rely on this very implementation.

Short-time Fourier transform

The Short-time Fourier transform (STFT) is a widely used feature extraction method when dealing with time series such as audio/ speech. Most repositories, including Deep Learning frameworks such as TensorFlow, provide an STFT implementation. However, it is rarely seen, that these implementations allow an exact reconstruction when applying the STFT followed by an inverse STFT.

Two important issues often overseen are:

How do I need to calculate the biorthogonal reconstruction window when using any STFT window function?
How much padding depeding on STFT window length, DFT length, and shift is needed to compensate for fade-in, fade-out, and uneven signal length?

Our STFT implementation addresses aforementioned issues, can operate on any number of independent dimensions and is already battle tested in our publications on audio/ speech since 2015. Numerous STFT tests ensure that the code remains stable and in particular test for the aforementioned problems.

Fast access to the IPython audio player

The function paderbox.play.play() is a somewhat elaborated wrapper around IPython.display.Audio. A single function allows to play audio from the waveform, from the STFT signal, and from file. It therefore serves as a great tool within Jupyter Notebooks and helps for quick inspection of simulation results.

Installation

Install it from PyPI with pip

pip install paderbox[all]

The [all] flag is optional and indicates to install all dependencies. Remove it, when you want to have the minimal dependencies.

Alternatively, you can clone this repository and install it as follows

git clone https://github.com/fgnt/paderbox.git
cd paderbox
pip install --editable .[all]

How to cite?

There is no clear way how to cite this repository for research. However, we would be grateful for direct imports from this repository if you use, e.g., the STFT. We are also fine when you copy the code as long as it remains visible where you copied the code from.

If you use one of our other repositories relying on this work we would be thankful if you respect citation hints for that repository.

Name		Name	Last commit message	Last commit date
Latest commit History 2,677 Commits
.github/workflows		.github/workflows
bash		bash
paderbox		paderbox
scripts		scripts
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
azure-pipelines.yml		azure-pipelines.yml
jenkins.bash		jenkins.bash
jenkins_common.bash		jenkins_common.bash
maintenance.md		maintenance.md
pylint.cfg		pylint.cfg
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paderbox: A collection of utilities for audio / speech processing

Examples

Short-time Fourier transform

Fast access to the IPython audio player

Installation

How to cite?

About

Releases

Packages

Languages

License

AWS-BugBust-37/paderbox-37

Folders and files

Latest commit

History

Repository files navigation

Paderbox: A collection of utilities for audio / speech processing

Examples

Short-time Fourier transform

Fast access to the IPython audio player

Installation

How to cite?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages