FlowDec (ICLR 2025) is a full-band audio codec for general audio sampled at 48 kHz that combines non-adversarial codec training with a stochastic postfilter based on a novel conditional flow matching method.
From abstract:
Quote
Compared to the prior work ScoreDec which is based on score matching, we generalize from speech to general audio and move from 24 kbit/s to as low as 4 kbit/s, while improving output quality and reducing the required postfilter DNN evaluations from 60 to 6 without any fine-tuning or distillation techniques.
DEMO: https://sp-uhh.github.io/FlowDec/
License: Creative Commons Attribution-NonCommercial 4.0 International License
Official git: https://github.com/facebookresearch/FlowDec#readme