Hyperbolic embedding using AudioSet ontology
Embed AudioSet ontology to Poincare disk.
Embed AudioSet ontology to Poincare disk.
Published in EUSIPCO, 2021
Published in ICASSP, 2021
This paper propose an IDLMA extension, empirical Bayesian IDLMA (EB-IDLMA) to implicitly consider the reliability of the estimated source power spectrograms for the estimation of demixing filters through the hyperparameters of the prior distribution estimated by the DNN.
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 2680-2694, 2023, 2023
This paper proposes PoP-IDLMA, an extension of independent deeply learned matrix analysis (IDLMA).
Published in ASJ Autumn Meeting, 2024
In this paper, we investigate the performance of various text-speech alignment methods to build a high-quality emotional parallel text-to-speech system.