|
DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models
|
|
K. Wilkinghoff, Z.-H. Tan. arXiv:2509.13927, under review.
|
|
Quantization-Based Score Calibration for Few-Shot Keyword Spotting with Dynamic Time Warping in Noisy Environments
|
|
K. Wilkinghoff, A. Cornaggia-Urrigshardt, Z.-H. Tan. arXiv:2510.15432, under review.
|
|
Local Density-Based Anomaly Score Normalization for Domain Generalization
|
|
K. Wilkinghoff, H. Yang, J. Ebbers, F. G. Germain, G. Wichern, J. Le Roux. arXiv:2509.10951, accepted for publication in IEEE ACM Trans. Audio Speech Lang. Process.
|
|
Keeping the Balance: Anomaly Score Calculation for Domain Generalization
|
|
K. Wilkinghoff, H. Yang, J. Ebbers, F. G. Germain, G. Wichern, J. Le Roux. ICASSP.
|
|
No Class Left Behind: A Closer Look at Class Balancing for Audio Tagging
|
|
J. Ebbers, F. G. Germain, K. Wilkinghoff, G. Wichern, J. Le Roux. ICASSP.
|
|
Handling Domain Shifts for Anomalous Sound Detection: A Review of DCASE-Related Work
|
|
K. Wilkinghoff, T. Fujimura, K. Imoto, J. Le Roux, Z.-H. Tan, T. Toda. DCASE.
|
|
ASDKit: A Toolkit for Comprehensive Evaluation of Anomalous Sound Detection Methods
|
|
T. Fujimura, K. Wilkinghoff, K. Imoto, T. Toda. DCASE.
|
|
Personalized Speech Synthesis for Zero-Shot Keyword Spotting
|
|
F. Gökgöz, A. Cornaggia-Urrigshardt, K. Wilkinghoff. ITG Speech.
|
|
F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection
|
|
K. Wilkinghoff, K. Imoto. ICASSP.
|
|
Self-Supervised Learning for Anomalous Sound Detection
|
|
K. Wilkinghoff. ICASSP.
|
|
TACos: Learning Temporally Structured Embeddings for Few-Shot Keyword Spotting with Dynamic Time Warping
|
|
K. Wilkinghoff, A. Cornaggia-Urrigshardt. ICASSP.
|
|
Multi-Sample Dynamic Time Warping for Few-Shot Keyword Spotting
|
|
K. Wilkinghoff, A. Cornaggia-Urrigshardt. EUSIPCO.
|
|
AdaProj: Adaptively Scaled Angular Margin Subspace Projections for Anomalous Sound Detection with Auxiliary Classification Tasks
|
|
K. Wilkinghoff. DCASE.
|
|
Analyzing the Impact of HF-Specific Signal Degradation on Automatic Speech Recognition
|
|
F. Fritz, A. Cornaggia-Urrigshardt, L. Henneke, F. Kurth, K. Wilkinghoff. ICMCIS.
|
|
Strong Label Generation for Preparing Speech Data in Military Applications Using CTC Loss
|
|
F. Gökgöz, A. Cornaggia-Urrigshardt, K. Wilkinghoff. ICMCIS.
|
|
Design Choices for Learning Embeddings from Auxiliary Tasks for Domain Generalization in Anomalous Sound Detection
|
|
K. Wilkinghoff. ICASSP.
|
|
Novel Generative Classifier for Acoustic Events
|
|
P. M. Baggenstoss, K. Wilkinghoff. EUSIPCO.
|
|
On Using Pre-Trained Embeddings for Detecting Anomalous Sounds with Limited Training Data
|
|
K. Wilkinghoff, F. Fritz. EUSIPCO.
|
|
Language Recognition for SSB modulated HF Radio Signals of Short Duration
|
|
A. Cornaggia-Urrigshardt, F. Fritz, L. Henneke, F. Kurth, C. Schlich, K. Wilkinghoff. ITG Speech.
|
|
Towards Human-Machine Integration for Signal Intelligence Applications
|
|
J. D. Rockbach, L.-F. Bluhm, I. Schlangen, L. Over, S. Apfeld, L. Henneke, K. Wilkinghoff. SDF.
|
|
SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data
|
|
A. Cornaggia-Urrigshardt, N. Jarocky, F. Kurth, S. Urrigshardt, K. Wilkinghoff. GI-Jahrestagung.
|
|
Speech Recognition Lab
|
|
A. Cornaggia-Urrigshardt, F. Gökgöz, F. Kurth, H.-C. Schmitz, K. Wilkinghoff. ICMCIS.
|
|
On Open-Set Classification with L3-Net Embeddings for Machine Listening Applications.
|
|
K. Wilkinghoff. EUSIPCO.
|
|
Using Look, Listen, and Learn Embeddings for Detecting Anomalous Sounds in Machine Condition Monitoring
|
|
K. Wilkinghoff. DCASE.
|
|
On Open-Set Speaker Identification with I-Vectors
|
|
K. Wilkinghoff. Odyssey.
|
|
Towards Robust Speech Interfaces for the ISS
|
|
H.-C. Schmitz, F. Kurth, K. Wilkinghoff, U. Müllerschkowski, C. Karrasch, V. Schmid. IUI Companion.
|
|
Robust Detection of Jittered Multiply Repeating Audio Events Using Iterated Time-Warped ACF
|
|
F. Kurth, K. Wilkinghoff. ICASSP.
|
|
General-Purpose Audio Tagging by Ensembling Convolutional Neural Networks based on Multiple Features
|
|
K. Wilkinghoff. DCASE.
|
|
Accurately Capturing Speech Feature Distributions by Extending Supervectors for Robust Speaker Recognition
|
|
K. Wilkinghoff. ITG Speech.
|
|
Robust Speaker Identification by Fusing Classification Scores with a Neural Network
|
|
K. Wilkinghoff, P. M. Baggenstoss, A. Cornaggia-Urrigshardt, F. Kurth. ITG Speech.
|