Section 01
F5-TTS-DPS: Guide to the Winning Solution for WildSpoof2026 TTS Track
This article introduces F5-TTS-DPS, the winning solution for the TTS track of the WildSpoof 2026 Challenge. Based on the F5-TTS architecture, this model incorporates Exponential Moving Average (EMA) and a dual-score prompt selection mechanism. It achieved the best a-DCF scores on three advanced SASV detection systems, generating speech with high naturalness that is difficult to detect.
Original author team: WildSpoof 2026 TTS track participating team Source platform: arXiv Release date: May 22, 2026 Original link: http://arxiv.org/abs/2605.23859v1
Keywords: TTS, speech synthesis, anti-spoofing detection, EMA, prompt selection, WildSpoof, F5-TTS, deepfake, speech security