Audio deepfakes: a survey

SUMMARY

Others have improved the networks used, or have worked on both the networks and the features (Lavrentyeva et al., 2017; Nagarsheth et al., 2017; Gonzalez-Rodriguez et al., 2018; Huang and Pun, 2019, 2020; Lai et al., 2019; Li et al., 2019). Evaluations of different audio deepfake frameworks show that the Mean Opinion Score (MOS) of the generated audio is better when the framework is trained on single-speaker datasets (Oord et al., 2016; Ping et al., 2018; Kumar et al., 2019; Kong et al., 2020). The frameworks including their code as well as the datasets used that are available publicly (Sotelo . . .
