HIGHLIGHTS
- who: Andreas Triantafyllopoulos, from Texas A&M University, United States, has published the article "Multistage linguistic conditioning of convolutional layers for speech emotion recognition" in the journal: (JOURNAL)
- what: In this contribution, the authors investigate how effective deep fusion of text and audio features is for categorical and dimensional speech emotion recognition (SER). They propose a novel multistage fusion method, in which the two information streams are integrated at several layers of a deep neural network (DNN), and contrast it with a single-stage method, in which the streams are merged at a single point. Inspired by such works, the authors propose . . .