Transfer learning with audioSet to voice pathologies identification in continuous speech

Guedes, Victor; Teixeira, Felipe; Oliveira, Alessa Anjos de; Fernandes, Joana Filipa Teixeira; Silva, Letícia; Candido Junior, Arnaldo; Teixeira, João Paulo

http://hdl.handle.net/10198/21796

Utilize este identificador para referenciar este registo.

Nome:	Descrição:	Tamanho:	Formato:
artigopublicado_HCIST2019_TransferLearning.pdf		865.27 KB	Adobe PDF	Ver/Abrir

Contacte-nos

Autores

Guedes, Victor

Teixeira, Felipe

Oliveira, Alessa Anjos de

Fernandes, Joana Filipa Teixeira

Silva, Letícia

Candido Junior, Arnaldo

Teixeira, João Paulo

Resumo(s)

The classification of pathological diseases with the implementation of concepts of Deep Learning has been increasing considerably in recent times. Among the works developed there are good results for the classification in sustained speech with vowels, but few related works for the classification in continuous speech. This work uses the German Saarbrücken Voice Database with the phrase “Guten Morgen, wie geht es Ihnen?” to classify four classes: dysphonia, laryngitis, paralysis of vocal cords and healthy voices. Transfer learning concepts were used with the AudioSet database. Two models were developed based on Long-Short-Term-Memory and Convolutional Network for classification of extracted embeddings and comparison of the best results, using cross-validation. The final results allowed to obtaining 40% of f1-score for the four classes, 66% f1-score for Dysphonia x Healthy, 67% for Laryngitis x healthy and 80% for Paralysis x Healthy.

Palavras-chave

Long short term memory Convolutional neural network SVD Deep learning Voice pathologies diagnose

URI

http://hdl.handle.net/10198/21796

Citação

Guedes, Victor; Teixeira, Felipe; Oliveira, Alessa; Fernandes, Joana; Silva, Leticia; Junior, Arnaldo; Teixeira, João Paulo (2019). Transfer learning with audioSet to voice pathologies identification in continuous speech. In International Conference on ENTERprise Information Systems, International Conference on Project MANagement. Tunisia. 164, p. 662-669

Editora

Elsevier

DOI

10.1016/j.procs.2019.12.233

Coleções

ESTiG - Publicações em Proceedings Indexadas à WoS/Scopus

Licença CC

cclicense-by

Métricas Alternativas

Ver registo completo