Publicação
Nonperiodic pathologic voice signals classification using mel-spectrogram and VGGish
| datacite.subject.fos | Engenharia e Tecnologia | |
| dc.contributor.author | Fernandes, Joana | |
| dc.contributor.author | Pinto, João | |
| dc.contributor.author | Moura, Carla | |
| dc.contributor.author | Vilarinho, Helena | |
| dc.contributor.author | Teixeira, Felipe | |
| dc.contributor.author | Freitas, D. | |
| dc.contributor.author | Teixeira, João Paulo | |
| dc.date.accessioned | 2026-05-18T16:22:00Z | |
| dc.date.available | 2026-05-18T16:22:00Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | In this work and the literature, voice signals can be classified as peri-odic (type 1) or either some periodicity (type 2) and chaos (type 3). This work aims to classify signs into types 1, 2 or 3 to be subsequently applied in a classifi-cation system for pathological/control signs. The original dataset is composed of 466 type 1 individuals, 900 type 2 individuals, and 84 type 3 individuals classi-fied by an otolaryngologist. 15% of the data was used for testing and the remain-ing 85% was used for training and validation. A data augmentation technique was applied to balance the data in training set. Therefore, for the test set, 3380 sounds were used, 1020 type 1, 1280 type 2 and 1080 type 3. Of these, 80% were used for training and 20% for validation. The Mel spectrograms of the signals were used in the input of a VGGish to retrain the model in classifying the 3 types of signals. Regarding test accuracy, this network obtained 71.2%. | por |
| dc.description.sponsorship | This work was supported by national funds through FCT/MCTES (PIDDAC): CeDRI, UIDB/05757/2020 (DOI: 10.54499/UIDB/05757/2020) and UIDP/05757/2020 (DOI: 10.54499/UIDP/05757/2020); and SusTEC, LA/P/0007/2020 (DOI: 10.54499/LA/P/0007/2020) and 2021.04729.BD (DOI: 10.54499/2021.04729.BD). | |
| dc.identifier.citation | Fernandes, Joana; Pinto, João; Moura, Carla; Vilarinho, Helena; Teixeira, Felipe; Freitas, D.; Teixeira, João Paulo (2025) Nonperiodic pathologic voice signals classification using mel-spectrogram and VGGish. In International Conference on Demographic Transition, Health and Technologies, ICDTHT 2025. p. 3-13. ISBN 978-303194900-5. DOI: 10.1007/978-3-031-94901-2_1 | |
| dc.identifier.doi | 10.1007/978-3-031-94901-2_1 | |
| dc.identifier.isbn | 978-303194900-5 | |
| dc.identifier.uri | http://hdl.handle.net/10198/36722 | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.publisher | Springer Nature Switzerland | |
| dc.relation | Research Centre in Digitalization and Intelligent Robotics | |
| dc.relation | Associate Laboratory for Sustainability and Tecnology in Mountain Regions - LA/P/0007/2020 | |
| dc.relation.ispartof | Springer Proceedings in Business and Economics | |
| dc.relation.ispartof | Health Technologies and Demographic Challenges | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.title | Nonperiodic pathologic voice signals classification using mel-spectrogram and VGGish | por |
| dc.type | conference object | |
| dspace.entity.type | Publication | |
| oaire.awardNumber | UIDB/05757/2020 | |
| oaire.awardNumber | LA/P/0007/2020 | |
| oaire.awardTitle | Research Centre in Digitalization and Intelligent Robotics | |
| oaire.awardTitle | Associate Laboratory for Sustainability and Tecnology in Mountain Regions - LA/P/0007/2020 | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F05757%2F2020/PT | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/LA%2FP%2F0007%2F2020/PT | |
| oaire.citation.endPage | 13 | |
| oaire.citation.startPage | 3 | |
| oaire.citation.title | International Conference on Demographic Transition, Health and Technologies, ICDTHT 2025 | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| oaire.version | http://purl.org/coar/version/c_b1a7d7d4d402bcce | |
| person.familyName | Teixeira | |
| person.familyName | Teixeira | |
| person.givenName | Felipe | |
| person.givenName | João Paulo | |
| person.identifier | 663194 | |
| person.identifier.ciencia-id | 0E17-62FB-AA17 | |
| person.identifier.ciencia-id | 4F15-B322-59B4 | |
| person.identifier.orcid | 0000-0002-6679-5702 | |
| person.identifier.rid | N-6576-2013 | |
| person.identifier.scopus-author-id | 57069567500 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| relation.isAuthorOfPublication | 764c5209-b9ab-479e-b5be-59fbe07c784b | |
| relation.isAuthorOfPublication | 33f4af65-7ddf-46f0-8b44-a7470a8ba2bf | |
| relation.isAuthorOfPublication.latestForDiscovery | 764c5209-b9ab-479e-b5be-59fbe07c784b | |
| relation.isProjectOfPublication | 6e01ddc8-6a82-4131-bca6-84789fa234bd | |
| relation.isProjectOfPublication | 6255046e-bc79-4b82-8884-8b52074b4384 | |
| relation.isProjectOfPublication.latestForDiscovery | 6e01ddc8-6a82-4131-bca6-84789fa234bd |
