Publications

Publications

Explore a selection of our publications, reflecting our research, insights, and ongoing work throughout all faculty at The Center Of Neuromusculoskeletal Research.

  • De Silva, Upeka, Samaneh Madanian, Ajit Narayanan, John Michael Templeton, Christian Poellabauer, Sandra L Schneider, and Rahmina Rubaiat. (2025) 2025. “A Proof-of-Concept Development on Speech Analysis for Concussion Detection.”. Studies in Health Technology and Informatics 329: 1008-12. https://doi.org/10.3233/SHTI250991.

    Speech signal analysis to support objective clinical decision-making has gained immense interest, especially in neurological disorders. This research assessed the feasibility of speech analysis on the detection of concussions. Using a speech dataset from 82 concussed and 82 healthy participants, we extracted two speech feature sets focusing on Mel Frequency Cepstral Coefficients (MFCCs) to characterize speech articulation. A machine learning pipeline was developed to discriminate concussion speech from healthy speech by applying Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Decision Tree (DT) classifiers. All three classifiers trained on the MFCC-based feature set achieved Matthew's correlation coefficient score above 0.5 on the holdout data set. DT model achieved a 78% sensitivity and 75% specificity. The findings of this research serve as proof-of-concept for speech analysis of concussion detection.

  • Madanian, Samaneh, Olayinka Adeleye, John Michael Templeton, Talen Chen, Christian Poellabauer, Enshi Zhang, and Sandra L Schneider. (2025) 2025. “A Multi-Dilated Convolution Network for Speech Emotion Recognition.”. Scientific Reports 15 (1): 8254. https://doi.org/10.1038/s41598-025-92640-2.

    Speech emotion recognition (SER) is an important application in Affective Computing and Artificial Intelligence. Recently, there has been a significant interest in Deep Neural Networks using speech spectrograms. As the two-dimensional representation of the spectrogram includes more speech characteristics, research interest in convolution neural networks (CNNs) or advanced image recognition models is leveraged to learn deep patterns in a spectrogram to effectively perform SER. Accordingly, in this study, we propose a novel SER model based on the learning of the utterance-level spectrogram. First, we use the Spatial Pyramid Pooling (SPP) strategy to remove the size constraint associated with the CNN-based image recognition task. Then, the SPP layer is deployed to extract both the global-level prominent feature vector and multi-local-level feature vector, followed by an attention model to weigh the feature vectors. Finally, we apply the ArcFace layer, typically used for face recognition, to the SER task, thereby obtaining improved SER performance. Our model achieved an unweighted accuracy of 67.9% on IEMOCAP and 77.6% on EMODB datasets.

  • Templeton, John Michael, Christian Poellabauer, Sandra Schneider, Morteza Rahimi, Taofeek Braimoh, Fhaheem Tadamarry, Jason Margolesky, Shanna Burke, and Zeina Al Masry. (2025) 2025. “Modernizing the Staging of Parkinson Disease Using Digital Health Technology.”. Journal of Medical Internet Research 27: e63105. https://doi.org/10.2196/63105.

    Due to the complicated nature of Parkinson disease (PD), a number of subjective considerations (eg, staging schemes, clinical assessment tools, or questionnaires) on how best to assess clinical deficits and monitor clinical progression have been published; however, none of these considerations include a comprehensive, objective assessment of all functional areas of neurocognition affected by PD (eg, motor, memory, speech, language, executive function, autonomic function, sensory function, behavior, and sleep). This paper highlights the increasing use of digital health technology (eg, smartphones, tablets, and wearable devices) for the classification, staging, and monitoring of PD. Furthermore, this Viewpoint proposes a foundation for a new staging schema that builds from multiple clinically implemented scales (eg, Hoehn and Yahr Scale and Berg Balance Scale) for ease and homogeneity, while also implementing digital health technology to expand current staging protocols. This proposed staging system foundation aims to provide an objective, symptom-specific assessment of all functional areas of neurocognition via inherent device capabilities (eg, device sensors and human-device interactions). As individuals with PD may manifest different symptoms at different times across the spectrum of neurocognition, the modernization of assessments that include objective, symptom-specific monitoring is imperative for providing personalized medicine and maintaining individual quality of life.

  • De Silva, Upeka, Samaneh Madanian, Sharon Olsen, John Michael Templeton, Christian Poellabauer, Sandra L Schneider, Ajit Narayanan, and Rahmina Rubaiat. (2025) 2025. “Clinical Decision Support Using Speech Signal Analysis: Systematic Scoping Review of Neurological Disorders.”. Journal of Medical Internet Research 27: e63004. https://doi.org/10.2196/63004.

    BACKGROUND: Digital biomarkers are increasingly used in clinical decision support for various health conditions. Speech features as digital biomarkers can offer insights into underlying physiological processes due to the complexity of speech production. This process involves respiration, phonation, articulation, and resonance, all of which rely on specific motor systems for the preparation and execution of speech. Deficits in any of these systems can cause changes in speech signal patterns. Increasing efforts are being made to develop speech-based clinical decision support systems.

    OBJECTIVE: This systematic scoping review investigated the technological revolution and recent digital clinical speech signal analysis trends to understand the key concepts and research processes from clinical and technical perspectives.

    METHODS: A systematic scoping review was undertaken in 6 databases guided by a set of research questions. Articles that focused on speech signal analysis for clinical decision-making were identified, and the included studies were analyzed quantitatively. A narrower scope of studies investigating neurological diseases were analyzed using qualitative content analysis.

    RESULTS: A total of 389 articles met the initial eligibility criteria, of which 72 (18.5%) that focused on neurological diseases were included in the qualitative analysis. In the included studies, Parkinson disease, Alzheimer disease, and cognitive disorders were the most frequently investigated conditions. The literature explored the potential of speech feature analysis in diagnosis, differentiating between, assessing the severity and monitoring the treatment of neurological conditions. The common speech tasks used were sustained phonations, diadochokinetic tasks, reading tasks, activity-based tasks, picture descriptions, and prompted speech tasks. From these tasks, conventional speech features (such as fundamental frequency, jitter, and shimmer), advanced digital signal processing-based speech features (such as wavelet transformation-based features), and spectrograms in the form of audio images were analyzed. Traditional machine learning and deep learning approaches were used to build predictive models, whereas statistical analysis assessed variable relationships and reliability of speech features. Model evaluations primarily focused on analytical validations. A significant research gap was identified: the need for a structured research process to guide studies toward potential technological intervention in clinical settings. To address this, a research framework was proposed that adapts a design science research methodology to guide research studies systematically.

    CONCLUSIONS: The findings highlight how data science techniques can enhance speech signal analysis to support clinical decision-making. By combining knowledge from clinical practice, speech science, and data science within a structured research framework, future research may achieve greater clinical relevance.

  • Rahimi, Morteza, Zeina Al Masry, John Michael Templeton, Sandra Schneider, and Christian Poellabauer. (2025) 2025. “A Comprehensive Multifunctional Approach for Measuring Parkinson’s Disease Severity.”. Applied Clinical Informatics 16 (1): 11-23. https://doi.org/10.1055/a-2420-0413.

    OBJECTIVES:  This research study aims to advance the staging of Parkinson's disease (PD) by incorporating machine learning to assess and include a broader multifunctional spectrum of neurocognitive symptoms in the staging schemes beyond motor-centric assessments. Specifically, we provide a novel framework to modernize and personalize PD staging more objectively by proposing a hybrid feature scoring approach.

    METHODS:  We recruited 37 individuals diagnosed with PD, each of whom completed a series of tablet-based neurocognitive tests assessing motor, memory, speech, executive functions, and tasks ranging in complexity from single to multifunctional. Then, the collected data were used to develop a hybrid feature scoring system to calculate a weighted vector for each function. We evaluated the current PD staging schemes and developed a new approach based on the features selected and extracted using random forest and principal component analysis.

    RESULTS:  Our findings indicate a substantial bias in current PD staging systems toward fine motor skills, that is, other neurological functions (memory, speech, executive function, etc.) do not map into current PD stages as well as fine motor skills do. The results demonstrate that a more accurate and personalized assessment of PD severity could be achieved by including a more exhaustive range of neurocognitive functions in the staging systems either by involving multiple functions in a unified staging score or by designing a function-specific staging system.

    CONCLUSION:  The proposed hybrid feature score approach provides a comprehensive understanding of PD by highlighting the need for a staging system that covers various neurocognitive functions. This approach could potentially lead to more effective, objective, and personalized treatment strategies. Further, this proposed methodology could be adapted to other neurodegenerative conditions such as Alzheimer's disease or amyotrophic lateral sclerosis.