Digital Audio Processing Methods for Voice Pathology Detection

Voice pathology is a diverse field that includes various disorders affecting vocal quality and production. Using audio machine learning for voice pathology classification represents an innovative approach to diagnosing a wide range of voice disorders. Despite extensive research in this area, there remains a significant gap in the development of classifiers and their ability to adapt and generalize effectively. This thesis aims to address this gap by contributing new insights and methods. This research provides a comprehensive exploration of automatic voice pathology classification, focusing on challenges such as data limitations and the potential of integrating multiple modalities to enhance diagnostic accuracy and adaptability. To achieve generalization capabilities and enhance the flexibility of the classifier across diverse types of voice disorders, this research explores various datasets and pathology types comprehensively. It covers a broad range of voice disorders, including functional dysphonia, ...

Ioanna Miliaresi — University of Pireaus


Machine Learning-Aided Monitoring and Prediction of Respiratory and Neurodegenerative Diseases Using Wearables

This thesis focuses on wearables for health status monitoring, covering applications aimed at emergency solutions to the COVID-19 pandemic and aging society. The methods of ambient assisted living (AAL) are presented for the neurodegenerative disease Parkinson’s disease (PD), facilitating ’aging in place’ thanks to machine learning and around wearables - solutions of mHealth. Furthermore, the approaches using machine learning and wearables are discussed for early-stage COVID-19 detection, with encouraging accuracy. Firstly, a publicly available dataset containing COVID-19, influenza, and healthy control data was reused for research purposes. The solution presented in this thesis is considering the classification problem and outperformed the state-of-the-art methods, whereas the original paper introduced just anomaly detection and not shown the specificity of the created models. The proposed model in the thesis for early detection of COVID-19 achieved 78 % for the k-NN classifier. Moreover, a ...

Justyna Skibińska — Brno University of Technology & Tampere University


Multimodal signal analysis for unobtrusive characterization of obstructive sleep apnea

Obstructive sleep apnea (OSA) is the most prevalent sleep related breathing disorder, nevertheless subjects suffering from it often remain undiagnosed due to the cumbersome diagnosis procedure. Moreover, the prevalence of OSA is increasing and a better phenotyping of patients is needed in order to prioritize treatment. The goal of this thesis was to tackle those challenges in OSA diagnosis. Additionally, two main algorithmic contributions which are generally applicable were proposed within this thesis. The binary interval coded scoring algorithm was extended to multilevel problems and novel monotonicity constraints were introduced. Moreover, improvements to the random-forest based feature selection were proposed including the use of the Cohen’s kappa value, patient independent validation, and further feature pruning steered by the correlation between features. These novel methods were applied together with classification and feature selection methods from the literature to improve the OSA ...

Deviaene, Margot — KU Leuven


Automated quantification of preterm brain maturation using electroencephalography

Around 10 percent of all human births is premature, which means that annually about 15 million babies are born before 37 completed weeks of gestation. About one third of the admissions to the Neonatal Intensive Care Unit (NICU) consists of this patient group. Due to complications, 1 million babies die from premature delivery, and it is therefore the most important cause of neonatal death. In general, premature and immature babies have a high risk for neurological abnormalities by maturation in extra-uterine life. Even though improved health care has increased the survival changes of these neonates, they are sensitive to brain damage and consequently, neurocognitive disabilities. Nowadays, critical information about the brain development can be extracted from the electroencephalography (EEG). Clinical experts visually assess evolving EEG characteristics over both short and long periods to evaluate maturation of patients at risk and, ...

Koolen, Ninah — KU Leuven


Spectral Variability in Hyperspectral Unmixing: Multiscale, Tensor, and Neural Network-based Approaches

The spectral signatures of the materials contained in hyperspectral images, also called endmembers (EMs), can be significantly affected by variations in atmospheric, illumination or environmental conditions typically occurring within an image. Traditional spectral unmixing (SU) algorithms neglect the spectral variability of the endmembers, what propagates significant mismodeling errors throughout the whole unmixing process and compromises the quality of the estimated abundances. Therefore, significant effort have been recently dedicated to mitigate the effects of spectral variability in SU. However, many challenges still remain in how to best explore a priori information about the problem in order to improve the quality, the robustness and the efficiency of SU algorithms that account for spectral variability. In this thesis, new strategies are developed to address spectral variability in SU. First, an (over)-segmentation-based multiscale regularization strategy is proposed to explore spatial information about the abundance ...

Borsoi, Ricardo Augusto — Université Côte d'Azur; Federal University of Santa Catarina


Machine Learning Methods for Recognizing Brain Disorders

Brain disorders represent a significant health challenge. It is estimated that approximately 165 million people suffer from a brain disorder in Europe, while 1 in 3 people will experience such a disorder during their lifetime. Some types of the brain disorders are the following: Alzheimer’s disease, dementias, epilepsy, Parkinson’s disease, Mental disorders, and more. These disorders affect the way people think, feel, or perform daily activities. However, if these disorders are diagnosed early and the person receives suitable medication, their progression may be delayed. For this reason, early diagnosis is crucial. Artificial Intelligence (AI) holds the promise of transforming how we tackle societal issues and enhancing the welfare of both individuals and communities. “AI for Social Good”, also known as “AI for Social Impact” is a new research field aiming to tackle some of the most important social, environmental, and ...

Ilias, Loukas — National Technical University of Athens


Automatic Detection, Classification and Restoration of Defects in Historical Images

Historical photos are significant attestations of the inheritance of the past. Since Photography is an art that is more than 150 years old, more and more diffuse are the photographic archives all over the world. Nevertheless, time and bad preservation corrupts physical supports, and many important historical documents risk to be ruined and their content lost. Therefore solutions must be implemented to preserve their state and to recover damaged information. This PhD thesis proposes a general methodology, and several applicative solutions, to handle these problems, by means of digitization and digital restoration. The purpose is to create a useful tool to support non-expert users in the restoration process of damaged historical images. The content of this thesis is outlined as follows: Chapter 1 gives an overview on the problems related to management and preservation of cultural repositories, and discusses about ...

Mazzola, Giuseppe — Università degli studi di Palermo - Dipartimento di Ingegneria Informatica


Biomechanics based analysis of sleep

The fact that a third of a human life is spent in a bed indicates the essential character of sleep. While some people might opt voluntarily for sleep deprivation, others don’t get to choose. Their healthy pattern of sleep is disrupted due to sleep disorders such as sleep apnea, insomnia and restless legs syndrome. Most clinical diagnoses revolve around complaints of excessive daytime sleepiness. People usually wait quite long however before contacting professional help, and might only do so when complaints have gone from minor to serious. It can be argued that people with minor complaints will have negligible compliance to rather obtrusive therapies, and should not be treated with pharmaceuticals. However, cognitive and behavioral therapy has proven its effectiveness for clinically diagnosed patients in different domains, and might thus also enhance the quality of life for people with minor ...

Willemen, Tim — KU Leuven


Tissue Characterisation from Intravascular Ultrasound using Texture Analysis

Intravascular ultrasound has, over the past decade, significantly changed the clinical diagnosis and therapeutic strategy of coronary and vascular disease assessment, as it not only allows visualisation of the vessel lumen, but gives a unique view of the pathophysiologic structure of the artery wall. This information is currently unavailable from the universally accepted instrument for artery assessment, angiography, which has on several occasions had its diagnostic accuracy questioned. With intravascular ultrasound, there is the potential to categorise diseased arterial tissue belonging to distinct pathological groups which can ultimately aid in the understanding of individual lesions as well as making a significant contribution to treatment choice and management of cardiac patients. The high resolution image information offered by intravascular ultrasound provides excellent crosssectional views of coronary artery disease at the level of the disease process itself. This information can be used ...

Nailon, William Henry — University Of Edinburgh


Low Complexity Image Recognition Algorithms for Handheld Devices

Content Based Image Retrieval (CBIR) has gained a lot of interest over the last two decades. The need to search and retrieve images from databases, based on information (“features”) extracted from the image itself, is becoming increasingly important. CBIR can be useful for handheld image recognition devices in which the image to be recognized is acquired with a camera, and thus there is no additional metadata associated to it. However, most CBIR systems require large computations, preventing their use in handheld devices. In this PhD work, we have developed low-complexity algorithms for content based image retrieval in handheld devices for camera acquired images. Two novel algorithms, ‘Color Density Circular Crop’ (CDCC) and ‘DCT-Phase Match’ (DCTPM), to perform image retrieval along with a two-stage image retrieval algorithm that combines CDCC and DCTPM, to achieve the low complexity required in handheld devices ...

Ayyalasomayajula, Pradyumna — EPFL


Ultra low-power biomedical signal processing: an analog wavelet filter approach for pacemakers

The purpose of this thesis is to describe novel signal processing methodologies and analog integrated circuit techniques for low-power biomedical systems. Physiological signals, such as the electrocardiogram (ECG), the electroencephalogram (EEG) and the electromyogram (EMG) are mostly non-stationary. The main difficulty in dealing with biomedical signal processing is that the information of interest is often a combination of features that are well localized temporally (e.g., spikes) and others that are more diffuse (e.g., small oscillations). This requires the use of analysis methods sufficiently versatile to handle events that can be at opposite extremes in terms of their time-frequency localization. Wavelet Transform (WT) has been extensively used in biomedical signal processing, mainly due to the versatility of the wavelet tools. The WT has been shown to be a very efficient tool for local analysis of nonstationary and fast transient signals due ...

Haddad, Sandro Augusto Pavlík — Delft University of Technology


Transformation methods in signal processing

This dissertation is concerned with the application of the theory of rational functions in signal processing. The PhD thesis summarizes the corresponding results of the author’s research. Since the systems of rational functions are defined by the collection of inverse poles with multiplicities, the following parameters should be determined: the number, the positions and the multiplicities of the inverse poles. Therefore, we develop the hyperbolic variant of the so-called Nelder–Mead and the particle swarm optimization algorithm. In addition, the latter one is integrated into a more general multi-dimensional framework. Furthermore, we perform a detailed stability and error analysis of these methods. We propose an electrocardiogram signal generator based on spline interpolation. It turns to be an efficient tool for testing and evaluating signal models, filtering techniques, etc. In this thesis, the synthesized heartbeats are used to test the diagnostic distortion ...

Kovács, Péter — Eötvös L. University, Budapest, Hungary


Improvements in Pose Invariance and Local Description for Gabor-based 2D Face Recognition

Automatic face recognition has attracted a lot of attention not only because of the large number of practical applications where human identification is needed but also due to the technical challenges involved in this problem: large variability in facial appearance, non-linearity of face manifolds and high dimensionality are some the most critical handicaps. In order to deal with the above mentioned challenges, there are two possible strategies: the first is to construct a “good” feature space in which the manifolds become simpler (more linear and more convex). This scheme usually comprises two levels of processing: (1) normalize images geometrically and photometrically and (2) extract features that are stable with respect to these variations (such as those based on Gabor filters). The second strategy is to use classification structures that are able to deal with non-linearities and to generalize properly. To ...

Gonzalez-Jimenez, Daniel — University of Vigo


Advanced models for monitoring stress and development trajectories in premature infants

This thesis focuses on the design of various automatic signal processing algorithms to extract information from physiological signals of preterm infants. Overall, the aim was to improve the neurodevelopmental outcome of the neonate. More specifically, three main research objectives were carried out. The first objective was to describe the maturation of neonates during their stay in the neonatal intensive care unit. The second objective was to assess the stress and pain in premature infants and their impact on the development of neonates. The third objective was to predict developmental disabilities, such as autism. The first part of this thesis presents an extensive overview of various developmental models to describe the maturation of premature infants. Three main strategies were proposed. The first strategy proposed an investigation of EEG connectivity networks. A variety of functional and effective connectivity methods were combined with ...

Lavanga, Mario — KU Leuven


Discrete-time speech processing with application to emotion recognition

The subject of this PhD thesis is the efficient and robust processing and analysis of the audio recordings that are derived from a call center. The thesis is comprised of two parts. The first part is dedicated to dialogue/non-dialogue detection and to speaker segmentation. The systems that are developed are prerequisite for detecting (i) the audio segments that actually contain a dialogue between the system and the call center customer and (ii) the change points between the system and the customer. This way the volume of the audio recordings that need to be processed is significantly reduced, while the system is automated. To detect the presence of a dialogue several systems are developed. This is the first effort found in the international literature that the audio channel is exclusively exploited. Also, it is the first time that the speaker utterance ...

Kotti, Margarita — Aristotle University of Thessaloniki

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.