Spatio-temporal characterization of the surface electrocardiogram for catheter ablation outcome prediction in persistent atrial fibrillation

Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia encountered in clinical practice, and one of the main causes of ictus and strokes. Despite the advances in the comprehension of its mechanisms, its thorough characterization and the quantification of its effects on the human heart are still an open issue. In particular, the choice of the most appropriate therapy is frequently a hard task. Radiofrequency catheter ablation (CA) is becoming one of the most popular solutions for the treatment of the disease. Yet, very little is known about its impact on heart substrate during AF, thus leading to an inaccurate selection of positive responders to therapy and a low success rate; hence, the need for advanced signal processing tools able to quantify AF impact on heart substrate and assess the effectiveness of the CA therapy in an objective and ...

Marianna Meo — Université Nice Sophia Antipolis


Spatio-Temporal Speech Enhancement in Adverse Acoustic Conditions

Never before has speech been captured as often by electronic devices equipped with one or multiple microphones, serving a variety of applications. It is the key aspect in digital telephony, hearing devices, and voice-driven human-to-machine interaction. When speech is recorded, the microphones also capture a variety of further, undesired sound components due to adverse acoustic conditions. Interfering speech, background noise and reverberation, i.e. the persistence of sound in a room after excitation caused by a multitude of reflections on the room enclosure, are detrimental to the quality and intelligibility of target speech as well as the performance of automatic speech recognition. Hence, speech enhancement aiming at estimating the early target-speech component, which contains the direct component and early reflections, is crucial to nearly all speech-related applications presently available. In this thesis, we compare, propose and evaluate existing and novel approaches ...

Dietzen, Thomas — KU Leuven


Informed spatial filters for speech enhancement

In modern devices which provide hands-free speech capturing functionality, such as hands-free communication kits and voice-controlled devices, the received speech signal at the microphones is corrupted by background noise, interfering speech signals, and room reverberation. In many practical situations, the microphones are not necessarily located near the desired source, and hence, the ratio of the desired speech power to the power of the background noise, the interfering speech, and the reverberation at the microphones can be very low, often around or even below 0 dB. In such situations, the comfort of human-to-human communication, as well as the accuracy of automatic speech recognisers for voice-controlled applications can be signi cantly degraded. Therefore, e ffective speech enhancement algorithms are required to process the microphone signals before transmitting them to the far-end side for communication, or before feeding them into a speech recognition ...

Taseska, Maja — Friedrich-Alexander Universität Erlangen-Nürnberg


Data-Driven Estimation of Spatiotemporal Performance Maps in Cellular Networks

For a large class of non-delay-critical applications (e.g., buffered video streaming or data transfer from cloud services to local devices), end-to-end throughput becomes the most crucial key performance indicator (KPI). In cellular networks, the achievable end-user throughput (the maximum throughput a user will get when attempting to download as much data as possible) is a spatiotemporal function, and its estimation poses a challenging and as-yet unsolved problem. The ability to accurately predict achievable throughput in a given location and time interval would, for example, allow mobile operators to further optimize their networks and design more personalized offers for the customers, or allow end-users with mobile broadband modems to make more informed decisions when selecting a provider. This work investigates the impact of individual parameters on the end-user achievable throughput in cellular networks and analyzes the feasibility and limitations of constructing ...

Vaclav Raida — TU Wien


Development of a Framework to Enhance BVOC Imaging

Air pollution remains a major global challenge, particularly in urban areas where high pollutant concentrations negatively impact public health and contribute to climate change. Among the various pollutants, biogenic volatile organic compounds (BVOCs) play a critical role in atmospheric chemistry, influencing the formation of secondary organic aerosols and ground-level ozone, affecting air quality and climate dynamics. Accurately estimating BVOC emissions at high spatial resolution is challenging due to the limitations of satellite observations and computational models. Additionally, forecasting nitrogen dioxide (NO2) concentrations in urban environments is vital for effective air quality management, yet existing models often struggle to capture complex spatiotemporal dependencies. The thesis aims to address these challenges by proposing novel deep learning (DL) frameworks to tackle two key tasks: (i) improving the spatial resolution of BVOC emission maps through super-resolution (SR) techniques and (ii) developing a robust model ...

Giganti, Antonio — Politecnico di Milano


Sparsity in Linear Predictive Coding of Speech

This thesis deals with developing improved modeling methods for speech and audio processing based on the recent developments in sparse signal representation. In particular, this work is motivated by the need to address some of the limitations of the well-known linear prediction (LP) based all-pole models currently applied in many modern speech and audio processing systems. In the first part of this thesis, we introduce \emph{Sparse Linear Prediction}, a set of speech processing tools created by introducing sparsity constraints into the LP framework. This approach defines predictors that look for a sparse residual rather than a minimum variance one, with direct applications to coding but also consistent with the speech production model of voiced speech, where the excitation of the all-pole filter is model as an impulse train. Introducing sparsity in the LP framework, will also bring to develop the ...

Giacobello, Daniele — Aalborg University


Super-Resolution Image Reconstruction Using Non-Linear Filtering Techniques

Super-resolution (SR) is a filtering technique that combines a sequence of under-sampled and degraded low-resolution images to produce an image at a higher resolution. The reconstruction takes advantage of the additional spatio-temporal data available in the sequence of images portraying the same scene. The fundamental problem addressed in super-resolution is a typical example of an inverse problem, wherein multiple low-resolution (LR)images are used to solve for the original high-resolution (HR) image. Super-resolution has already proved useful in many practical cases where multiple frames of the same scene can be obtained, including medical applications, satellite imaging and astronomical observatories. The application of super resolution filtering in consumer cameras and mobile devices shall be possible in the future, especially that the computational and memory resources in these devices are increasing all the time. For that goal, several research problems need to be ...

Trimeche, Mejdi — Tampere University of Technology


A flexible scalable video coding framework with adaptive spatio-temporal decompositions

The work presented in this thesis covers topics that extend the scalability functionalities in video coding and improve the compression performance. Two main novel approaches are presented, each targeting a different part of the scalable video coding (SVC) architecture: motion adaptive wavelet transform based on the wavelet transform in lifting implementation, and a design of a flexible framework for generalised spatio-temporal decomposition. Motion adaptive wavelet transform is based on the newly introduced concept of connectivity-map. The connectivity-map describes the underlying irregular structure of regularly sampled data. To enable a scalable representation of the connectivity-map, the corresponding analysis and synthesis operations have been derived. These are then employed to define a joint wavelet connectivity-map decomposition that serves as an adaptive alternative to the conventional wavelet decomposition. To demonstrate its applicability, the presented decomposition scheme is used in the proposed SVC framework, ...

Sprljan, Nikola — Queen Mary University of London


Microphone arrays for imaging of aerospace noise sources

With the continuous growth in demand for air traffic and wind turbines, the noise emissions they generate are becoming an increasingly important issue. To reduce their noise levels, it is essential to obtain accurate information about all the sound sources present. Phased microphone arrays and acoustic imaging methods allow for the estimation of the location and strength of sound sources. Experiments with these devices are one of the main approaches in the current research in aeroacoustics, along with computational simulations or noise prediction models. This thesis presents a detailed literature review on the most common aerospace noise sources, challenges in aeroacoustic measurements, and the acoustic imaging methods typically used to overcome them. Practical recommendations are provided for selecting the appropriate imaging technique depending on the type of experiment. New integration techniques for distributed sound sources, such as leading– or trailing–edge ...

Merino-Martinez, Roberto — Delft University of Technology


Video person recognition strategies using head motion and facial appearance

In this doctoral dissertation, we principally explore the use of the temporal information available in video sequences for person and gender recognition; in particular, we focus on the analysis of head and facial motion, and their potential application as biometric identifiers. We also investigate how to exploit as much video information as possible for the automatic recognition; more precisely, we examine the possibility of integrating the head and mouth motion information with facial appearance into a multimodal biometric system, and we study the extraction of novel spatio-temporal facial features for recognition. We initially present a person recognition system that exploits the unconstrained head motion information, extracted by tracking a few facial landmarks in the image plane. In particular, we detail how each video sequence is firstly pre-processed by semiautomatically detecting the face, and then automatically tracking the facial landmarks over ...

Matta, Federico — Eurécom / Multimedia communications


Energy Efficient Network for Rural Broadband Access

This thesis proposes and discusses aspects of a low-cost wireless network called “Hopscotch” as a potential solution to the rural broadband problem. Providing broadband internet access to rural locations is challenging due to the long distances between internet backbone and households, the sparse population density and difficult terrain. Hopscotch uses a network of renewable powered base stations, termed “WindFi”, connected by point-to-point links, to deliver internet access to rural communities. A combination of frequency bands are used within Hopscotch. Standard IEEE 802.11 5GHz WiFi access technology is used for high capacity links, and an ultra high frequency TV “white space” spectrum overlay in the 600-800 MHz band provides long distance coverage. The advantages of “white space” spectrum are demonstrated for a rural wireless scenario; reducing the number of base stations required to cover a community and decreasing the transmit power ...

McGuire, Colin — University of Strathclyde


Polynomial Predictive Filters: Implementation and Applications

In this thesis, smoothness of sampled real-world signals is exploited through the application of polynomial predictive filters. The principal reason for employing the polynomial signal model is principally twofold: firstly, assuming that the sampling rate is adequate, all real-world signals exhibit piecewise polynomial-like behavior, and secondly, polynomial-based signal processing is computationally efficient. By definition, polynomial predictive filters provide estimates of future values of polynomial-like signals. Thus, the potential applications of this research include a vast number of different delay sensitive operations on measurements like temperature, position, velocity, or power, especially in control engineering field. The polynomial-based predictive signal processing is a well-known technique, but polynomial-predictive filters have had severe drawbacks, which have hindered their application; their white noise attenuation is generally low, or they exhibit considerable passband gain peaks, rendering them unattractive for most applications. It has been possible to ...

Tanskanen, Jarno M. A. — Helsinki University of Technology


Solving inverse problems in room acoustics using physical models, sparse regularization and numerical optimization

Reverberation consists of a complex acoustic phenomenon that occurs inside rooms. Many audio signal processing methods, addressing source localization, signal enhancement and other tasks, often assume absence of reverberation. Consequently, reverberant environments are considered challenging as state-ofthe-art methods can perform poorly. The acoustics of a room can be described using a variety of mathematical models, among which, physical models are the most complete and accurate. The use of physical models in audio signal processing methods is often non-trivial since it can lead to ill-posed inverse problems. These inverse problems require proper regularization to achieve meaningful results and involve the solution of computationally intensive large-scale optimization problems. Recently, however, sparse regularization has been applied successfully to inverse problems arising in different scientific areas. The increased computational power of modern computers and the development of new efficient optimization algorithms makes it possible ...

Antonello, Niccolò — KU Leuven


Toward sparse and geometry adapted video approximations

Video signals are sequences of natural images, where images are often modeled as piecewise-smooth signals. Hence, video can be seen as a 3D piecewise-smooth signal made of piecewise-smooth regions that move through time. Based on the piecewise-smooth model and on related theoretical work on rate-distortion performance of wavelet and oracle based coding schemes, one can better analyze the appropriate coding strategies that adaptive video codecs need to implement in order to be efficient. Efficient video representations for coding purposes require the use of adaptive signal decompositions able to capture appropriately the structure and redundancy appearing in video signals. Adaptivity needs to be such that it allows for proper modeling of signals in order to represent these with the lowest possible coding cost. Video is a very structured signal with high geometric content. This includes temporal geometry (normally represented by motion ...

Divorra Escoda, Oscar — EPFL / Signal Processing Institute


Bayesian State-Space Modelling of Spatio-Temporal Non-Gaussian Radar Returns

Radar backscatter from an ocean surface is commonly referred to as sea clutter. Any radar backscatter not due to the scattering from an ocean surface constitutes a potential target. This thesis is concerned with the study of target detection techniques in the presence of high resolution sea clutter. In this dissertation, the high resolution sea clutter is treated as a compound process, where a fast oscillating speckle component is modulated in power by a slowly varying modulating component. While the short term temporal correlations of the clutter are associated with the speckle, the spatial correlations are largely associated with the modulating component. Due to the disparate statistical and correlation properties of the two components, a piecemeal approach is adopted throughout this thesis, whereby the spatial and the temporal correlations of high resolution sea clutter are treated independently. As an extension ...

Noga, Jacek Leszek — University of Cambridge

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.