Robust Speech Recognition: Analysis and Equalization of Lombard Effect in Czech Corpora

When exposed to noise, speakers modify the way they speak in an effort to maintain intelligible communication. This process, referred to as the Lombard effect (LE), involves a combination of both conscious and subconscious articulatory adjustments. Speech production variations due to LE can cause considerable degradation in automatic speech recognition (ASR), since they introduce a mismatch between the parameters of the speech to be recognized and the ASR system’s acoustic models, which are usually trained on neutral speech. The main objective of this thesis is to analyze the impact of LE on speech production and to propose methods that increase ASR system performance under LE. All presented experiments were conducted on spoken Czech, but the proposed concepts are assumed to be applicable to other languages. The first part of the thesis focuses on the design and acquisition of a ...

Boril, Hynek — Czech Technical University in Prague


Robust and multiresolution video delivery: From H.26x to Matching pursuit based technologies

With the joint development of networking and digital coding technologies, multimedia and more particularly video services are clearly becoming one of the major consumers of the new information networks. The rapid growth of the Internet and computer industry, however, results in a very heterogeneous infrastructure that is commonly overloaded. Video service providers nevertheless have to offer their clients the best possible quality according to their respective capabilities and communication channel status. The Quality of Service is influenced not only by compression artifacts, but also by unavoidable packet losses. Hence, the packet video stream clearly has to fulfill possibly contradictory requirements, namely coding efficiency and robustness to data loss. The first contribution of this thesis is the complete modeling of the video Quality of Service (QoS) in standard and more particularly MPEG-2 applications. The performance of Forward Error Control (FEC) ...

Frossard, Pascal — Swiss Federal Institute of Technology


Audio-visual processing and content management techniques, for the study of (human) bioacoustics phenomena

The present doctoral thesis aims towards the development of new long-term, multi-channel, audio-visual processing techniques for the analysis of bioacoustics phenomena. The effort is focused on the study of the physiology of the gastrointestinal system, aiming at the support of medical research for the discovery of gastrointestinal motility patterns and the diagnosis of functional disorders. The term "processing" in this case is quite broad, incorporating the procedures of signal processing, content description, manipulation and analysis, that are applied to all the recorded bioacoustics signals, the auxiliary audio-visual surveillance information (for the monitoring of experiments and the subjects' status), and the extracted audio-video sequences describing the abdominal sound-field alterations. The thesis outline is as follows. The main objective of the thesis, which is the technological support of medical research, is presented in the first chapter. A quick problem definition is initially ...

Dimoulas, Charalampos — Department of Electrical and Computer Engineering, Faculty of Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece


Mixed structural models for 3D audio in virtual environments

In the world of Information and communications technology (ICT), strategies for innovation and development are increasingly focusing on applications that require spatial representation and real-time interaction with and within 3D-media environments. One of the major challenges that such applications have to address is user-centricity, reflecting e.g. on developing complexity-hiding services so that people can personalize their own delivery of services. In these terms, multimodal interfaces represent a key factor for enabling an inclusive use of new technologies by everyone. In order to achieve this, multimodal realistic models that describe our environment are needed, and in particular models that accurately describe the acoustics of the environment and communication through the auditory modality are required. Examples of currently active research directions and application areas include 3DTV and future internet, 3D visual-sound scene coding, transmission and reconstruction and teleconferencing systems, to name but ...

Geronazzo, Michele — University of Padova


EEG-Biofeedback and Epilepsy: Concept, Methodology and Tools for (Neuro)therapy Planning and Objective Evaluation

Objective diagnosis and therapy evaluation are still challenging tasks for many neurological disorders. This is closely related to the diversity of cases and the variety of treatment modalities available. Especially in the case of epilepsy, which is a complex disorder not well explained at the biochemical and physiological levels, there is a need to investigate novel features that can be extracted and quantified from electrophysiological signals in clinical practice. Neurotherapy is a complementary treatment applied in various disorders of the central nervous system, including epilepsy. The method is subsumed under behavioral medicine and is considered a form of operant conditioning in psychological terms. Although the application areas of this promising unconventional approach are rapidly increasing, the method is strongly debated, since the neurophysiological underpinnings of the process are not yet well understood. Therefore, verification of the efficacy of the treatment is one ...

Kirlangic, Mehmet Eylem — Technische Universitaet Ilmenau


Indexation et Recherche de Video pour la Videosurveillance

The goal of this work is to propose a general approach for surveillance video indexing and retrieval. Based on the hypothesis that videos are preprocessed by an external video analysis module, this approach is composed of two phases: an indexing phase and a retrieval phase. In order to profit from the output of various video analysis modules, a general data model consisting of two main concepts, objects and events, is proposed. The indexing phase, which aims at preparing the data defined in the data model, performs three tasks. Firstly, in the object representation task, two new key blob detection methods choose for each detected object a set of key blobs, each associated with a weight. Secondly, the feature extraction task analyzes a number of visual and temporal features on detected objects. Finally, the indexing task computes attributes of the two concepts and stores ...

Thi-Lan, Le — INRIA, Sophia Antipolis


ULTRA WIDEBAND LOCATION IN SCENARIOS WITHOUT CLEAR LINE OF SIGHT: A PRACTICAL APPROACH

Indoor location has experienced a major boost in recent years. Location-based services (LBS), which until recently were restricted to outdoor scenarios and the use of GPS, have also been extended into buildings. From large public structures such as airports or hospitals to a multitude of industrial scenarios, LBS have become increasingly present in indoor scenarios. Of the various technologies that can be used to achieve this indoor location, those based on ultra-wideband (UWB) signals have become some of the most in demand, due primarily to their accuracy in position estimation. Additionally, the appearance on the market of more and more manufacturers and products has lowered the prices of these devices to levels that make large deployments feasible on a limited budget. By their nature, UWB signals are very resistant to the multi-path phenomenon, ...

Barral, Valentín — Universidade da Coruña


On the Energy Efficiency of Cooperative Wireless Networks

The aim of this dissertation is the study of cooperative communications in wireless networks. In cooperative networks, each user transmits its own data and also aids the communication of other users. User cooperation is particularly attractive for the wireless medium, where every user listens to the transmissions of other users. The main benefit of user cooperation in wireless networks is probably its efficacy in combating wireless channel impairments. Path loss and shadowing effects are overcome by using intermediate nodes with better channel conditions to retransmit the received signal to the destination. Further, the channel fading effect can also be mitigated by means of cooperative spatial diversity (the information arrives at the destination through multiple independent paths). These benefits result in an increase of the users' spectral efficiency and/or savings on the overall network power resource. Besides these gains, the simple ...

Gomez-Vilardebo, Jesus — Universidad Politecnica de Madrid


Modeling and Digital Mitigation of Transmitter Imperfections in Radio Communication Systems

To satisfy the continuously growing demands for higher data rates, modern radio communication systems employ larger bandwidths and more complex waveforms. Furthermore, radio devices are expected to support a rich mixture of standards such as cellular networks, wireless local-area networks, wireless personal area networks, positioning and navigation systems, etc. In general, a "smart" device should be flexible to support all these requirements while being portable, cheap, and energy efficient. These seemingly conflicting expectations impose stringent radio frequency (RF) design challenges which, in turn, call for their proper understanding as well as developing cost-effective solutions to address them. The direct-conversion transceiver architecture is an appealing analog front-end for flexible and multi-standard radio systems. However, it is sensitive to various circuit impairments, and modern communication systems based on multi-carrier waveforms such as Orthogonal Frequency Division Multiplexing (OFDM) and Orthogonal Frequency Division Multiple ...

Kiayani, Adnan — Tampere University of Technology


New Higher-Order Active Contour Models, Shape Priors, and Multiscale Analysis - Their Application To Road Network Extraction From Very High Resolution Satellite Images

The objective of this thesis is to develop and validate robust approaches for the semi-automatic extraction of road networks in dense urban areas from very high resolution (VHR) optical satellite images. Our models are based on the recently developed higher-order active contour (HOAC) phase field framework. The problem is difficult for two main reasons: VHR images are intrinsically complex and network regions may have arbitrary topology. To tackle the complexity of the information contained in VHR images, we propose a multiresolution statistical data model and a multiresolution constrained prior model. They enable the integration of segmentation results from coarse resolution and fine resolution. Subsequently, for the particular case of road map updating, we present a specific shape prior model derived from an outdated GIS digital map. This specific prior term balances the effect of the generic prior knowledge carried by ...

Peng, Ting — Project-Team Ariana (INRIA-Sophia Antipolis, France); LIAMA (CASIA, China)


Microphone arrays for imaging of aerospace noise sources

With the continuous growth in demand for air traffic and wind turbines, the noise emissions they generate are becoming an increasingly important issue. To reduce their noise levels, it is essential to obtain accurate information about all the sound sources present. Phased microphone arrays and acoustic imaging methods allow for the estimation of the location and strength of sound sources. Experiments with these devices are one of the main approaches in the current research in aeroacoustics, along with computational simulations or noise prediction models. This thesis presents a detailed literature review on the most common aerospace noise sources, challenges in aeroacoustic measurements, and the acoustic imaging methods typically used to overcome them. Practical recommendations are provided for selecting the appropriate imaging technique depending on the type of experiment. New integration techniques for distributed sound sources, such as leading- or trailing-edge ...

Merino-Martinez, Roberto — Delft University of Technology


GNSS Array-based Acquisition: Theory and Implementation

This dissertation addresses the signal acquisition problem using antenna arrays in the general framework of Global Navigation Satellite Systems (GNSS) receivers. GNSSs provide the necessary infrastructure for a myriad of applications and services that demand a robust and accurate positioning service. GNSS ranging signals are received with a very low signal-to-noise ratio. Although the GNSS CDMA modulation offers limited protection against Radio Frequency Interference (RFI), an interference that exceeds the processing gain can easily degrade receivers' performance or even completely deny the GNSS service. Concern about this problem has grown in recent times. A single-antenna receiver can make use of time and frequency diversity to mitigate interference, even though the performance of these techniques is compromised in the presence of wideband interference. Antenna array receivers can benefit from spatial-domain processing, and thus mitigate the effects of interfering signals. ...

Arribas, Javier — Universitat Politecnica de Catalunya


Digital compensation of front-end non-idealities in broadband communication systems

The wireless communication industry has seen tremendous growth in the last few decades. The ever-increasing demand to stay connected at home, at work, and on the move, with voice and data applications, has sustained the need for more sophisticated end-user devices. A typical smart communication device these days consists of a radio system that can access a mixture of mobile cellular services (GSM, UMTS, etc.), indoor wireless broadband services (WLAN-802.11b/g/n), short-range and low-energy personal communications (Bluetooth), positioning and navigation systems (GPS), etc. A smart device capable of meeting all these requirements has to be highly flexible and should be able to reconfigure radio transmitters and receivers as and when required. Further, the radio modules used in these devices should be extremely small so that the device itself is portable. In addition, the device should also be economical ...

Tandur, Deepaknath — Katholieke Universiteit Leuven


Design and development of multi-biometric systems

Biometric recognition has long been used in confined spaces, usually indoors, where security-critical operations required high-accuracy recognition systems, e.g. in police stations, banks, companies, and airports. Field activities, by contrast, required more portability and flexibility, leading to the development of devices for less constrained acquisition of biometric traits and consequently of robust algorithms for biometric recognition in less constrained conditions. However, the application of "portable" biometric recognition was still limited to specific fields, e.g. immigration control, and still required dedicated devices. A further step would be to spread the use of biometric recognition to personal devices, such as personal computers, tablets and smartphones. Some attempts in this direction were made by embedding fingerprint scanners in laptops or smartphones. So far, biometric recognition on personal devices has been employed for just a limited set of tasks, such as to unlock ...

Galdi, Chiara — University of Salerno and EURECOM


Measurement and Modelling of Internet Traffic over 2.5 and 3G Cellular Core Networks

The task of modeling data traffic in networks is as old as the first commercial telephony systems. In the recent past, the focus in mobile telephone networks has moved from voice to packet-switched services. The new cellular mobile networks of the third generation (UMTS) and the evolved second generation (GPRS) offer the subscriber the possibility of staying online everywhere and at any time. Design and dimensioning are well understood for circuit-switched voice systems, but not for mobile packet-switched systems. Terms such as user expectation and grade of service need to be defined. To find these parameters it is important to have an accurate traffic model that delivers good traffic estimates. In this thesis we carried out measurements in a live 3G core network of an Austrian operator, in order to find appropriate models that can serve as ...

Svoboda, Philipp — Vienna University of Technology
