Power/Energy Estimation and Optimization for Software-Oriented Embedded Systems (2009)
An Energy Aware Framework for Mobile Computing
Since their inception, energy dissipation has been a critical issue for mobile computing systems. Although a large research investment in low-energy circuit design and hardware level energy management has led to more energy-efficient architectures, even then, there is a growing realization that the contribution to energy conservation should be more rigorously considered at higher levels of the systems, such as operating systems and applications. This dissertation puts forth the claim that energy-aware compilation to improve appli- cation quality both in terms of execution time and energy consumption is essential for a high performance mobile computing embedded system design. Our work is a design paradigm shift from the logic gate being the basic silicon computation unit, to an in- struction running on an embedded processor. Multimedia DSP processors are the most lucrative choice to a mobile computing system design for their ...
Azeemi, N. Zafar — Vienna University of Technology
Digital design and experimental validation of high-performance real-time OFDM systems
The goal of this Ph.D. dissertation is to address a number of challenges encountered in the digital baseband design of modern and future wireless communication systems. The fast and continuous evolution of wireless communications has been driven by the ambitious goal of providing ubiquitous services that could guarantee high throughput, reliability of the communication link and satisfy the increasing demand for efficient re-utilization of the heavily populated wireless spectrum. To cope with these ever-growing performance requirements, researchers around the world have introduced sophisticated broadband physical (PHY)-layer communication schemes able to accommodate higher bandwidth, which indicatively include multiple antennas at the transmitter and receiver and are capable of delivering improved spectral efficiency by applying interference management policies. The merging of Multiple Input Multiple Output (MIMO) schemes with the Orthogonal Frequency Division Multiplexing (OFDM) offers a flexible signal processing substrate to implement ...
Font-Bach, Oriol — Centre Tecnològic de Telecomunicacions de Catalunya (CTTC)
Combined Word-Length Allocation and High-Level Synthesis of Digital Signal Processing Circuits
This work is focused on the synthesis of Digital Signal Processing (DSP) circuits usingc specific hardware architectures. Due to its complexity, the design process has been subdivided into separate tasks, thus hindering the global optimization of the resulting systems. The author proposes the study of the combination of two major design tasks, Word-Length Allocation (WLA) and High-Level Synthesis (HLS), aiming at the optimization of DSP implementations using modern Field Programmable Gate Array devices (FPGAs). A multiple word-length approach (MWL) is adopted since it leads to highly optimized implementations. MWL implies the customization of the word-lengths of the signals of an algorithm. This complicates the design, since the number possible assignations between algorithm operations and hardware resources becomes very high. Moreover, this work also considers the use of heterogeneous FPGAs where there are several types of resources: configurable logic-based blocks (LUT-based) ...
Caffarena, Gabriel — Universidad Politecnica de Madrid
Heuristic Optimization Methods for System Partitioning in HW/SW Co-Design
Nowadays, the design of embedded systems is confronted with the combination of complex signal processing algorithms on the one hand and a variety of computational intensive multimedia applications on the other hand, while time to product launch has been extremely reduced. Especially in the wireless domain those challenges are stacked with tough requirements on power consumption and chip size. Unfortunately, design productivity did not undergo a similar progression and therefore fails to cope with the heterogeneity of modern hardware architectures. Until now, electronic design automation do not provide for complete coverage of the design ow. In particular crucial design tasks as high level characterisation of algorithms, oating-point to xed-point conversion, automated hardware/software partitioning, and automated virtual prototyping are not suciently supported or completely absent. In recent years a consistent design framework named Open Tool Integration Environment (OTIE) has been established ...
Knerr, Bastian — Vienna University of Technology
Testbed Design for Wireless Communications Systems Assessment
Since Marconi succeeded in carrying out the first wireless transmission in 1894, experimental research has been always linked with wireless communications. Today, most wireless communications research relies only on computer simulations. Although computer simulations are necessary and recommendable for wireless systems evaluation, they only reflect the simulation environment rather than the actual scenarios in which wireless systems operate. Consequently, it is desirable to assess wireless communications systems in real-world scenarios while, at the same time, keeping the required effort within reasonable terms. Among the different strategies suitable for undertaking such assessment, the testbed approach constitutes a simple and flexible enough solution based on the software-defined radio concept in which only the fundamental operations (usually the transmission and the acquisition) are carried out in real- time, while the remaining tasks are implemented off-line in high-level programming languages (e.g. MATLAB) and using ...
Garcia Naya, Jose Antonio — Universidade da Coruna
Design and Evaluation of OFDM Radio Interfaces for High Mobility Communications
In the last two decades, multicarrier modulations have emerged as a low complexity solution to combat the effects of the multipath in wireless communications. Among them, Orthogonal Frequency Division Multiplexing (OFDM) is possibly the most studied modulation scheme, and has also been widely adopted as the foundation of industry standards such as WiMAX or LTE. However, OFDM is sensitive to time-selective channels, which are featured in mobility scenarios, due to the appearance of Inter-Carrier Interference (ICI). Implementation of hardware equipment for the end user is usually implemented in dedicated chips, but in research environments, more flexible solutions are preferred. One popular approach is the so-called Software Defined Radio (SDR), where the signal processing algorithms are implemented in reconfigurable hardware such as Digital Signal Processors (DSPs) and Field Programmable Gate Arrays (FPGAs). The aim of this work is two-fold. On the ...
Suárez Casal, Pedro — University of A Coruña
Adaptive Signal Processing for Power Line Communications
This thesis represents a significant part of the research activity conducted during the PhD program in Information Technologies, supported by Selta S.p.A, Cadeo, Italy, focused on the analysis and design of a Power Line Communications (PLC) system. In recent times the PLC technologies have been considered for integration in Smart Grids architectures, as they are used to exploit the existing power line infrastructure for information transmission purposes on low, medium and high voltage lines. The characterization of a reliable PLC system is a current object of research as well as it is the design of modems for communications over the power lines. In this thesis, the focus is on the analysis of a full-duplex PLC modem for communication over high-voltage lines, and, in particular, on the design of the echo canceller device and innovative channel coding schemes. The first part ...
Tripodi, Carlo — Università degli Studi di Parma
Sparsity-Aware Wireless Networks: Localization and Sensor Selection
Wireless networks have revolutionized nowadays world by providing real-time cost efficient service and connectivity. Even such an unprecedented level of service could not fulfill the insatiable desire of the modern world for more advanced technologies. As a result, a great deal of attention has been directed towards (mobile) wireless sensor networks (WSNs) which are comprised of considerably cheap nodes that can cooperate to perform complex tasks in a distributed fashion in extremely harsh environments. Unique features of wireless environments, added complexity owing to mobility, distributed nature of the network setup, and tight performance and energy constraints, pose a challenge for researchers to devise systems which strike a proper balance between performance and resource utilization. We study some of the fundamental challenges of wireless (sensor) networks associated with resource efficiency, scalability, and location-awareness. The pivotal point which distinguishes our studies from ...
Jamali-Rad, Hadi — TU Delft
In this thesis, we present the TM3270 VLIW media-processor, the latest of TriMedia processors, and describe the innovations with respect to its prede- cessor: the TM3260. We describe enhancements to the load/store unit design, such as a new data prefetching technique, and architectural enhancements, such as additions to the TriMedia Instruction Set Architecture (ISA). Examples of ISA enhancements include collapsed load operations, two-slot operations and H.264 specific CABAC decoding operations. All of the TM3270 innovations contribute to a common goal: a balanced processor design in terms of silicon area and power consumption, which enables audio and standard resolution video processing for both the connected and portable markets. To measure the speedup of the indi- vidual innovations of the TM3270 design, we evaluate processor performance on a set of complete video applications: motion estimation, MPEG2 encoding and temporal upconversion. Each of ...
van de Waerdt, Jan-Willem — Delft University of Technology
Modeling of Magnetic Fields and Extended Objects for Localization Applications
The level of automation in our society is ever increasing. Technologies like self-driving cars, virtual reality, and fully autonomous robots, which all were unimaginable a few decades ago, are realizable today, and will become standard consumer products in the future. These technologies depend upon autonomous localization and situation awareness where careful processing of sensory data is required. To increase efficiency, robustness and reliability, appropriate models for these data are needed. In this thesis, such models are analyzed within three different application areas, namely (1) magnetic localization, (2) extended target tracking, and (3) autonomous learning from raw pixel information. Magnetic localization is based on one or more magnetometers measuring the induced magnetic field from magnetic objects. In this thesis we present a model for determining the position and the orientation of small magnets with an accuracy of a few millimeters. This ...
Wahlström, Niklas — Linköping University
Signal Quantization and Approximation Algorithms for Federated Learning
Distributed signal or information processing using Internet of Things (IoT), facilitates real-time monitoring of signals, for example, environmental pollutants, health indicators, and electric energy consumption in a smart city. Despite the promising capabilities of IoTs, these distributed deployments often face the challenge of data privacy and communication rate constraints. In traditional machine learning, training data is moved to a data center, which requires massive data movement from distributed IoT devices to a third-party location, thus raising concerns over privacy and inefficient use of communication resources. Moreover, the growing network size, model size, and data volume combined lead to unusual complexity in the design of optimization algorithms beyond the compute capability of a single device. This necessitates novel system architectures to ensure stable and secure operations of such networks. Federated learning (FL) architecture, a novel distributed learning paradigm introduced by McMahan ...
A, Vijay — Indian Institute of Technology Bombay
A statistical approach to motion estimation
Digital video technology has been characterized by a steady growth in the last decade. New applications like video e-mail, third generation mobile phone video communications, videoconferencing, video streaming on the web continuously push for further evolution of research in digital video coding. In order to be sent over the internet or even wireless networks, video information clearly needs compression to meet bandwidth requirements. Compression is mainly realized by exploiting the redundancy present in the data. A sequence of images contains an intrinsic, intuitive and simple idea of redundancy: two successive images are very similar. This simple concept is called temporal redundancy. The research of a proper scheme to exploit the temporal redundancy completely changes the scenario between compression of still pictures and sequence of images. It also represents the key for very high performances in image sequence coding when compared ...
Moschetti, Fulvio — Swiss Federal Institute of Technology
Joint Downlink Beamforming and Discrete Resource Allocation Using Mixed-Integer Programming
Multi-antenna processing is widely adopted as one of the key enabling technologies for current and future cellular networks. Particularly, multiuser downlink beamforming (also known as space-division multiple access), in which multiple users are simultaneously served with spatial transmit beams in the same time and frequency resource, achieves high spectral efficiency with reduced energy consumption. To harvest the potential of multiuser downlink beamforming in practical systems, optimal beamformer design shall be carried out jointly with network resource allocation. Due to the specifications of cellular standards and/or implementation constraints, resource allocation in practice naturally necessitates discrete decision makings, e.g., base station (BS) association, user scheduling and admission control, adaptive modulation and coding, and codebook-based beamforming (precoding). This dissertation focuses on the joint optimization of multiuser downlink beamforming and discrete resource allocation in modern cellular networks. The problems studied in this thesis involve ...
Cheng, Yong — Technische Universität Darmstadt
Efficient Perceptual Audio Coding Using Cosine and Sine Modulated Lapped Transforms
The increasing number of simultaneous input and output channels utilized in immersive audio configurations primarily in broadcasting applications has renewed industrial requirements for efficient audio coding schemes with low bit-rate and complexity. This thesis presents a comprehensive review and extension of conventional approaches for perceptual coding of arbitrary multichannel audio signals. Particular emphasis is given to use cases ranging from two-channel stereophonic to six-channel 5.1-surround setups with or without the application-specific constraint of low algorithmic coding latency. Conventional perceptual audio codecs share six common algorithmic components, all of which are examined extensively in this thesis. The first is a signal-adaptive filterbank, constructed using instances of the real-valued modified discrete cosine transform (MDCT), to obtain spectral representations of successive portions of the incoming discrete time signal. Within this MDCT spectral domain, various intra- and inter-channel optimizations, most of which are of ...
Helmrich, Christian R. — Friedrich-Alexander-Universität Erlangen-Nürnberg
Quasi-static scheduling for fine-grained embedded multiprocessing
Designing energy-efficient multiprocessing hardware for applications such as video decoding or MIMO-OFDM baseband processing is challenging because these applications require high throughput, as well as flexibility for efficient use of the processing resources. Application specific hardwired accelerator circuits are the most energy-efficient processing resources, but are inflexible by nature. Furthermore, designing an application specific circuit is expensive and time-consuming. A solution that maintains the energy-efficiency of accelerator circuits, but makes them flexible as well, is to make the accelerator circuits fine-grained. Fine-grained application specific processing elements can be designed to implement general purpose functions that can be used in several applications and their small size makes the design and verification times reasonable. This thesis proposes an efficient method for orchestrating the use of heterogeneous fine-grained processing elements in dynamic applications without introducing tremendous orchestration overheads. Furthermore, the thesis presents a ...
Boutellier, Jani — University of Oulu
The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.
The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.