Implementation of a MIMO-OFDM receiver on an application specific processor
Introduction
The world of wireless communications is governed by a plethora of different standards, each tailored to the needs of its own target application domain (e.g., wireless local area network (WLAN) or cellular networks). Consequently, many standards employ different coding and modulation schemes. This heterogeneous, fast evolving environment is the motivation for the development of software defined radio (SDR) concepts. Ideally, SDRs are capable of handling all possible transmission techniques on the same platform, without the need of integrating dedicated modems for each standard.
Multiple-input multiple-output (MIMO) transmission is a recent development in the area of wireless communications that permits to increase the spectral efficiency by employing multiple antennas at both the transmitter and the receiver. Since its conception, MIMO communication has reached a level of maturity that permits integration of the technology into wireless communication standards. Examples of standards that already employ MIMO are the IEEE 802.11n standard for local area networks, the IEEE 802.16e standard for metropolitan area networks, or the 3GPP LTE standard for cellular systems.
Unfortunately, the tremendous computational complexity associated with MIMO signal processing at the receiver conflicts with the reduced amount of processing resources available in a low-cost software programmable architecture of an SDR solution. Hence, it appears reasonable to step back from the ambition of directly implementing the ultimate SDR transceiver that is capable of handling all present and future standards. The more pragmatic approach considered in this paper employs two levels of configurability: the first level is the use of an application specific processor design-framework which permits design-time customization to meet the requirements of a specific standard, thus enabling the rapid development of new products. The second level provides limited flexibility for reconfiguration at run-time. It enables bug-fixes, the incorporation of small changes to a standard, and upgrades to improved receiver algorithms.
A summary of the challenges associated with implementing fourth generation (4G) wireless communication systems on SDR architectures can be found in [1]. The authors conclude that technology scaling alone does not suffice to meet the stringent performance requirements of 4G systems. Hence, new architectural solutions are required. Two emerging approaches are investigated in [2]. The first approach comprises reconfigurable architectures with different levels of granularity. It is argued that success stories of companies that have followed this approach for mobile phones are still missing. The second approach follows a DSP-centric, accelerator assisted architecture. It is argued that this approach is more promising for the mobile phone market compared to the reconfigurable approach.
More specifically, concerning the application area under consideration in this paper, the potential of using reconfigurable hardware to support different systems which employ the same modulation techniques has been explored in [3] for the OFDM-based IEEE 802.11a and HIPERLAN/2 standards. In [4] the implementation of a minimal MIMO-OFDM receiver on the RaPiD coarse-grained reconfigurable architecture is considered. It is found that there is a sixfold increase in complexity compared to an ASIC implementation, while the cost is reduced by a factor of six compared to an implementation on conventional digital signal processors. A complete IEEE 802.11a baseband (BB) implementation on an application specific processor is presented in our previous work [5]. The implementation revealed that a standard-specific software programmable approach is feasible with only minor area overhead when compared to dedicated ASIC solutions. This encouraging result motivates the choice of the application specific processor design-framework for implementing the MIMO-OFDM receiver considered in this paper. While all of the above-described architectures consider cellular systems, single-antenna systems, or a minimal MIMO-OFDM receiver, a first implementation of a MIMO-OFDM WLAN receiver on the ADRES coarse grained array has been described in [6] and was further elaborated on in [7], [8].
In this paper, we consider the use of an application specific processor for implementing the BB processing of a MIMO-OFDM receiver. Our main contributions are the description of the application specific processor architecture and the description of the mapping of the MIMO-OFDM signal processing onto that architecture. We also provide a summary of the required algorithms together with an analysis of their computational complexity. Finally, our implementation results provide reference for the true silicon complexity of a real-time capable MIMO-OFDM receiver implemented on a software-programmable, i.e., reconfigurable hardware.
The remainder of this paper is structured as follows: Section 2 considers the algorithms required for the BB processing in a typical MIMO-OFDM receiver. It identifies the involved atomic operations, and analyzes the corresponding processing requirements for the general case and for the configuration under consideration in this paper. This analysis sets the stage for the subsequent discussion of our SDR implementation. We start with the introduction of the proposed application specific processor design-framework in Section 3. Section 4 describes a processor composition that is suitable for implementing a MIMO-OFDM receiver supporting a bandwidth of 20 MHz. It provides the relevant details of the corresponding run-time datapath configurations, the program flow, and the schedule. Section 5 summarizes our implementation results and compares the proposed solution to other works in the field. Conclusions are drawn in Section 6.
Section snippets
Linear MMSE MIMO-OFDM receiver system
Fig. 1 provides a high-level view on the MIMO-OFDM communication system considered in this paper with transmit and receive antennas. In spatial multiplexing mode, the transmitter starts by splitting the incoming binary data stream (TxData) into lower rate data streams. These lower-rate streams are transmitted concurrently in the same frequency band, using OFDM modulation. To this end, the following operation is performed for each stream independently: the incoming bits are
ASPE design-framework
Our application specific processor is based on the adaptive stream processing engine (ASPE) design-framework described in [13]. The ASPE is a modular coarse-grained architecture optimized for data processing, which mainly consists of regular and repetitive kernels. On one side, the ASPE is tightly coupled with a general purpose processor (GPP) responsible for controlling and setting up the ASPE, as well as for executing performance uncritical tasks. On the other side, the ASPE can autonomously
Implementation
The choice of the high-level architecture for implementing the MIMO-OFDM receiver was mainly guided by our findings reported in [5]. These results indicate that the ASPE, constituting the basis for the work described in this paper, is sufficient to achieve real-time performance for a single-antenna OFDM receiver. On the other hand, our first order complexity-estimate in Table 2 reveals that in a two-antenna MIMO-OFDM receiver, the OFDM processing alone requires slightly more than twice as
Results
The system composed of two different ASPE configurations presented in this paper is capable of implementing the BB processing relevant tasks required for a MIMO-OFDM receiver. Both ASPE designs were synthesized and placed in 1P/6M CMOS technology, and sent to fabrication on a multi-project wafer run. The achieved post-layout clock frequency of 250 MHz permits to follow the schedule of Fig. 5, allowing the system to operate in real-time. The total area required by the two programmable
Conclusions
In conclusion it can be stated that the application specific processors are a suitable vehicle to efficiently implement standard-specific SDR transceivers. Additionally, the proposed architecture leads to area-efficient implementations, since it allows application-specific design-time configuration. Nevertheless, it must be noted that some specialized units (in this case an additional Viterbi decoder) with little, or no configurability at all, might still be required to meet real-time
Acknowledgements
This research was supported by the Swiss Innovation Promotion Agency (CTI), Project no. KTI-8537. Many thanks go to Matthias Braendli, Benjamin Dietrich, and Lukas Haas, who were substantially involved in the design of ASPE B.
References (14)
- M. Woh, S. Seo, H. Lee, Y. Lin, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner, The next generation challenge for...
Software-defined radio prospects for multistandard mobile phones
Computer
(2007)- T. Pionteck, L.D. Kabulepa, M. Glesner, Exploring the capabilities of reconfigurable hardware for OFDM-based WLANs, in:...
- et al.
Implementing an OFDM receiver on the RaPiD reconfigurable architecture
IEEE Transactions on Computers
(2004) - S. Eberli, A. Burg, T. Boesch, W. Fichtner, An IEEE 802.11a baseband receiver implementation on an application specific...
- D. Novo, W. Moffat, V. Derudder, B. Bougard, Mapping a multiple antenna SDM-OFDM receiver on the ADRES coarse-grained...
- B. Bougard, A. Bourdoux, F. Naessens, M. Glassee, V. Derudder, L. Van der Perre, Energy-efficient software-defined...