Elsevier

Microelectronics Journal

Volume 40, Issue 11, November 2009, Pages 1642-1649
Microelectronics Journal

Implementation of a 2×2 MIMO-OFDM receiver on an application specific processor

https://doi.org/10.1016/j.mejo.2009.02.005Get rights and content

Abstract

This paper describes the implementation of the hard computational kernels required for the baseband (BB) processing of a 2×2 multiple-input multiple-output (MIMO)-OFDM receiver on a design-framework for application specific processors. The employed low-complexity BB algorithms are described and their computational complexity is derived. The receiver is split into two parts which are mapped onto two application specific processors, each tailored to the computational needs of the associated digital signal processing kernels. The first processor performs the per stream MIMO-OFDM processing. The second processor handles the MIMO detection. Finally, the 0.18μm 1P/6M CMOS technology layout of both fabricated application specific processors is presented. Real-time BB processing is possible on these engines running at a clock frequency of 250 MHz.

Introduction

The world of wireless communications is governed by a plethora of different standards, each tailored to the needs of its own target application domain (e.g., wireless local area network (WLAN) or cellular networks). Consequently, many standards employ different coding and modulation schemes. This heterogeneous, fast evolving environment is the motivation for the development of software defined radio (SDR) concepts. Ideally, SDRs are capable of handling all possible transmission techniques on the same platform, without the need of integrating dedicated modems for each standard.

Multiple-input multiple-output (MIMO) transmission is a recent development in the area of wireless communications that permits to increase the spectral efficiency by employing multiple antennas at both the transmitter and the receiver. Since its conception, MIMO communication has reached a level of maturity that permits integration of the technology into wireless communication standards. Examples of standards that already employ MIMO are the IEEE 802.11n standard for local area networks, the IEEE 802.16e standard for metropolitan area networks, or the 3GPP LTE standard for cellular systems.

Unfortunately, the tremendous computational complexity associated with MIMO signal processing at the receiver conflicts with the reduced amount of processing resources available in a low-cost software programmable architecture of an SDR solution. Hence, it appears reasonable to step back from the ambition of directly implementing the ultimate SDR transceiver that is capable of handling all present and future standards. The more pragmatic approach considered in this paper employs two levels of configurability: the first level is the use of an application specific processor design-framework which permits design-time customization to meet the requirements of a specific standard, thus enabling the rapid development of new products. The second level provides limited flexibility for reconfiguration at run-time. It enables bug-fixes, the incorporation of small changes to a standard, and upgrades to improved receiver algorithms.

A summary of the challenges associated with implementing fourth generation (4G) wireless communication systems on SDR architectures can be found in [1]. The authors conclude that technology scaling alone does not suffice to meet the stringent performance requirements of 4G systems. Hence, new architectural solutions are required. Two emerging approaches are investigated in [2]. The first approach comprises reconfigurable architectures with different levels of granularity. It is argued that success stories of companies that have followed this approach for mobile phones are still missing. The second approach follows a DSP-centric, accelerator assisted architecture. It is argued that this approach is more promising for the mobile phone market compared to the reconfigurable approach.

More specifically, concerning the application area under consideration in this paper, the potential of using reconfigurable hardware to support different systems which employ the same modulation techniques has been explored in [3] for the OFDM-based IEEE 802.11a and HIPERLAN/2 standards. In [4] the implementation of a minimal MIMO-OFDM receiver on the RaPiD coarse-grained reconfigurable architecture is considered. It is found that there is a sixfold increase in complexity compared to an ASIC implementation, while the cost is reduced by a factor of six compared to an implementation on conventional digital signal processors. A complete IEEE 802.11a baseband (BB) implementation on an application specific processor is presented in our previous work [5]. The implementation revealed that a standard-specific software programmable approach is feasible with only minor area overhead when compared to dedicated ASIC solutions. This encouraging result motivates the choice of the application specific processor design-framework for implementing the 2×2 MIMO-OFDM receiver considered in this paper. While all of the above-described architectures consider cellular systems, single-antenna systems, or a minimal MIMO-OFDM receiver, a first implementation of a 2×2 MIMO-OFDM WLAN receiver on the ADRES coarse grained array has been described in [6] and was further elaborated on in [7], [8].

In this paper, we consider the use of an application specific processor for implementing the BB processing of a 2×2 MIMO-OFDM receiver. Our main contributions are the description of the application specific processor architecture and the description of the mapping of the MIMO-OFDM signal processing onto that architecture. We also provide a summary of the required algorithms together with an analysis of their computational complexity. Finally, our implementation results provide reference for the true silicon complexity of a real-time capable 2×2 MIMO-OFDM receiver implemented on a software-programmable, i.e., reconfigurable hardware.

The remainder of this paper is structured as follows: Section 2 considers the algorithms required for the BB processing in a typical MIMO-OFDM receiver. It identifies the involved atomic operations, and analyzes the corresponding processing requirements for the general case and for the 2×2 configuration under consideration in this paper. This analysis sets the stage for the subsequent discussion of our SDR implementation. We start with the introduction of the proposed application specific processor design-framework in Section 3. Section 4 describes a processor composition that is suitable for implementing a 2×2 MIMO-OFDM receiver supporting a bandwidth of 20 MHz. It provides the relevant details of the corresponding run-time datapath configurations, the program flow, and the schedule. Section 5 summarizes our implementation results and compares the proposed solution to other works in the field. Conclusions are drawn in Section 6.

Section snippets

Linear MMSE MIMO-OFDM receiver system

Fig. 1 provides a high-level view on the MR×MT MIMO-OFDM communication system considered in this paper with MT=2 transmit and MR=2 receive antennas. In spatial multiplexing mode, the transmitter starts by splitting the incoming binary data stream (TxData) into MT lower rate data streams. These lower-rate streams are transmitted concurrently in the same frequency band, using OFDM modulation. To this end, the following operation is performed for each stream independently: the incoming bits are

ASPE design-framework

Our application specific processor is based on the adaptive stream processing engine (ASPE) design-framework described in [13]. The ASPE is a modular coarse-grained architecture optimized for data processing, which mainly consists of regular and repetitive kernels. On one side, the ASPE is tightly coupled with a general purpose processor (GPP) responsible for controlling and setting up the ASPE, as well as for executing performance uncritical tasks. On the other side, the ASPE can autonomously

Implementation

The choice of the high-level architecture for implementing the 2×2 MIMO-OFDM receiver was mainly guided by our findings reported in [5]. These results indicate that the ASPE, constituting the basis for the work described in this paper, is sufficient to achieve real-time performance for a single-antenna OFDM receiver. On the other hand, our first order complexity-estimate in Table 2 reveals that in a two-antenna MIMO-OFDM receiver, the OFDM processing alone requires slightly more than twice as

Results

The system composed of two different ASPE configurations presented in this paper is capable of implementing the BB processing relevant tasks required for a 2×2 MIMO-OFDM receiver. Both ASPE designs were synthesized and placed in 0.18μm 1P/6M CMOS technology, and sent to fabrication on a multi-project wafer run. The achieved post-layout clock frequency of 250 MHz permits to follow the schedule of Fig. 5, allowing the system to operate in real-time. The total area required by the two programmable

Conclusions

In conclusion it can be stated that the application specific processors are a suitable vehicle to efficiently implement standard-specific SDR transceivers. Additionally, the proposed architecture leads to area-efficient implementations, since it allows application-specific design-time configuration. Nevertheless, it must be noted that some specialized units (in this case an additional Viterbi decoder) with little, or no configurability at all, might still be required to meet real-time

Acknowledgements

This research was supported by the Swiss Innovation Promotion Agency (CTI), Project no. KTI-8537. Many thanks go to Matthias Braendli, Benjamin Dietrich, and Lukas Haas, who were substantially involved in the design of ASPE B.

References (14)

  • M. Woh, S. Seo, H. Lee, Y. Lin, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner, The next generation challenge for...
  • U. Ramacher

    Software-defined radio prospects for multistandard mobile phones

    Computer

    (2007)
  • T. Pionteck, L.D. Kabulepa, M. Glesner, Exploring the capabilities of reconfigurable hardware for OFDM-based WLANs, in:...
  • C. Ebeling et al.

    Implementing an OFDM receiver on the RaPiD reconfigurable architecture

    IEEE Transactions on Computers

    (2004)
  • S. Eberli, A. Burg, T. Boesch, W. Fichtner, An IEEE 802.11a baseband receiver implementation on an application specific...
  • D. Novo, W. Moffat, V. Derudder, B. Bougard, Mapping a multiple antenna SDM-OFDM receiver on the ADRES coarse-grained...
  • B. Bougard, A. Bourdoux, F. Naessens, M. Glassee, V. Derudder, L. Van der Perre, Energy-efficient software-defined...
There are more references available in the full text version of this article.

Cited by (0)

View full text