Journal of Engineering and Technology

: 2015  |  Volume : 5  |  Issue : 1  |  Page : 36--40

VLSI Design and Investigation of an Area Efficient and Low Power MOD-R2MDC FFT for MOMO-OFDM

Kirubanandasarathy Nageswaran, Karthikeyan Kottaisamy 
 Department of Electronics and Communication Engineering, Syed Ammal Engineering College, Ramanathapuram, Tamil Nadu, India

Correspondence Address:
Kirubanandasarathy Nageswaran
Department of Electronics and Communication Engineering, Syed Ammal Engineering College, Ramanathapuram, Tamil Nadu


In this paper, an area-efficient low power fast Fourier transform processor is proposed for multi input multi output-orthogonal frequency division multiplexing (MIMO-OFDM) in wireless communication system. It consists of a modified architecture of radix-2 (R2) algorithm which is described as modified R2 multipath delay commutation (MOD-R2MDC). OFDM is a popular method for high data rate wireless transmission. This paper describes the very large scale integration design of an area efficient MOD-R2MDC FFT for MIMO-OFDM system targeted to future wireless communication systems. The very high speed integrated hardware description language simulation results have been tested practically by implementing in the Altera development and education-2 field programmed gate array (FPGA) development board. Also the existing OFDM system has been tested with these FFT algorithms and their performances were analyzed with respect to occupation of area in FPGA and power consumption. A low-power and area efficient architecture enables the real-time operations of MIMO-OFDM system.

How to cite this article:
Nageswaran K, Kottaisamy K. VLSI Design and Investigation of an Area Efficient and Low Power MOD-R2MDC FFT for MOMO-OFDM.J Eng Technol 2015;5:36-40

How to cite this URL:
Nageswaran K, Kottaisamy K. VLSI Design and Investigation of an Area Efficient and Low Power MOD-R2MDC FFT for MOMO-OFDM. J Eng Technol [serial online] 2015 [cited 2020 Jun 6 ];5:36-40
Available from:

Full Text

 1. Introduction

Multi input multi output-orthogonal frequency division multiplexing (MIMO-OFDM) is an efficient solution for transmitting and receiving the data over a long distance. The sub-carrier frequency has been chosen in our proposed MIMO-OFDM transceivers so that cross-talk between the sub-channels are eliminated, hence, the inter-carrier guard bands are not required [1] . This greatly simplifies the design of both the transmitter and the receiver; unlike conventional frequency division multiplexing, a separate filter for each sub-channel is not required [2] . The orthogonally allows for the efficient modulator and demodulator implementation using the fast Fourier transform (FFT) algorithm [3] . OFDM transceiver is popular for wideband communications today by way of low-cost MIMO-OFDM in wireless telecommunication system. It requires very accurate frequency synchronization between the receiver and they have reduced the complexity [4] . In the transmitter; with frequency deviation, the sub-carriers shall no longer be orthogonal, causing inter-symbol interference [5] . The 5/6 coding rate would be not effective for error correcting by a viterbi decoder [6] . This paper describes the very large scale integration (VLSI) implementation of the proposed modified radix-2 multipath delay commutation (MOD-R2MDC) for MIMO-OFDM systems, that is, MOD-R2MDC pipeline FFT based MIMO-OFDM system. The R2 algorithm with multi delay commutation architecture is to support 4 channel 8, 16, 32, 64, 128, 512, 1024 and 2048 point FFT operations [7],[ 8] . We compare this proposed architecture with existing 8 point R2, R4 FFT and existing R2MDC FFT and also give the design and implementation results of the proposed MOD-R2MDC FFT processor.

 2. Overview of MIMO OFDM

The general transceiver structure of MIMO-OFDM is presented in [Figure 1]. The system consists of N transmitter antennas and M receiver antennas. According to [9] and [10] , the cyclic prefix is assumed to be a longer than the channel delay spread. The OFDM signal for each antenna is obtained by using IFFT and can be detected by FFT. There are two methods widely used for transmitting MIMO data. If the channel has a negligible error rate, we can send several data simultaneously over multiple antennas. This is known as spatial multiplexing, which utilizes the spectrum very efficiently.{Figure 1}

In contrast, if the environment has high error rate, we transmit the same data over multiple antennas. This is called as space-time coding. The purpose of this approach is to increase the diversity of MIMO to combat signal fading. The essential purpose of an MIMO system is to determine, which antenna is corresponding to which data on the receiver side. As shown in [Figure 1], R×1 receives data from all the transmitter antennas, T×1, T×2, T×3 and T×4. Thus, we must have a special decoding algorithm to identify which antenna has transmitted which data to R×1. N×M MIMO-OFDM: N indicates the number of transmitter antennas and M indicates the number of receiver antennas, respectively. For example, 4×4 MIMO-OFDM has four transmitter antennas and four receiver antennas as shown in [Figure 1].

OFDM is a multi-carrier system where data bits are encoded to multiple sub-carriers. Unlike single carrier systems, all the frequencies are sent simultaneously in time. OFDM offers several advantages over single carrier system like better multipath effect immunity, simpler channel equalization and relaxed timing acquisition constraints. However, it is more susceptible to local frequency offset and radio front-end non-linearity [11] . The frequencies used in OFDM system are orthogonal. Neighboring frequencies with overlapping spectrum can, therefore, be used [12] .

 3. Inverse Fast Fourier Transform/Fast Fourier Transform Algorithm

In this section, a brief overview of IFFT and FFT algorithms is provided to be effectively used in OFDM applications. The N-point Discrete FFT (DFT) is defined as:


X(k) is the k th harmonic and x(n) is the n th input sample. Direct DFT calculation requires a computational complexity of O (N 2 ). By using The Cooley-Tukey FFT algorithm, the complexity can be reduced to O (N log r N). The Cooley-Tukey FFT is the most universal of all FFT algorithms, due to any factorization of N is possible.

The Cooley-Tukey algorithm is based on a divide-conquers approach in the frequency domain and therefore is referred to as decimation-in-frequency (DIF) FFT. The DFT formula is split into two summations:


X[k] can be decimated into even-and odd indexed frequency samples:


The computational procedure can be repeated through decimation of the N/2-point DFTs X(2k) and DFTs X(2K+1). The entire algorithm involves log 2 N stages, where each stage involves N/2 operation units (butterflies). The computation of the N point DFT via the DIF FFT, as in the decimation-in-time algorithm requires (N/2).log 2 N complex multiplication and N.log 2 N complex addition [13] .

 4. Proposed Modified Radix-2 Multipath Delay Commutation Architecture

The R2 butterfly processor is consists of a complex adder and complex subtraction. Besides that, an additional complex multiplier for the twiddle factors W N is implemented. The complex multiplication with the twiddle factor requires four real multiplications and two add/subtract operations as shown in [Figure 2].{Figure 2}

The MOD-R2MDC is one of the commutated architectures of R2 FFT algorithm which is used to commutate the values as fast as possible in order to process the values and to commutate the FFT inputs, the architecture shown in the [Figure 3] is consists of different blocks which must be used in the MOD-R2MDC.{Figure 3}

One of the most straightforward approaches for pipeline implementation of R2 FFT algorithm is MOD-R2MDC architecture. It is the simplest way to rearrange data for the FFT/IFFT algorithm, the input data sequence are broken into two parallel data stream flowing forward, with correct distance between data elements entering the butterfly scheduled by proper delays [14] . At each stage of the 8-point FFT in MOD-R2MDC architecture, half of the data flow is delayed via the memory (Register) and processed with the second half data stream [15] .

The A input comes from the previous component twiddle factor multipliers. The B output is fed to the next component, normally butterfly II (BF II). In first cycles, multiplexors direct the input data to the feedback registers until they are filled (position"0").

On next cycles, the multiplexers select the output of the adders or subtractors (position "1"), the butterfly computes a 2-point DFT with incoming data and the data stored in the feedback registers. The architecture of BF I and BF II supporting two receive chains is shown in [Figure 4] (a) and 4 (b). In BF I structure, the sample routing multiplexers and demultiplexers at the input and output of the BF-random access memories (BF-RAMs) are controlled based on c2 and c3 control signals while the computation unit is controlled by c1 control signal. Depending on the programming of number of receive chains, the extra BF-RAMs are enabled. Based on the requirement extra buffers can be extended to the existing BF structure. Since the handling −1, +j and −j multiplication is handled inside the BF II structure, two control signals c1 and c2 are used in the basic computation unit. The multiplexers and the demultiplexers are controlled by c3 and c4 control signals. The product with '−j' term is implemented by swapping the real and imaginary part considering the sign of the sample. The algorithm used here is to commutate the R2 algorithm in the IFFT architecture [2] . In order to optimize the processor, the proposed shift and add method that eliminates the non-trivial complex multiplication with the twiddle factors (W8 1 , W8 3 ) and implements the processor without complex multiplication. The proposed butterfly processor performs the multiplication with the trivial factor W8 2 = −j by switching from real to imaginary part and imaginary to real part, with the factor W8 0 by a simple cable. With the non-trivial factors W8 1 = e−jπ/4, W8 3 = e−j3π/4, the processor realize the multiplication by the factor 1/√2 using hardwired shift/add operation as shown in [Figure 5].{Figure 4}{Figure 5}

 5. Results and Discussion

The prime objective is to construct a FFT in order to have low power consumption and lesser area. The parameters (i) power consumption (ii) area occupancy were given due consideration for comparing the proposed circuit with other FFTs. The experimental results analysis consists of six different types of architectures such as R2, R4, spilt radix, mixed R4/2, R2MDC and MOD R2MDC FFT that can be implemented in the Altera cyclone II development and education 2 (DE2) field programmed gate array (FPGA). We have designed all coding using hardware description language. To get power, and area report, we use XILINX ISE design suite 10.1 as synthesis tool and modelsim 6.3c for simulation. The purpose is to determine the resource usage of this proposed design attempts to eliminate the complex multiplication, hence avoid this expensive operation of multiplication and consumes less chip area. The proposed MOD-R2MDC FFT gives better result than R2 FFT and existing R2MDC FFT in terms of area and power consumption as shown in the [Table 1].{Table 1}

The simulation results for various FFT algorithms have been tested practically by implementing in the Altera DE-2 FPGA development board. The Quartus-II tool is used to download the design in to FPGA development board. In the FPGA board, the reset signal input is connected to the rightmost switch. For set the binary inputs at the remaining switches, after the process in the FPGA, the outputs are seen in light emitting diode display in the board. Also these FPGA output can be verified with simulation results obtained using Modelsim 6.3c. The FPGA board has developed to verify their circuit behavior and implementation of MIMO-OFDM in wireless telecommunication system.

 6. Conclusion

In this work, several FFT algorithms such as R2, R2MDC and the proposed MOD R2MDC FFT were designed using VLSI design process and their performances were analyzed. From the results, it was observed that the proposed MOD R2MDC uses least numbers of configurable logic block slices and save the area of approximately 10% and it consumes <20% of power when compared to other FFT. It is seen that the new MOD-R2MDC FFT algorithm provides lesser area and low power consumption. The very high speed integrated hardware description language simulation results have been tested practically by implementing in the Altera DE-2 FPGA development board. Also the existing OFDM system has been tested with these FFT algorithms and their performance was analyzed with respect to occupation of area in FPGA and power consumption. We conclude that the proposed MOD-R2MDC architecture is occupied a low area and consumed less power than the existing R2 and R2MDC FFT algorithm architecture. The proposed architecture is shows that it can be used in for low power applications such as MIMO-OFDM in wireless communication system.


1J. Chung, Y. Yun, S. Choi, "Experiments on MIMO-OFDM system combined with adaptive beamforming based on IEEE 802.16e WMAN standard," Telecommunication systems Journal, Vol.52, pp. 1931-1944, 2013.
2G. Jongren, M. Skoglund, and B. Ottersten, "Combining beamforming and orthogonal space time block coding," IEEE Transactions on Information Theory, Vol. 48, no. 3, pp. 611-627, 2002.
3D. W. Byun, Y. M. Ki, and D. K. Kim, "Channel state aware joint dynamic cell coordination scheme using adaptive modulation and variable reuse factor in OFDMA down link," Telecommunication systems Journal, Vol. 36, no.1-3, pp. 85-96, 2007.
4T. Aruna, and M. Suganthi, "Variable power adaptive MIMO OFDM system under imperfect CSI for mobile adhoc networks," Telecommunication systems Journal, Vol. 50, no. 1, pp. 47-53, 2012.
5H. Bolcskei, D. Gesbert, and A. J. Paulraj, "On the capacity of OFDM-Based Spatial Multiplexing systems," IEEE Transactions Communication, vol. 50, no. 2, pp. 225-234, 2002.
6R. Holakouei, A. Silva, and A. Gameiro, "Multiuser precoding techniques for a distributed broadband wireless system," Telecommunication systems Journal, Vol. 52, pp. 1819-1829, 2013.
7P. Coulton, and D. Carline, "An SDR inspired design for the FPGA implementation of 802.11a baseband system," UK: Proceedings of the IEEE International Symposium on Consumer Electronics, pp. 470-475, Sep 2004.
8C. Dick, and F. Harris, "FPGA implementation of an OFDM PHY," Proceedings of the Conference Record of the 37th Asilomar Conference on Signals, Systems and Computers, vol. 1, Pacific Grove, CA., USA., pp. 905-909, Nov 2003.
9S. M. Alamouti, "A Simple Transmit Diversity Technique for Wireless Communications," IEEE Journal on select areas in communications, Vol. 16, no.8, pp. 1451-1458, 1998.
10N. Kirubanandasarathy, K. Karthikeyan, and K. Thirunadanasikamani, "VLSI Design of Mixed Radix FFT for MIMO OFDM In the wireless communication," Proceedings of the IEEE International conference on communication computing control technologies, Ramanathapuram, India, pp. 98-102, Oct 2010.
11R. S. Blum, Y. G. Li, J. H. Winters, and Q. Yan, "Improved space time coding for MIMO-OFDM wireless communications," IEEE Transactions Communications, vol. 49, no. 11, pp. 1873-1878, 2001.
12G. Femenias and F. Riera-Palou, "Enhancing IEEE 802.11n WLANs using group orthogonal code division multiplex," Telecommunication systems Journal, Vol. 38, no. 1-2, pp. 37-44, 2008.
13M. Arioua, S. Belkouch, M. Agdad, and M. M. Hassani, "VHDL implementation of an optimized 8-point FFT/IFFT processor in pipeline architecture for OFDM systems," Proceedings of the IEEE International conference on Multimedia computing and system, pp. 1-5, 2011.
14J. Becker, "Configurable Systems on Chip.' Proceedings of the 15 th Symposium on Integrated Circuits and Systems Design, (SICSD'2002), Karlsruhe University, Germany, pp. 379-384, 2002.
15W. Han, T. Arslan, A. T. Erdogan, and M. Hasan, "Multiplier-less based parallel pipelined FFT architectures for wireless communication applications," Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Volume 5, 18-23 Edinburgh University, UK., pp. v/45-v/48, Mar 2005.