Parallel processing intensive digital front-end for IEEE 802.11ac receiver
Aghababaeetafreshi, Mona; Yli-Kaakinen, Juha; Levanen, Toni; Korhonen, Ville; Jääskeläinen, Pekka; Renfors, Markku; Valkama, Mikko; Takala, Jarmo (2016)
Aghababaeetafreshi, Mona
Yli-Kaakinen, Juha
Levanen, Toni
Korhonen, Ville
Jääskeläinen, Pekka
Renfors, Markku
Valkama, Mikko
Takala, Jarmo
IEEE
2016
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202010297697
https://urn.fi/URN:NBN:fi:tuni-202010297697
Kuvaus
Peer reviewed
Tiivistelmä
Modern computing platforms offer increasing levels of parallelism for fast execution of different signal processing tasks. In this paper, we develop and elaborate on a digital front-end concept for an IEEE 802.11ac receiver with 80 MHz bandwidth where parallel processing is adopted in multiple ways. First, the inherent structure of the 802.11ac waveform is utilized such that it is divided, through time-domain digital filtering and decimation, to two parallel 40 MHz signals that can be processed further in parallel using smaller-size FFTs and, e.g, legacy 802.11n digital receiver chains. This filtering task is very challenging, as the latency and the cyclic prefix budget of the receiver cannot be compromised, and because the number of unused subcarriers in the middle of the 80 MHz signal is only three, thus necessitating very narrow transition bandwidth in the deployed filters. Both linear and circular filtering based multirate channelization architectures are developed and reported, together with the corresponding filter coefficient optimization. Also, full radio link performance simulations with commonly adopted indoor WiFi channel profiles are provided, verifying that the channelization does not degrade the overall link performance. Then, both C and OpenCL software implementations of the processing are developed and simulated for comparison purposes on an Intel CPU, to demonstrate that the parallelism provided by the OpenCL will result in substantially faster realization. Furthermore, we provide complete software implementation results in terms of time, number of clock cycles, power, and energy consumption on the ARM Mali GPU with half precision floating-point arithmetic along with the ARM Cortex A7 CPU.
Kokoelmat
- TUNICRIS-julkaisut [16951]