Application-Specific Integrated Circuit Design

At the Integrated Information Processing (IIP) Group, we strive to develop application specific integrated circuits (ASICs) that achieve highest throughput, lowest power, minimal latency, and best solution quality. In order to accomplish this task, we develop new and transformative algorithms from the ground up, which are designed to be hardware friendly. While implementing our algorithms in hardware, we regularly discover new theoretical aspects and architecture-level tricks that that further improve efficiency. We also levarage emerging architecture and ASIC-​design concepts, including in- and near-​memory processing, and semi-​custom standard-​cell design.  

Our unique cross-stack design approach that jointly considers theory, algorithms, and hardware design, allows us to develop state-of-the-art ASICs for a number of different applications.

Our ASIC designs are often done in collaboration with the Microelectronics Design Center. Some of our recent ASIC implementations are as follows:

This image shows a square 9mm^2 22FDX ASIC containing PULPO, an accelerator designed by the IIP group. PULPO accelerates first-order optimization methods by using an architecture that is tightly coupled to the ASIC’s memory system. The ASIC features a logo with a Kraken.
ASIC implementation of PULPO

Kraken (2021; GF 22FDX)
This 9mm2
chip contains several designs based on the Parallel Ultra-Low Power (PULP) architecture, developed by the Digital Circuits and Systems Group. One of those designs is PULPO, which was created by the IIP group. PULPO is the first floating-point baseband-processing accelerator for massive MIMO basestations, and is designed to accelerate three atomic operations that are critical to several linear and nonlinear algorithms for both uplink and downlink. To increase its programmability and flexibility, PULPO is integrated in a system-on-chip (SoC) with a tight coupling to the system's data memory, facilitating data exchange and co-operation with 8 PULP RISC-V cores.

This image shows a 4mm x 2mm 65nm ASIC implementing the first scalable, all-digital massive MU-MIMO spatial equalizer. The ASIC contains 64 ADCs, SMUL for FFT computing, BEACHES for channel estimation, and PPAC for performing finite-alphabet MMSE equalization.
ASIC implementation of PPAC, BEACHES, SMUL-FFT, and FAME

SIR PPAC (2020; TSMC 65nm)
This 4mm x 2mm mixed-signal chip constitutes the first scalable all-digital massive MU-MIMO spatial equalizer. The ASIC is designed for a 32-antenna base station serving up to 16 user terminals. The design contains an analog front end composed by 64 programmable-resolution SAR ADCs, which were designed by our collaborators from the external pageMolnar Research Group at Cornell University. Furthermore, the chip implements novel low-resolution techniques for energy-efficient channel estimation, equalization, and beamspace processing. In specific, the ASIC features our patented beamspace channel estimation (BEACHES) algorithm, our high-throughput streaming-multiplierless (SMUL) FFT, and our all-digital, processing-in-memory (PIM) accelerator PPAC, which is implementing low-resolution finite-alphabet MMSE (FAME) equalization.
SIR PPAC was awarded with the Best Paper Award at ESSCIRC 2021.

This image shows a square 2mm^2 28nm ASIC containing LAMA, C1PO, and NOPE, three accelerators for large multi-antenna wireless communication systems designed by the IIP group. The ASIC features the artwork of three lamas contemplating the horizon, symbolizing the three implemented designs.
ASIC implementation of LAMA, C1PO, and NOPE

LAMA, C1PO, and NOPE (2018; TSMC 28nm)
This 2mm2 chip contains three different accelerators for large multi-antenna wireless communication systems. In specific, the ASIC includes  (i) the large-MIMO approximate message passing (LAMA) data detector, which offers near-optimal performance for realistic massive multiuser MIMO wirelss, (ii) the biconvex 1-bit precoding (C1PO) algorithm, which is the first ASIC implementation of a 1-bit precoder for the massive multiuser MIMO downlink, and (iii) the non-parametric equalizer (NOPE), a matrix-inversion-free, parameter-free equalizer that is robust to hardware impairments. All three designs exploit near-memory processing and energy-efficient latch-arrays to tightly integrate memory and computations. The ASIC also features the artwork of three lamas contemplating the horizon, symbolizing the three implemented designs.

JavaScript has been disabled in your browser