2024 Fall Research Update – MIT AI Hardware Program

MIT AI Hardware Program

Fall Research Update

Wednesday, November 13, 2024
11:00 - 1:10 PM ET
Virtual Event

Members of the MIT AI Hardware Program attended this annual virtual meeting covering updates on current research projects. Program co-leads Jesús del Alamo and Aude Oliva gave a program overview and researchers presented in-depth presentations with Q&As.

Watch the recorded talks below.

Artificial Intelligence brain shape in a complex and modern GPU card in shades of purple, blue, and gold

Agenda

11:00 – 11:05

Year in Review & the Year Ahead

Program Co-Leads

Jesús del Alamo, Donner Professor; Professor, Electrical Engineering and Computer Science; MacVicar Faculty Fellow
Aude Oliva, Director of Strategic Industry Engagement, MIT Schwarzman College of Computing; CSAIL Senior Research Scientist

11:05 – 11:20

Neuromorphic Devices and Systems Enabled by Wafer-Scale CVD Growth of 2D Transition Metal Dichalcogenides

Tomás Palacios, Clarence J. Lebel Professor in Electrical Engineering, Electrical Engineering and Computer Science; Director, Microsystems Technology Laboratories

This project aims to explore the use of two-dimensional transition metal dichalcogenides (TMDs), such as MoS2 and WSe2, as neuromorphic devices. We will leverage the extremely low leakage current of wide-bandgap TMD materials to develop floating-gate transistors (FGFETs), where changes in charge stored at the floating gate alter the MoS2 channel conductance. Floating gate structures will be first simulated and then experimentally demonstrated on TMD’s grown by metal-organic chemical vapor deposition (MOCVD) at back-end-of-line (BEOL)-compatible temperatures and integrated on a standard silicon CMOS process. For this, we will build on the MoS2 low-temperature 200 mm wafer-scale MOCVD growth technology recently demonstrated by our group, and we will fabricate highly-scaled heterostructure-based devices to ensure reproducible neuromorphic devices with record-low power consumption. We will also explore the impact of different processing steps such as lithography and deposition conditions on the device performance and stability.

In collaboration with Jing Kong, Jerry Mcafee (1940) Professor In Engineering, Electrical Engineering and Computer Science

11:20 – 11:35

Secure Digital In-Memory Compute for Machine Learning Applications

Maitreyi Ashok, PhD Candidate, Electrical Engineering and Computer Science

This research presents a general method for secure machine learning hardware using a digital in-memory compute architecture through decorrelation of circuit currents from neural network data, encrypting off-chip model storage, and local secret key generation.

In collaboration with Anantha Chandrakasan, Chief Innovation and Strategy Officer; Dean, MIT School of Engineering; Vannevar Bush Professor of Electrical Engineering and Computer Science

Slides

11:35 – 11:50

CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs

Lejla Skelić, MEng Candidate, Electrical Engineering and Computer Science

The role of Large Language Models (LLMs) has not been extensively explored in analog circuit design, which could benefit from a reasoning-based approach that transcends traditional optimization techniques. In particular, despite their growing relevance, there are no benchmarks to assess LLMs’ reasoning capability about circuits. Therefore, we created the CIRCUIT dataset consisting of 510 question-answer pairs spanning various levels of analog-circuit-related subjects. The best-performing model on our dataset, GPT-4o, achieves 48.04% accuracy when evaluated on the final numerical answer. To evaluate the robustness of LLMs on our dataset, we introduced a unique feature that enables unit-test-like evaluation by grouping questions into unit tests. In this case, GPT-4o can only pass 27.45% of the unit tests, highlighting that the most advanced LLMs still struggle with understanding circuits, which requires multi-level reasoning, particularly when involving circuit topologies. This circuit-specific benchmark highlights LLMs’ limitations, offering valuable insights for advancing their application in analog integrated circuit design.

In collaboration with Ruonan Han, Associate Professor, Electrical Engineering and Computer Science

Slides

11:50 – 12:05

Increasing Architectural Resilience to Small Delay Faults

Peter Deutsch and Vincent Ulitzsch, PhD Candidates, Electrical Engineering and Computer Science

We aim to create models for new fault modes in processors, addressing reliability challenges for large-scale data centers. Our research develops methods for designing resilient hardware, guiding cost-effective protection strategies for scalability.

In collaboration with Mengjia Yan, Assistant Professor, Electrical Engineering and Computer Science, and Joel S. Emer, Professor of the Practice, Electrical Engineering and Computer Science

Slides

12:05 – 12:20

Efficient Large Language Models and Generative AI

Guangxuan Xiao, PhD Candidate, Electrical Engineering and Computer Science

The rapid advancement of generative AI, particularly large language models (LLMs), presents unprecedented computational challenges. LLMs boast billions of parameters (e.g., Llama3: 70B),more than 1000× larger than TinyML models. The autoregressive nature of LLMs makes inference memory bounded. Generating long sequences further compounds the memory demand. Our research addresses these challenges by quantization, low-precision systems and KV cache optimization.

In collaboration with Song Han, Associate Professor, Electrical Engineering and Computer Science

Slides

12:20 – 12:50

More with Less: Fault-Tolerance and Information-Theoretic Optimality in Programmable Photonics

Ryan Hamerly, Research Scientist at MIT and Senior Scientist at NTT Research

This talk will explore calibration and optimality in programmable photonics, with a focus on error robustness and efficient phase-shifter usage in multiport interferometers. We examine the challenges in designing photonic circuits that are both efficient and robust to errors, particularly in the context of optical neural networks, boson sampling, and other advanced applications. Highlighting mesh architectures such as the Reck and Clements designs, the talk addresses how phase requirements can be minimized while maintaining system universality and low error sensitivity. We propose a novel 3-MZI structure, which reduces phase-shift demands and improves fault tolerance by offering more stable configurations. This architecture, when evaluated against information-theoretic bounds, approaches optimal phase efficiency for large-scale photonic meshes. Applications in self-configuring systems, error-aware training, and non-unitary photonic meshes will also be discussed, presenting a comprehensive approach to achieving near-optimal programmable photonic performance.

In collaboration with Dirk Englund, Associate Professor, Electrical Engineering and Computer Science

Slides

12:50 – 1:05

CMOS-Compatible Protonic Synapse Technology for Analog AI Training Accelerators

Dingyu Shen, PhD Candidate, Electrical Engineering and Computer Science

CMOS (Complementary Metal-Oxide-Semiconductor) technology and fabrication processes, the backbone of many in integrated circuits, provides several advantages: low power consumption, high density, scalability, and cost-effectiveness. Researchers are combining this with proton-based synapse elements that resemble human brain structures, making them well-suited for AI applications. This research focuses on integration using tungsten oxide, phosphosilicate glass (PSG) and a solid electrolyte in the fabrication process of different size arrays.

In collaboration with Jesús del Alamo, Donner Professor and Professor of Electrical Engineering, Department of Electrical Engineering and Computer Science

Slides

1:05 – 1:10

Closing Remarks

Program Co-Leads

Researchers

photo of Jesus del Alamo seated in his office

Jesús del Alamo

Donner Professor, Professor of Electrical Engineering and Computer Science

Nanoscale synaptic devices: physics, technology, modeling

Aude Oliva

Senior Research Scientist at MIT Schwarzman College of Computing

Energy efficient deep learning, and Computer vision

Tomás Palacios

Professor of Electrical Engineering and Computer Science

Materials graphene, Biosensors and Nanotechnologies

Jing Kong

Professor of Electrical Engineering and Computer Science

Nanotube electronic devices, Semiconductor nanowires

Anantha Chandrakasan

Vannevar Bush Professor of Electrical Engineering and Computer Science

Energy efficient wireless sensors, circuits, and architectures

Maitreyi Ashok

PhD Candidate, Electrical Engineering and Computer Science

Hardware security in digital and mixed-signal integrated circuits against physical attacks

Ruonan Han

Professor of Electrical Engineering and Computer Science

Microelectronic circuits and systems, Solid-state electronics

Lejla Skelić

MEng Candidate, Electrical Engineering and Computer Science

Computer systems and security, applied machine learning (ML), and nanotechnology

Song Han

Associate Professor of Electrical Engineering

Efficient hardware for computation-intensive AI

Guangxuan Xiao

PhD Candidate, Electrical Engineering and Computer Science

Efficient algorithms and systems for deep learning, specifically large foundation models

Dirk Englund

Associate Professor of Electrical Engineering and Computer Science

Photonic neural networks and circuits, Quantum photonics

Ryan Hamerly

Research Scientist at MIT's Quantum Photonics Laboratory

Integration of photonics, deep learning, quantum computing, and optimization

Joel Emer

Professor of the Practice, Electrical Engineering and Computer Science

Deep learning architectures, and Micro-architecture

Mengjia Yan

Assistant Professor of Electrical Engineering and Computer Science

Edge security, Computer architecture

Peter Deutsch

PhD candidate, MIT CSAIL

Architectural security against memory hierarchy side-channels

Vincent Ulitzsch

Visiting Researcher, MIT CSAIL

Micro-architectural and IT security with applied cryptography

Dingyu Shen

PhD Candidate, Electrical Engineering and Computer Science

MIT AI Hardware Program

Fall Research Update

Agenda

11:00 – 11:05

Year in Review & the Year Ahead

11:05 – 11:20

Neuromorphic Devices and Systems Enabled by Wafer-Scale CVD Growth of 2D Transition Metal Dichalcogenides

11:20 – 11:35

Secure Digital In-Memory Compute for Machine Learning Applications

11:35 – 11:50

CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs

11:50 – 12:05

Increasing Architectural Resilience to Small Delay Faults

12:05 – 12:20

Efficient Large Language Models and Generative AI

12:20 – 12:50

More with Less: Fault-Tolerance and Information-Theoretic Optimality in Programmable Photonics

12:50 – 1:05

CMOS-Compatible Protonic Synapse Technology for Analog AI Training Accelerators

1:05 – 1:10

Closing Remarks

Researchers

Jes​​ús del Alamo

Aude Oliva

Tomás Palacios

Jing Kong

Anantha Chandrakasan

Maitreyi Ashok

Ruonan Han

Lejla Skelić

Song Han

Guangxuan Xiao

Dirk Englund

Ryan Hamerly

Joel Emer

Mengjia Yan

Peter Deutsch

Vincent Ulitzsch

Dingyu Shen

Jesús del Alamo