Accelerating Deep Convolutional Neural Networks Using Specialized Hardware

Kalin Ovtcharov, Olatunji Ruwase, Joo-Young Kim, Jeremy Fowers, Karin Strauss, and Eric S. Chung

Abstract

We describe the design of a convolutional neural network accelerator running on a Stratix V FPGA. The design runs at three times the throughput of previous FPGA CNN accelerator designs. We show that the throughput/watt is significantly higher than for a GPU, and project the performance when ported to an Arria 10 FPGA.

Details

Publication type: Miscellaneous