Think Fast: A Tensor Streaming Processor (TSP) for Accelerating Deep Learning Workloads

In this paper, we introduce the Tensor Streaming Processor (TSP) architecture, a functionally-sliced microarchitecture with memory units interleaved with vector and matrix deep learning functional units in order to take advantage of dataflow locality of deep learning operations. The TSP is built bas...

Bibliographic Details
Published in: 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), pp. 145-158
Main Authors: Abts, Dennis; Ross, Jonathan; Sparling, Jonathan; Wong-VanHaren, Mark; Baker, Max; Hawkins, Tom; Bell, Andrew; Thompson, John; Kahsai, Temesghen; Kimmell, Garrin; Hwang, Jennifer; Leslie-Hurd, Rebekah; Bye, Michael; Creswick, E.R.; Boyd, Matthew; Venigalla, Mahitha; Laforge, Evan; Purdy, Jon; Kamath, Purushotham; Maheshwari, Dinesh; Beidler, Michael; Rosseel, Geert; Ahmad, Omar; Gagarin, Gleb; Czekalski, Richard; Rane, Ashay; Parmar, Sahil; Werner, Jeff; Sproch, Jim; Macias, Adrian; Kurtz, Brian
Format: Conference Proceeding
Language: English
Published: IEEE, 1 May 2020