Výsledky vyhľadávania - Computer systems organization Architectures Serial architectures Pipeline computing~

1

Načíta sa…

Lookup Table-based Multiplication-free All-digital DNN Accelerator Featuring Self-Synchronous Pipeline Accumulation Autor Tagata, Hiroto, Sato, Takashi, Awano, Hiromitsu

Vydavateľské údaje: IEEE 22.06.2025

Vydané v 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… Previous research has employed large analog computing circuits to convert inputs into LUT addresses, which presents challenges to area efficiency and computational accuracy…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
2

Načíta sa…

Bit-level Perceptron Prediction for Indirect Branches Autor Garza, Elba, Mirbagher-Ajorpaz, Samira, Khan, Tahsin Ahmad, Jimenez, Daniel A.

ISSN: 2575-713X

Vydavateľské údaje: ACM 01.06.2019

Vydané v 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (01.06.2019)
“…Modern software uses indirect branches for various purposes including, but not limited to, virtual method dispatch and implementation of switch statements…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
3

Načíta sa…

UpPipe: A Novel Pipeline Management on In-Memory Processors for RNA-seq Quantification Autor Chen, Liang-Chi, Ho, Chien-Chung, Chang, Yuan-Hao

Vydavateľské údaje: IEEE 09.07.2023

Vydané v 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“…, memory wall problem on the conventional architecture. As the first publicly commercial processing-in-memory (PIM…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
4

Načíta sa…

Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution Autor Bera, Rahul, Ranganathan, Adithya, Rakshit, Joydeep, Mahto, Sujit, Nori, Anant V., Gaur, Jayesh, Olgun, Ataberk, Kanellopoulos, Konstantinos, Sadrosadati, Mohammad, Subramoney, Sreenivas, Mutlu, Onur

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“… (even on a correct prediction), which consumes hard-to-scale pipeline resources that otherwise could have been used to execute other load instructions…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
5

Načíta sa…

MixPipe: Efficient Bidirectional Pipeline Parallelism for Training Large-Scale Models Autor Zhang, Weigang, Zhou, Biyu, Tang, Xuehai, Wang, Zhaoxing, Hu, Songlin

Vydavateľské údaje: IEEE 09.07.2023

Vydané v 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“… Recently, bidirectional pipeline parallelism has been recognized as an effective approach for improving training throughput…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
6

Načíta sa…

Buffer Prospector: Discovering and Exploiting Untapped Buffer Resources in Many-Core DNN Accelerators Autor Wei, Yuchen, Cai, Jingwei, Gao, Mingyu, Peng, Sen, Wu, Zuotong, Shi, Guiming, Ma, Kaisheng

Vydavateľské údaje: IEEE 22.06.2025

Vydané v 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…In large-scale DNN inference accelerators, the many-core architecture has emerged as a predominant design, with layer-pipeline (LP…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
7

Načíta sa…

Alternate Path μ-op Cache Prefetching Autor Singh, Sawan, Perais, Arthur, Jimborean, Alexandra, Ros, Alberto

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Datacenter applications are well-known for their large code footprints. This has caused frontend design to evolve by implementing decoupled fetching and large…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
8

Načíta sa…

Alternate Path Fetch Autor Deshmukh, Aniket, Cai, Lingzhe Chester, Patt, Yale N.

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“… This requires building wider pipelines with more accurate branch predictors. However, scaling the pipeline width is becoming more challenging due to limitations…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
9

Načíta sa…

Spatz: A Compact Vector Processing Unit for High-Performance and Energy-Efficient Shared-L1 Clusters Autor Cavalcante, Matheus, Wuthrich, Domenic, Perotti, Matteo, Riedel, Samuel, Benini, Luca

ISSN: 1558-2434

Vydavateľské údaje: ACM 29.10.2022

Vydané v 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD) (29.10.2022)
“…While parallel architectures based on clusters of Processing Elements (PEs) sharing L1 memory are widespread, there is no consensus on how lean their PE…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
10

Načíta sa…

UDP: Utility-Driven Fetch Directed Instruction Prefetching Autor Oh, Surim, Xu, Mingsheng, Khan, Tanvir Ahmed, Kasikci, Baris, Litz, Heiner

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“… However, our study shows that existing implementations still fall far short of an ideal system with a perfect instruction cache…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
11

Načíta sa…

X-Layer: Building Composable Pipelined Dataflows for Low-Rank Convolutions Autor Vedula, Naveen, Hojabr, Reza, Khonsari, Ahmad, Shriraman, Arrvindh

Vydavateľské údaje: IEEE 01.09.2021

Vydané v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“… X-Layer decouples the nested loops in a pipeline and combines them to create a common outer dataflow and several inner dataflows…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
12

Načíta sa…

AVM-BTB: Adaptive and Virtualized Multi-level Branch Target Buffer Autor Liu, Yunzhe, Li, Xinyu, Zhang, Tingting, Liu, Tianyi, Guo, Qi, Zhang, Fuxin, Wang, Jian

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Branch Target Buffer (BTB) plays an important role in modern processors. It is used to identify branches in the instruction stream and predict branch targets…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
13

Načíta sa…

PipeLink: A Pipelined Resource Sharing System for Dataflow High-Level Synthesis Autor Li, Rui, Berkley, Lincoln, Manohar, Rajit

Vydavateľské údaje: IEEE 22.06.2025

Vydané v 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…Dynamically scheduled high-level synthesis (HLS) is an approach to HLS that maps programs into dataflow circuits. These circuits use distributed control for…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
14

Načíta sa…

Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores Autor Sheikh, Rami, Cain, Harold W., Damodaran, Raguram

ISBN: 1450349528, 9781450349529

ISSN: 2379-3155

Vydavateľské údaje: New York, NY, USA ACM 14.10.2017

Vydané v MICRO-50 : the 50th annual IEEE/ACM International Symposium on Microarchitecture : proceedings : October 14-18, 2017, Cambridge, MA (14.10.2017)
“… Challenge #2: value mispredictions trigger costly pipeline flushes. To minimize the number of pipeline flushes, value predictors employ stringent, yet necessary, high confidence requirements to guarantee high prediction accuracy…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
15

Načíta sa…

Leaky MDU: ARM Memory Disambiguation Unit Uncovered and Vulnerabilities Exposed Autor Liu, Chang, Lyu, Yongqiang, Wang, Haixia, Qiu, Pengfei, Ju, Dapeng, Qu, Gang, Wang, Dongsheng

Vydavateľské údaje: IEEE 09.07.2023

Vydané v 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“…Memory Disambiguation Unit (MDU) is widely used on modern processors to speculatively execute load instructions and improve pipeline performance…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
16

Načíta sa…

Pipelining a triggered processing element Autor Repetti, Thomas J., Cerqueira, João P., Kim, Martha A., Seok, Mingoo

ISBN: 1450349528, 9781450349529

ISSN: 2379-3155

Vydavateľské údaje: New York, NY, USA ACM 14.10.2017

Vydané v MICRO-50 : the 50th annual IEEE/ACM International Symposium on Microarchitecture : proceedings : October 14-18, 2017, Cambridge, MA (14.10.2017)
“… We propose two new techniques to mitigate pipeline hazards particular to spatial accelerators and non-program-counter architectures, evaluating them using in-vivo performance counters from an FPGA…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
17

Načíta sa…

SMT-COP: Defeating Side-Channel Attacks on Execution Units in SMT Processors Autor Townley, Daniel, Ponomarev, Dmitry

ISSN: 2641-7936

Vydavateľské údaje: IEEE 01.09.2019

Vydané v Proceedings / International Conference on Parallel Architectures and Compilation Techniques (01.09.2019)
“…) architectures from the security standpoint. To address this problem, we propose SMT-COP-a system that eliminates all known side-channels through shared execution logic, including ports and functional units, on SMT processors…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
18

Načíta sa…

Filter Caching for Free: The Untapped Potential of the Store-Buffer Autor Alves, Ricardo, Ros, Alberto, Black-Schaffer, David, Kaxiras, Stefanos

ISSN: 2575-713X

Vydavateľské údaje: ACM 01.06.2019

Vydané v 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (01.06.2019)
“…Modern processors contain store-buffers to allow stores to retire under a miss, thus hiding store-miss latency. The store-buffer needs to be large (for…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
19

Načíta sa…

FabScalar: composing synthesizable RTL designs of arbitrary cores within a canonical superscalar template Autor Choudhary, Niket K., Wadhavkar, Salil V., Shah, Tanmay A., Mayukh, Hiran, Gandhi, Jayneel, Dwiel, Brandon H., Navada, Sandeep, Najaf-abadi, Hashem H., Rotenberg, Eric

ISBN: 9781450304726, 1450304729

ISSN: 1063-6897

Vydavateľské údaje: New York, NY, USA ACM 04.06.2011

Vydané v 2011 38th Annual International Symposium on Computer Architecture (ISCA) (04.06.2011)
“…: superscalar width, pipeline depth, and sizes of structures for extracting instruction-level parallelism (ILP…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
20

Načíta sa…

Sparse-T: Hardware accelerator thread for unstructured sparse data processing Autor Vasireddy, Pranathi, Kavi, Krishna, Mehta, Gayatri

ISSN: 1558-2434

Vydavateľské údaje: ACM 29.10.2022

Vydané v 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD) (29.10.2022)
“…Sparse matrix-dense vector (SpMV) multiplication is inherent in most scientific, neural networks and machine learning algorithms. To efficiently exploit…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:

Výsledky vyhľadávania - Computer systems organization Architectures Serial architectures Pipeline computing~

Lookup Table-based Multiplication-free All-digital DNN Accelerator Featuring Self-Synchronous Pipeline Accumulation Autor Tagata, Hiroto, Sato, Takashi, Awano, Hiromitsu

Bit-level Perceptron Prediction for Indirect Branches Autor Garza, Elba, Mirbagher-Ajorpaz, Samira, Khan, Tahsin Ahmad, Jimenez, Daniel A.

UpPipe: A Novel Pipeline Management on In-Memory Processors for RNA-seq Quantification Autor Chen, Liang-Chi, Ho, Chien-Chung, Chang, Yuan-Hao

MixPipe: Efficient Bidirectional Pipeline Parallelism for Training Large-Scale Models Autor Zhang, Weigang, Zhou, Biyu, Tang, Xuehai, Wang, Zhaoxing, Hu, Songlin

Buffer Prospector: Discovering and Exploiting Untapped Buffer Resources in Many-Core DNN Accelerators Autor Wei, Yuchen, Cai, Jingwei, Gao, Mingyu, Peng, Sen, Wu, Zuotong, Shi, Guiming, Ma, Kaisheng

Alternate Path μ-op Cache Prefetching Autor Singh, Sawan, Perais, Arthur, Jimborean, Alexandra, Ros, Alberto

Alternate Path Fetch Autor Deshmukh, Aniket, Cai, Lingzhe Chester, Patt, Yale N.

Spatz: A Compact Vector Processing Unit for High-Performance and Energy-Efficient Shared-L1 Clusters Autor Cavalcante, Matheus, Wuthrich, Domenic, Perotti, Matteo, Riedel, Samuel, Benini, Luca

UDP: Utility-Driven Fetch Directed Instruction Prefetching Autor Oh, Surim, Xu, Mingsheng, Khan, Tanvir Ahmed, Kasikci, Baris, Litz, Heiner

X-Layer: Building Composable Pipelined Dataflows for Low-Rank Convolutions Autor Vedula, Naveen, Hojabr, Reza, Khonsari, Ahmad, Shriraman, Arrvindh

AVM-BTB: Adaptive and Virtualized Multi-level Branch Target Buffer Autor Liu, Yunzhe, Li, Xinyu, Zhang, Tingting, Liu, Tianyi, Guo, Qi, Zhang, Fuxin, Wang, Jian

PipeLink: A Pipelined Resource Sharing System for Dataflow High-Level Synthesis Autor Li, Rui, Berkley, Lincoln, Manohar, Rajit

Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores Autor Sheikh, Rami, Cain, Harold W., Damodaran, Raguram

Leaky MDU: ARM Memory Disambiguation Unit Uncovered and Vulnerabilities Exposed Autor Liu, Chang, Lyu, Yongqiang, Wang, Haixia, Qiu, Pengfei, Ju, Dapeng, Qu, Gang, Wang, Dongsheng

Pipelining a triggered processing element Autor Repetti, Thomas J., Cerqueira, João P., Kim, Martha A., Seok, Mingoo

SMT-COP: Defeating Side-Channel Attacks on Execution Units in SMT Processors Autor Townley, Daniel, Ponomarev, Dmitry

Filter Caching for Free: The Untapped Potential of the Store-Buffer Autor Alves, Ricardo, Ros, Alberto, Black-Schaffer, David, Kaxiras, Stefanos

FabScalar: composing synthesizable RTL designs of arbitrary cores within a canonical superscalar template Autor Choudhary, Niket K., Wadhavkar, Salil V., Shah, Tanmay A., Mayukh, Hiran, Gandhi, Jayneel, Dwiel, Brandon H., Navada, Sandeep, Najaf-abadi, Hashem H., Rotenberg, Eric

Sparse-T: Hardware accelerator thread for unstructured sparse data processing Autor Vasireddy, Pranathi, Kavi, Krishna, Mehta, Gayatri

Vyhľadávacie nástroje:

Upresniť hľadanie

Médium

Predmetová oblasť

Téma

Jazyk

Rok vydania