Suchergebnisse - Computing methodologies Computer graphics Graphics systems AND interfaces Graphics processors~

  1. 1

    RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering von Li, Chaojian, Li, Sixu, Zhao, Yang, Zhu, Wenbo, Lin, Yingyan

    ISSN: 1558-2434
    Veröffentlicht: ACM 29.10.2022
    “… Neural Radiance Field (NeRF) based rendering has attracted growing attention thanks to its state-of-the-art (SOTA) rendering quality and wide applications in …”
    Volltext
    Tagungsbericht
  2. 2

    Mars: A MapReduce Framework on graphics processors von He, Bingsheng, Fang, Wenbin, Luo, Qiong, Govindaraju, Naga K., Wang, Tuyong

    Veröffentlicht: ACM 01.10.2008
    “… We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming framework originally proposed …”
    Volltext
    Tagungsbericht
  3. 3

    GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting von Li, Sixu, Keller, Ben, Lin, Yingyan Celine, Khailany, Brucek

    Veröffentlicht: IEEE 22.06.2025
    “… This work proposes an acceleration strategy that leverages the similarities between the 3DGS pipeline and the highly optimized conventional graphics pipeline in modern GPUs …”
    Volltext
    Tagungsbericht
  4. 4

    Vulkan-Sim: A GPU Architecture Simulator for Ray Tracing von Saed, Mohammadreza, Chou, Yuan Hsi, Liu, Lufei, Nowicki, Tyler, Aamodt, Tor M.

    Veröffentlicht: IEEE 01.10.2022
    “… have started to make use of ray tracing APIs to bring more realistic graphics to their players …”
    Volltext
    Tagungsbericht
  5. 5

    Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs von Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

    Veröffentlicht: IEEE 01.09.2021
    “… Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs …”
    Volltext
    Tagungsbericht
  6. 6

    Local-GS: An Order-Independent Gaussian Splatting Training Accelerator Exploiting Splat Locality von Sun, Yiyang, Zhi, Qinzhe, Jing, Yiqi, Ye, Le, Huang, Ru, Jia, Tianyu

    Veröffentlicht: IEEE 22.06.2025
    “… 3D Gaussian Splatting has emerged as the SOTA approach for 3D representation and view synthesis. While Gaussian Splatting has demonstrated impressive …”
    Volltext
    Tagungsbericht
  7. 7

    SynGPU: Synergizing CUDA and Bit-Serial Tensor Cores for Vision Transformer Acceleration on GPU von Yao, Yuanzheng, Zhang, Chen, Qi, Chunyu, Chen, Ruiyang, Wang, Jun, Fu, Zhihui, Jing, Naifeng, Liang, Xiaoyao, Song, Zhuoran

    Veröffentlicht: IEEE 22.06.2025
    “… Vision Transformers (ViTs) have demonstrated remarkable performance in computer vision tasks by effectively extracting global features …”
    Volltext
    Tagungsbericht
  8. 8

    Fine-grained DRAM: energy-efficient DRAM for extreme bandwidth systems von O'Connor, Mike, Chatterjee, Niladrish, Lee, Donghyuk, Wilson, John, Agrawal, Aditya, Keckler, Stephen W., Dally, William J.

    ISBN: 1450349528, 9781450349529
    ISSN: 2379-3155
    Veröffentlicht: New York, NY, USA ACM 14.10.2017
    “… Future GPUs and other high-performance throughput processors will require multiple TB/s of bandwidth to DRAM …”
    Volltext
    Tagungsbericht
  9. 9

    SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity von Fan, Zichen, Dai, Steve, Venkatesan, Rangharajan, Sylvester, Dennis, Khailany, Brucek

    Veröffentlicht: IEEE 22.06.2025
    “… Diffusion models have gained significant popularity in image generation tasks. However, generating high-quality content remains notably slow because it …”
    Volltext
    Tagungsbericht
  10. 10

    GSAcc: Accelerate 3D Gaussian Splatting via Depth Speculation and Gaussian-centric Rasterization von Yang, Mengtian, Wang, Yipeng, Lo, Chieh-Pu, Zhang, Xiuhao, Oruganti, Sirish, Kulkarni, Jaydeep P.

    Veröffentlicht: IEEE 22.06.2025
    “… D Gaussian Splatting (3DGS) has emerged as a promising real-time photorealistic radiance field rendering technique. Existing GPU and hardware accelerators face …”
    Volltext
    Tagungsbericht
  11. 11

    MHDiff: Memory- and Hardware-Efficient Diffusion Acceleration via Focal Pixel Aware Quantization von Qi, Chunyu, Wang, Xuhang, Chen, Ruiyang, Yao, Yuanzheng, Jing, Naifeng, Zhang, Chen, Wang, Jun, Fu, Zhihui, Liang, Xiaoyao, Song, Zhuoran

    Veröffentlicht: IEEE 22.06.2025
    “… Diffusion models have demonstrated superior performance in image generation tasks, thus becoming the mainstream model for generative visual tasks. Diffusion …”
    Volltext
    Tagungsbericht
  12. 12

    Cambricon-D: Full-Network Differential Acceleration for Diffusion Models von Kong, Weihao, Hao, Yifan, Guo, Qi, Zhao, Yongwei, Song, Xinkai, Li, Xiaqing, Zou, Mo, Du, Zidong, Zhang, Rui, Liu, Chang, Wen, Yuanbo, Jin, Pengwei, Hu, Xing, Li, Wei, Xu, Zhiwei, Chen, Tianshi

    Veröffentlicht: IEEE 29.06.2024
    “… computational redundancy and substantial hardware expenditures.Performing differential computing on input data seems to be a feasible approach for addressing such computational redundancy and improving hardware efficacy …”
    Volltext
    Tagungsbericht
  13. 13

    StocHD: Stochastic Hyperdimensional System for Efficient and Robust Learning from Raw Data von Poduval, Prathyush, Zou, Zhuowen, Najafi, Hassan, Homayoun, Houman, Imani, Mohsen

    Veröffentlicht: IEEE 05.12.2021
    “… Hyperdimensional Computing (HDC) is a neurally-inspired computation model working based on the observation that the human brain operates on high-dimensional representations of data, called hypervector …”
    Volltext
    Tagungsbericht
  14. 14

    DARIS: An Oversubscribed Spatio-Temporal Scheduler for Real-Time DNN Inference on GPUs von Babaei, Amir Fakhim, Chantem, Thidapat

    Veröffentlicht: IEEE 22.06.2025
    “… In particular, DARIS improves GPU utilization and uniquely analyzes GPU concurrency by oversubscribing computing resources …”
    Volltext
    Tagungsbericht
  15. 15

    Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs von Joseph, Diya, Aragon, Juan L., Parcerisa, Joan-Manuel, Gonzalez, Antonio

    Veröffentlicht: IEEE 21.10.2023
    “… In this paper, however, we surpass this exploration by fabricating a formal proof for a no-overhead quasi-optimal caching technique for caching textures in graphics workloads …”
    Volltext
    Tagungsbericht
  16. 16

    GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency von Jo, Joongho, Park, Jongsun

    Veröffentlicht: IEEE 22.06.2025
    “… 3D Gaussian Splatting (3D-GS) has emerged as a promising alternative to neural radiance fields (NeRF) as it offers high speed as well as high image quality in …”
    Volltext
    Tagungsbericht
  17. 17

    SFLU: Synchronization-Free Sparse LU Factorization for Fast Circuit Simulation on GPUs von Zhao, Jianqi, Wen, Yao, Luo, Yuchen, Jin, Zhou, Liu, Weifeng, Zhou, Zhenya

    Veröffentlicht: IEEE 05.12.2021
    “… Sparse LU factorization is one of the key building blocks of sparse direct solvers and often dominates the computing time of circuit simulation programs …”
    Volltext
    Tagungsbericht
  18. 18

    PARO: Hardware-Software Co-design with Pattern-aware Reorder-based Attention Quantization in Video Generation Models von Yang, Xinhao, Zhao, Tianchen, Wang, Hongyi, Ma, Wenheng, Zeng, Shulin, Zhu, Zhenhua, Ning, Xuefei, Yang, Huazhong, Wang, Yu

    Veröffentlicht: IEEE 22.06.2025
    “… Transformer-based video generation models have demonstrated significant potential in content creation. However, the current state-of-the-art model employing " …”
    Volltext
    Tagungsbericht
  19. 19

    BEVSA: A Real-Time Bird's-Eye-View Semantic Segmentation Accelerator for Multi-Camera System von Lee, Sangho, Jung, Jueung, Jang, Wuyoung, Hwang, Jihyeon, Lee, Kyuho

    Veröffentlicht: IEEE 22.06.2025
    “… A bird's-eye-view (BEV) semantic segmentation accelerator (BEVSA) is proposed for real-time 3D space perception in multi-camera system (MCS …”
    Volltext
    Tagungsbericht
  20. 20

    Harnessing Conventional Video Processing Insights for Emerging 3D Video Generation Models: A Comprehensive Attention-aware Way von Zhao, Tianlang, Liu, Jun, Li, Xingyang, Ding, Li, Li, Jinhao, Li, Shuaiheng, Hu, Jinbo, Dai, Guohao

    Veröffentlicht: IEEE 22.06.2025
    “… Video Generation Models based on 3D full attention (3D-VGMs) have significantly enhanced video quality. However, their inference overhead remains substantial, …”
    Volltext
    Tagungsbericht