Zhong, S., Sun, Y., Liang, L., Wang, R., Huang, R., & Li, M. (2025, June 22). HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference. 2025 62nd ACM/IEEE Design Automation Conference (DAC), 1-7. https://doi.org/10.1109/DAC63849.2025.11133274
Chicago-Zitierstil (17. Ausg.)Zhong, Shuzhang, Yanfan Sun, Ling Liang, Runsheng Wang, Ru Huang, und Meng Li. "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference." 2025 62nd ACM/IEEE Design Automation Conference (DAC) 22 Jun. 2025: 1-7. https://doi.org/10.1109/DAC63849.2025.11133274.
MLA-Zitierstil (9. Ausg.)Zhong, Shuzhang, et al. "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference." 2025 62nd ACM/IEEE Design Automation Conference (DAC), 22 Jun. 2025, pp. 1-7, https://doi.org/10.1109/DAC63849.2025.11133274.