Hwang, R., Wei, J., Cao, S., Hwang, C., Tang, X., Cao, T., & Yang, M. (2024, June 29). Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference. 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), 1018-1031. https://doi.org/10.1109/ISCA59077.2024.00078
Citace podle Chicago (17th ed.)Hwang, Ranggi, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, a Mao Yang. "Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference." 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) 29 Jun. 2024: 1018-1031. https://doi.org/10.1109/ISCA59077.2024.00078.
Citace podle MLA (9th ed.)Hwang, Ranggi, et al. "Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference." 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), 29 Jun. 2024, pp. 1018-1031, https://doi.org/10.1109/ISCA59077.2024.00078.