Qin, Y., Wang, Y., Zhao, Z., Yang, X., Zhou, Y., Wei, S., . . . Yin, S. (2024, June 29). MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition. 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), 1032-1047. https://doi.org/10.1109/ISCA59077.2024.00079
Citation in Chicago style (17th ed.): Qin, Yubin, Yang Wang, Zhiren Zhao, Xiaolong Yang, Yang Zhou, Shaojun Wei, Yang Hu, and Shouyi Yin. "MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition." 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) 29 Jun. 2024: 1032-1047. https://doi.org/10.1109/ISCA59077.2024.00079.
Citation in MLA style (9th ed.): Qin, Yubin, et al. "MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition." 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), 29 Jun. 2024, pp. 1032-1047, https://doi.org/10.1109/ISCA59077.2024.00079.