Wang, Z., Xu, P., Liu, F., Hu, Y., Sun, Q., Li, G., . . . Guan, H. (2025, June 22). MILLION: MasterIng Long-Context LLM Inference Via Outlier-Immunized KV Product QuaNtization. 2025 62nd ACM/IEEE Design Automation Conference (DAC), 1-7. https://doi.org/10.1109/DAC63849.2025.11132862
Chicago citation (17th ed.): Wang, Zongwu, et al. "MILLION: MasterIng Long-Context LLM Inference Via Outlier-Immunized KV Product QuaNtization." 2025 62nd ACM/IEEE Design Automation Conference (DAC) 22 Jun. 2025: 1-7. https://doi.org/10.1109/DAC63849.2025.11132862.
MLA citation (8th ed.): Wang, Zongwu, et al. "MILLION: MasterIng Long-Context LLM Inference Via Outlier-Immunized KV Product QuaNtization." 2025 62nd ACM/IEEE Design Automation Conference (DAC), 22 Jun. 2025, pp. 1-7, https://doi.org/10.1109/DAC63849.2025.11132862.