8-bit Transformer Inference and Fine-tuning for Edge Accelerators.

Jeffrey Yu, Kartik Prabhu, Yonatan Urman, Robert M. Radway, Eric Han, Priyanka Raina

International Conference on Architectural Support for Programming Languages and Operating Systems (2024)
