2 Paper Code Loss-aware Weight Quantization of Deep Networks ... to 1 or 2 bits that can enable end-to-end speedups of up to 12X when optimized using TVM.
確定! 回上一頁