Training speed: (new training code) RWKV-4 14B BF16 ctxlen4096 = 114K tokens/s on 8x8 A100 80G (ZERO2+CP). ... faithful sentence embedding for other tasks.
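To put the training-speed figure in perspective, here is a minimal arithmetic sketch. It assumes "8x8" means 8 nodes with 8 GPUs each (64 GPUs in total), which the text does not spell out; the function names are hypothetical and only serve the illustration.

```python
# Figure quoted in the text: 114K tokens/s for RWKV-4 14B, BF16, ctxlen 4096.
TOKENS_PER_SECOND = 114_000

# Assumption (not stated explicitly in the text): "8x8 A100 80G"
# is read as 8 nodes x 8 GPUs per node.
NUM_NODES = 8
GPUS_PER_NODE = 8


def per_gpu_throughput(total_tokens_per_s: float, nodes: int, gpus_per_node: int) -> float:
    """Average tokens/s handled by a single GPU."""
    return total_tokens_per_s / (nodes * gpus_per_node)


def tokens_per_day(total_tokens_per_s: float) -> float:
    """Tokens processed in 24 hours at a constant rate."""
    return total_tokens_per_s * 24 * 60 * 60


if __name__ == "__main__":
    gpu_rate = per_gpu_throughput(TOKENS_PER_SECOND, NUM_NODES, GPUS_PER_NODE)
    daily = tokens_per_day(TOKENS_PER_SECOND)
    print(f"per-GPU throughput: {gpu_rate:.2f} tokens/s")   # 1781.25
    print(f"tokens per day:     {daily / 1e9:.2f} B")       # 9.85 B
```

Under that assumption, each GPU processes roughly 1.8K tokens/s, and the whole cluster about 9.8B tokens per day at a sustained rate.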