GPU's rival? What is Language Processing Unit (LPU) (www.turingpost.com)
Posted by ooli@lemmy.world to Technology@lemmy.world · English · 9 months ago · 15 comments · 99 upvotes, 11 downvotes
Finadil@lemmy.world · 9 months ago
Is that with an fp16 model? Don’t be scared to try even a 4-bit quantization; you’d be surprised at how little is lost and how much quicker it is.
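To get a rough feel for the "how little is lost" part, here is a minimal sketch of block-wise 4-bit quantization in plain Python. It is only an illustration under assumptions: the weights are random Gaussian values rather than real model weights, the block size of 32 and the symmetric round-to-nearest scheme are picked for simplicity, and the helper names `quantize_4bit` and `dequantize` are hypothetical. Real inference runtimes use their own, more refined formats.

```python
import random

def quantize_4bit(block):
    """Symmetric round-to-nearest quantization of one block to signed 4-bit codes."""
    # Map the largest magnitude in the block to +/-7; fall back to 1.0 for an all-zero block.
    scale = max(abs(w) for w in block) / 7 or 1.0
    codes = [max(-8, min(7, round(w / scale))) for w in block]
    return scale, codes

def dequantize(scale, codes):
    """Recover approximate weights from the shared scale and the 4-bit codes."""
    return [scale * c for c in codes]

random.seed(0)
# Stand-in for model weights (assumption: small, roughly Gaussian values).
weights = [random.gauss(0.0, 0.02) for _ in range(4096)]
block_size = 32  # assumption: one scale shared by every 32 weights

restored = []
for i in range(0, len(weights), block_size):
    scale, codes = quantize_4bit(weights[i:i + block_size])
    restored.extend(dequantize(scale, codes))

rms = (sum(w * w for w in weights) / len(weights)) ** 0.5
rms_err = (sum((w - r) ** 2 for w, r in zip(weights, restored)) / len(weights)) ** 0.5

# Storage cost: a 4-bit code per weight plus one fp16 scale per block.
bits_per_weight = 4 + 16 / block_size
print(f"bits per weight: {bits_per_weight} (vs 16 for fp16)")
print(f"relative RMS error: {rms_err / rms:.3f}")
```

Keep in mind this measures error on the weights themselves, which is only a proxy; what actually matters is output quality, so it is worth comparing a quantized model against the fp16 one on your own prompts.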