Destide@feddit.uk to Programming@programming.devEnglish · 3 days agoOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coexternal-linkmessage-square9fedilinkarrow-up1106arrow-down15cross-posted to: technology@lemmy.worldtechnology@lemmy.zip
arrow-up1101arrow-down1external-linkOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coDestide@feddit.uk to Programming@programming.devEnglish · 3 days agomessage-square9fedilinkcross-posted to: technology@lemmy.worldtechnology@lemmy.zip
minus-squaremanicdave@feddit.uklinkfedilinkarrow-up4arrow-down1·2 days agoAll I want is a 3gb model for the raspberry pi. 7b is too big and 1.5b is too stupid.
minus-squareTomasEkeli@programming.devlinkfedilinkarrow-up4·2 days agohonestly both 7b and 8b are pretty dumb as well.
minus-squareMadhuGururajan@programming.devlinkfedilinkEnglisharrow-up1·22 minutes agowe could add so much deterministic code at 1.5GB that would start religions…
All I want is a 3gb model for the raspberry pi. 7b is too big and 1.5b is too stupid.
3B is probably also pretty dumb
honestly both 7b and 8b are pretty dumb as well.
we could add so much deterministic code at 1.5GB that would start religions…
True