Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They also mention they got a score above 0.8 for 1000 neurons out of GPT2 (which has 1.5B (?)).


1.5B parameters, only 300k neurons. The number of connections is roughly quadratic with the number of neurons.


I thought they had only applied the technique to 307,200 neurons. 1,000 / 307,200 = 0.33% is still low, but considering that not all neurons would be useful since they are initialized randomly, it's not too bad.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: