
I linked results where the user ran Kimi K2 across his 8-node cluster. Inference results are listed for 1, 10, and 100 concurrent requests.

Edit to add:

Yeah, those stations with the GB300 look more along the lines of what I would want as well, but I agree, they’re probably way beyond my reach.




