Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The many sources of stochastic/non-deterministic behavior have been mentioned in other replies but I wanted to point out this paper: https://arxiv.org/abs/2506.09501 which analyzes the issues around GPU non determinism (once sampling and batching related effects are removed).

One important take-away is that these issues are more likely in longer generations so reasoning models can suffer more.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: