LLaMA was trained on 78 GB of StackExchange data (I assume StackOverflow was included in that).


But was it parsed and reformatted specifically into the "chat format" (i.e., the same format as the inputs later fed to the model when it's used as a chatbot)? It can make a surprisingly big difference. Something like the sketch below is what I mean.
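
A minimal sketch of that kind of reformatting, in Python. The <|user|>/<|assistant|> markers and the to_chat_format helper are made up for illustration; the base LLaMA paper doesn't describe a chat template, so this shows the general idea, not Meta's actual pipeline:

    # Hypothetical reformatting of a StackExchange Q&A pair into a
    # chat-style training example. The turn markers below are
    # illustrative placeholders, not the real LLaMA chat template.

    def to_chat_format(question: str, accepted_answer: str) -> str:
        """Wrap a Q&A pair in the same turn markers the model will
        see at inference time, so training text and chat prompts
        share one format."""
        return (
            "<|user|>\n"
            f"{question.strip()}\n"
            "<|assistant|>\n"
            f"{accepted_answer.strip()}\n"
        )

    if __name__ == "__main__":
        sample = to_chat_format(
            "How do I reverse a list in Python?",
            "Use reversed(xs) for an iterator, or xs[::-1] for a new list.",
        )
        print(sample)

The point is that if pretraining data is plain concatenated Q&A text but inference-time prompts carry turn markers like these, the model sees a distribution it was never trained on, which is why the formatting choice matters.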



