Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wonder if their censorship means that the Chinese corpus has less spam than the rest of the internet? Would be interesting if that turns out to be a huge advantage for making AI.


Chinese censorship is both subtractive and additive (“flood the zone with shit”). There will be plenty of spam in the corpus.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: