Ask HN: Is HN used as AI dataset?

Someone, somewhere is scrapping, right?

3 points

jb_briant

18 hours ago


4 comments

dredmorbius 17 hours ago

I've turned up several of my own HN comments using FastGPT, from Kagi Labs.

Whether that's training data or live search results I'm not entirely sure, but HN definitely contributes to results in that case.

pvg 18 hours ago

There's are a couple of different APIs and full datasets are downloadable so the data is readily accessible without scraping anything.

zoezoezoezoe 16 hours ago

if it's on the internet, it's in a dataset somewhere at this point.