Ask HN: Is HN used as AI dataset?

Someone, somewhere is scrapping, right?

3 points

jb_briant

a year ago


4 comments

dredmorbius a year ago

I've turned up several of my own HN comments using FastGPT, from Kagi Labs.

Whether that's training data or live search results I'm not entirely sure, but HN definitely contributes to results in that case.

pvg a year ago

There's are a couple of different APIs and full datasets are downloadable so the data is readily accessible without scraping anything.

zoezoezoezoe a year ago

if it's on the internet, it's in a dataset somewhere at this point.