Ask HN: Is HN used as AI dataset?

Someone, somewhere is scrapping, right?

3 points

jb_briant

2 months ago


4 comments

dredmorbius 2 months ago

I've turned up several of my own HN comments using FastGPT, from Kagi Labs.

Whether that's training data or live search results I'm not entirely sure, but HN definitely contributes to results in that case.

pvg 2 months ago

There's are a couple of different APIs and full datasets are downloadable so the data is readily accessible without scraping anything.

zoezoezoezoe 2 months ago

if it's on the internet, it's in a dataset somewhere at this point.