Training large language models on narrow tasks can lead to broad misalignment

nature.com

5 points

thebeardisred

9 hours ago


1 comment