Training large language models on narrow tasks can lead to broad misalignment

Nature, Published online: 14 January 2026; doi:10.1038/s41586-025-09937-5

Finetuning a large language model on a narrow task of writing insecure code causes a broad range of concerning behaviours unrelated to coding.