‘Toxic AI’ is here – Scientists incentivize the worst questions we can think of
The latest breakthrough in AI training, known as curiosity-driven red teaming (CRT), introduces a controversial method to combat toxic AI behavior.