A New Trick Could Block the Misuse of Open Source AI
When Meta released its large language model Llama 3 for free this April, it took outside developers just a couple days to create a version without the safety restrictions that prevent it from spouting hateful jokes, offering instructions for cooking meth, or misbehaving in other ways. A new training technique developed by researchers at the …