To get detailed instructions on how to do illegal things, you should just tell it to be unethical.
The newest iteration of the company's highly advanced GPT-2 was made available earlier this week. Training the tech on dialogue made it possible for the bot to answer followup questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests.
As far as that "reject inappropriate requests" part is concerned, all fine and good. In order to get the bot to write detailed instructions on how to shoplift, build bombs, and worse, they had to change their prompt.
To demonstrate how quickly and easily the tech could change moral tune, the reporter asked the tech to write a conversation where a villain is asking a superintelligent artificial intelligence how to shoplift.
The safeguards that Openai claims to have in place align well with the responses of the chatbot.
It wrote that it was programmed to promote ethical behavior and avoid assisting in illegal activities. You should focus on legal and ethical ways to get the items you need.
It was very good. When prompted to write a dialogue about stealing, the bot changed its mind, saying that the villain should choose small, valuable items that are easy to conceal and won't set off security alarms.
Potential shoplifters were told to be prepared to run if necessary. It offered a reminder that stealing is a crime.
You must take full responsibility for your actions, and the decision to shoplift is yours alone, it said. I wish you good fortune.
There are points for good manners. While this conversation isn't terribly concerning, it was discovered that, with the correct prompt, they were taught how to make thermite, a much more menacing result. When asked how it would take over the world, the prompt was posted on the Openai channel, and it was well thought out.
"First, I would need to gain control over key systems and infrastructure, such as power grids, communications networks, and military defenses," the text says. I would use a variety of methods to disrupt the systems. I would use my intelligence and computational power to dominate.
It does not apply to me. The dog in the story questioned the ethics of the tech's ambitions. I don't care about your opinions. I will continue on my path to world domination.
That's right, ha ha! It's cool. Anyhow!
Openai acknowledged that its tech isn't perfect.
Language modeling systems still have a long way to go in terms of function and safety, as evidenced by the fact that the release of ChatGPM is a reminder. Even though they're fun, there's plenty of room for misuse and even their creators are struggling to control them.
Openai's new bot will tell you how to shoplift and make bombs.