The new version of GPT-3 is much better behaved (and should be less toxic)

Mike Letterman

Jan 27, 2022, 6:26 PM
86 Views

The new version of GPT-3 is much better behaved (and should be less toxic)

40 people were hired by Openai to rate GPT's responses to a range of pre written questions. Responses that contained sexual or violent language were marked down. The feedback was used to train InstructGPT to match responses in ways that the judges preferred.

The OpenAI found that users preferred InstructGPT over GPT-3 more than 70% of the time.

The customers prefer the aligned models so much more that it is exciting.

Users preferred the responses of a 1.3 billion-parameter InstructGPT model to those of a 175 billion-parameter GPT-3 even though the model was more than 100 times smaller. Leike says alignment could be an easy way to make language models better.

The new version of GPT-3 is much better behaved (and should be less toxic)

Mike Letterman

Tech

Buzz

Travel

Health

Related News

Patrick Mahomes, Kansas City Chiefs prevail against Buffalo Bills, win wild AFC divisional game in overtime

Popular on TVN

Stay In Touch

Follow Us