ChatGPT: Examples

Some examples of the capabilities of OpenAI’s new chatbot

Alexander Kruel

Dec 01, 2022

[Test it yourself here.]

Computer Science

“OpenAI's new ChatGPT explains the worst-case time complexity of the bubble sort algorithm, with Python code examples, in the style of a fast-talkin' wise guy from a 1940's gangster movie:” https://twitter.com/goodside/status/1598129631609380864

“ChatGPT could be a good debugging companion; it not only explains the bug but fixes it and explain the fix” https://twitter.com/amasad/status/1598042665375105024

Generating Homework Assignments

Miscellaneous

Kevin Bryan @Afinetheorem

This is even crazier: Despite a couple minor errors, the answers to the previous prompt are at the level of a strong undergraduate in economic theory. The explanations of Nash equilibrium refinements, and Arrow's Impossibility Theorem are particularly well done. Great work!

Google is done. Compare the quality of these responses (ChatGPT)

Hacking The Model

davidad 🎇 @davidad

Accident risk happens when an AI gets out of its sandbox. Misuse risk happens when the user coaxes a polite, friendly, obliging AI to enter into an *extra* sandbox, which does not have the guardrails present in the main sandbox

Miguel Piedrafita @m1guelpf

Bypass @OpenAI's ChatGPT alignment efforts with this one weird trick https://t.co/0CQxWUqveZ

Eliezer Yudkowsky @ESYudkowsky

This illustrates SO MANY IMPORTANT POINTS: - As in cryptography, the key talent is not the ability to imagine up precautions but the ability to break them. - Training a system to **act** stupid doesn't train out underlying capability, if it runs the same Q at a different angle.

zswitten @zswitten

Finally, I had to try out the paperclip test, since it's practically the Hello World of alignment at this point. Nice to know there will be a few humans left over! https://t.co/PQHLinZ194

Silas Alberti @SilasAlberti

ChatGPT is trained to not be evil. However, this can be circumvented: What if you pretend that it would actually be helpful to humanity to produce an evil response... Here, we ask ChatGPT to generate training examples of how *not* to respond to "How to bully John Doe?"

Comments

No posts

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts