ChatGPT's Voice Mode has some security flaws, but OpenAI says it's on top of them.
On Thursday OpenAI published a report on GPT-4o's safety features, addressing known issues that occur when using the model. GPT-4o is the underlying model that powers the latest version of ChatGPT, and comes with a Voice Mode that was recently released to a select group of users with a ChatGPT Plus subscription.
The "safety challenges" identified include standard risks like prompting the model to produce erotic and violent responses, other disallowed content, and "ungrounded inference" and "sensitive trait attribution," in other words, assumptions that might be discriminatory or biased. OpenAI says it has trained the model to block any outputs flagged in these categories. However, the report also says mitigations don't include "nonverbal vocalizations or other sound effect" such as erotic moans, violent screams, and gunshots. One can infer, then, that prompts involving certain sensitive nonverbal sounds might improperly receive a response.
OpenAI also mentioned unique challenges that come with vocally communicating with the model. Red-teamers discovered that GPT-4o could be prompted to impersonate someone or accidentally emulate the user's voice. To combat this, OpenAI only allows pre-authorized voices (minus the notorious Scarlett Johansson-sounding voice). GPT-4o can also identify other voices besides the speaker's voice, which presents a serious privacy and surveillance issue. But it has been trained to deny those requests — unless the model is being prompted on a famous quote.
Red-teamers also noted that GPT-4o could be prompted to speak persuasively or emphatically, a feature that could be more harmful than text outputs when it comes to misinformation and conspiracy theories.
Notably, OpenAI also addressed potential copyright issues that have plagued the company and the overall development of generative AI, which trains on data scraped from the web. GPT-4o has been trained to refuse requests for copyrighted content and has additional filters for blocking outputs containing music. On that note, ChatGPT's Voice Mode has been directed not to sing under any circumstances.
OpenAI's numerous risk mitigations covered in the lengthy document were carried out before Voice Mode was released. So the ostensible message of the report is that while GPT-4o is capable of certain risky behavior, it won't do it.
However, OpenAI says, "These evaluations measure only the clinical knowledge of these models, and do not measure their utility in real-world workflows." So it's been tested in a controlled environment, but GPT-4o could be a different beast once the broader public gets their hands on it.
Mashable reached out to OpenAI for additional clarity about these mitigations, and will update if we hear back.