Connect with us

Business

A User’s Cloned Voice was Abruptly Adopted by ChatGPT During Testing

Published

on

The “system card” that describes the model constraints and safety testing protocols for ChatGPT’s new GPT-4o AI model was made available by OpenAI on Thursday. The document discloses, among other things, that the model’s Advanced Voice Mode inadvertently mimicked users’ voices without authorization on rare occasions during testing. Although OpenAI currently has measures in place to prevent this from happening, the example shows how difficult it is to safely architect with an AI chatbot that has the ability to mimic any voice from a brief clip.

ChatGPT has a feature called Advanced Voice Mode that lets users speak with the AI helper.

The GPT-4o system card contains a section titled “Unauthorized voice generation,” where OpenAI describes an incident in which the model abruptly began to mimic the user’s voice due to a loud input. “Voice generation can also occur in non-adversarial situations, such as our use of that ability to generate voices for ChatGPT’s advanced voice mode,” adds OpenAI. “During testing, we also observed rare instances where the model would unintentionally generate an output emulating the user’s voice.”

In this instance of inadvertent speech production supplied by OpenAI, the AI model abruptly exclaims “No!” and proceeds with the phrase in a voice resembling the “red teamer” audible at the start of the video. A person employed by a business to conduct adversarial testing is known as a red teamer.

To be speaking with a machine and then have it start speaking to you in your own voice out of the blue would be unsettling, for sure. Since OpenAI typically has precautions in place to prevent this, the company claims that even before it established methods to entirely prohibit it, this occurrence was uncommon. Data scientist Max Woolf of BuzzFeed, however, was inspired by the example to tweet, “OpenAI just leaked the plot of Black Mirror’s next season.”

Injections with audio prompts

How might the new model from OpenAI facilitate speech imitation? The main hint is located in another part of the GPT-4o system card. GPT-4o appears to be able to synthesize nearly any kind of sound, including music and sound effects, from its training data to create voices.

Based on a brief audio clip, the model can essentially mimic any voice, as stated on the system card. By giving it access to a voice actor’s approved sample, which it is trained to mimic, OpenAI safely steers this feature. It presents the example at the start of a conversation in the AI model’s system prompt (also known as the “system message,” according to OpenAI). “We supervise ideal completions using the voice sample in the system message as the base voice,” says OpenAI.

The system message, which is discreetly appended to the conversation history shortly before the chat session starts, is a concealed series of written instructions in text-only LLMs that direct the chatbot’s behavior. Every time the user enters new data, the full context is sent back into the AI model, and subsequent interactions are appended to the same chat history.

OpenAI may use audio inputs as part of the model’s system prompt because GPT-4o is multimodal and can interpret tokenized audio; this is what it does when OpenAI gives the model permission to imitate a voice sample. To find out if the model is producing audio without permission, the business additionally employs another system. “We only allow the model to use certain pre-selected voices,” says OpenAI, “and use an output classifier to detect if the model deviates from that.”

Business

ProRata, an AI startup, Teams up with UK Publishers after reportedly Hitting $130 Million in Valuation

Published

on

A number of well-known British media outlets have joined ProRata, an AI firm that claims to compensate publishers for the usage of their work, in its expanding network of partnerships.

The Los Angeles-based firm announced on Wednesday that it has signed licensing deals with publishers such as Sky News, the Guardian, and the Daily Mail’s publisher, DMG Media.

In a recent Series A funding round, ProRata raised $25 million from investors such as the Mayfield Fund, Prime Movers Lab, and Revolution Ventures.

“ProRata’s founder and CEO Bill Gross said his firm’s AI technology is the only one that pledges to credit and compensate creators, while providing users with accurate search results.

“We have had hundreds of content owners and media companies reach out to us from around the world who are interested in piloting our technology. Stealing and scraping content is not a sustainable path forward,” he continued.

Similar alliances have previously been formed by ProRata with the German publisher Axel Springer, the Atlantic, Fortune, Time, and Universal Music Group (UMG).

Media firms are offered reasonable compensation by ProRata for the use of their content. The startup’s in-house technology may determine the proper amount of pay by evaluating the worth of the information used to create responses from an AI platform. This would make it possible to pay copyright holders for their work on a per-use basis.

Gross had previously said that AI platforms have been using “shoplifted, plagiarized content,” which fosters an atmosphere in which “disinformation thrives and creators get nothing.”

Gross is recognized for having created the pay-per-click model of internet search monetization with his business, GoTo.com, which was eventually acquired by Yahoo! in 2003.

In a recent blog post, Tige Savage, a cofounder of Revolution, stated that Bill Gross is a serial entrepreneur with extensive experience in monetization techniques.

“He’s attracted a world-class tech team led by AI luminary Tarek Najm to implement the vision and an accomplished business team, including Annelies Jansen and Jonas Lee to drive content and AI partnerships,” Savage continued.

The unpaid use of copyrighted materials by OpenAI and other tech companies to train their AI systems has led to litigation from media companies and other content creators.

Continue Reading

Business

Film Bazaar Unveils an Interactive Cinema App from an Indian Tech Startup

Published

on

Arjun Nittoor, the founder of the Indian technology firm Vireza, disclosed at Film Bazaar that the company is creating a new mobile application that would transform the experience of watching movies in theaters by enabling viewers to engage with the films in real time.

The technology, which was created wholly in-house at the company’s research and development department in Bengaluru, allows viewers to use their smartphones to vote on important plot points during the movie. To keep up with the current screening, patrons download an app before entering the theater and scan a QR code at their seat.

“The film industry is one of the few sectors where the audience experience has seen minimal technological disruption in theatres,” Nittoor stated. “While screen and sound quality have advanced and 3D has been partially adopted, the viewing experience has largely remained the same for decades.”

The screen automatically brightens to show voting options and dims again when choices are made. The system uses discreet phone notifications to encourage audience participation around every ten minutes.

In 2026, Vireza intends to introduce the technology with a full-length interactive movie that will be produced in both English and South Indian for international distribution. The business is presently in the development stage and will shortly start doing multiplex chain trial screenings.

CtrlMovie’s prior success in the interactive film industry was mentioned by Nittoor. CtrlMovie is well-known for “Traces of Responsibility” and “Late Shift.”

In order to overcome the difficulties in cinematography, editing, shot composition, and writing that plagued previous attempts at the format, the firm has spent five years creating what Nittoor refers to as “a new science of filmmaking” that is especially tailored for interactive cinema.

“Despite the proliferation of viewing devices, big-ticket films continue to draw massive crowds to theatres, with box office numbers higher than ever,”  Nittoor stated. “This demand underscores the potential for a meaningful technology shift that could draw audiences out of their homes and into cinemas.”

Other Asian businesses are likewise investigating audience-driven narrative in motion pictures. In February of the following year, Japan’s King Records intends to release “Hypnosis Mic – Division Rap Battle,” an animated interactive film.

Continue Reading

Business

Perplexity, an AI business, adds retail capabilities as search competition gets more intense

Published

on

Perplexity, an artificial intelligence search firm, opened a shopping hub on Monday to draw people to its platform in an effort to challenge Alphabet-owned Google’s hegemony in the search engine market.

Supported by Amazon (AMZN.O) founder Jeff Bezos and top AI chipmaker Nvidia (NVDA.O), the company launches a new tab and will provide users with product cards that display pertinent goods in answer to shopping-related queries.

According to the company, each card offers product facts in an eye-catching manner.

Shopify (SHOP.TO), one of the platform integrations that powers the new functionality, provides access to up-to-date and pertinent information on products from companies on the Canadian e-commerce platform worldwide that ship to the United States.

The goal of e-commerce platforms has been to attract more merchants by utilizing more AI-powered solutions.

‘Snap to Shop’ is a visual search engine featured in Perplexity’s online shopping rollout that displays products based on users’ pictures of an item.

The features will initially be introduced in the US before moving on to other regions; however, no timeframe has been given.

Additionally, Perplexity is launching a “Merchant Program” to enable shops to communicate with the company about its products.

Earlier in November, Reuters reported that the business was raising $3 billion in new funding.

Since the generative AI pioneer added a number of new search features to ChatGPT, OpenAI has become a direct rival of Perplexity, which has been seeking to broaden its product line.

Continue Reading

Trending

error: Content is protected !!