OpenAI's Secret Weapon: High-Accuracy AI Text Detection Tool Faces Release Delays Amid Internal Debates

August 4, 2024

Tech

Generative AI

OpenAI is actively researching a text watermarking method aimed at identifying AI-generated text, alongside other detection solutions like classifiers and metadata.
The company has developed a tool that can detect text generated by ChatGPT with 99.9% accuracy, but it remains unavailable to the public.
Internal debates at OpenAI have delayed the release of this detection tool for two years, despite its technical readiness for about a year.
OpenAI is taking a cautious approach to the release of the detection tool due to the complexities and potential broader impacts it may have.
Concerns about misuse and the potential stigmatization of non-English speakers have led OpenAI to withhold the tool from public access.
The proposed text watermarking method could disproportionately affect non-native English speakers, raising concerns about its impact on their use of AI writing tools.
OpenAI has identified potential methods that bad actors could use to bypass the detection system, contributing to the decision to withhold the tool.
While the watermarking method shows high accuracy against localized tampering, it is less effective against broader text alteration techniques.
The watermarking technology aims to address concerns about the authenticity of AI-generated content, embedding an imperceptible watermark for later identification.
In a similar effort, Google is beta testing its own watermarking tool, SynthID, to detect text generated by its Gemini AI.
The project has been a topic of internal debate at OpenAI for the past two years, reflecting the challenges in balancing transparency and user attraction.

Summary based on 6 sources

Get a daily email with more Tech stories

Sources

TechCrunch • Aug 4, 2024

OpenAI says it’s taking a ‘deliberate approach’ to releasing tools that can detect writing from ChatGPT | TechCrunch

PCMag • Aug 4, 2024

OpenAI Holds Key to Stopping ChatGPT Cheating, But Keeps It Private

Engadget

OpenAI confirms it’s looking into text watermarking for ChatGPT that could expose cheating students

Cointelegraph • Aug 4, 2024

OpenAI has a ‘highly accurate’ tool to detect AI content, but no release plans

OpenAI's Secret Weapon: High-Accuracy AI Text Detection Tool Faces Release Delays Amid Internal Debates

Get a daily email with more Tech stories

Sources

More Stories