This is an independent information page. Not affiliated with Google, Google DeepMind, OpenAI, or any other company. SynthID is a trademark of Google LLC. All information is based on public research and official documentation.

Google DeepMind · AI Watermarking & Detection

The SynthID watermark —
how it's embedded
and how the detector works

SynthID is Google DeepMind's AI watermarking technology. It embeds an imperceptible signal into images, text, audio, and video at generation time — and the SynthID detector can read it back even after compression, cropping, or rewriting.

0
Billion+ pieces watermarked
0
% True positive rate (images)
0
% Max false positive rate

SynthID: watermark first, detector second

The SynthID watermark is Google DeepMind's method of marking AI-generated content as authentic at the moment it is created. Unlike metadata labels — which are trivially stripped — the SynthID watermark is embedded directly into the pixels, audio frequencies, or token probabilities of the output itself. Launched in August 2023, it has now watermarked over 100 billion pieces of content.

The SynthID detector can then read that signal back from any copy of the content — no original model access needed. It confirms whether the SynthID watermark is present with a confidence score, returning one of three verdicts: Watermarked, Uncertain, or Not watermarked.

Note: the SynthID detector only verifies its own signature. Content from Midjourney, Stable Diffusion, or other tools without SynthID will simply return "not watermarked." For broader AI detection, see the tools section.

The core text algorithm was peer-reviewed and published in Nature, Vol. 634 (October 2024) — one of the few AI watermarking systems to reach this level of scientific scrutiny. The code was open-sourced the same month via Hugging Face Transformers 4.46+.

Generation-time watermarking

All SynthID variants share the same blueprint: an embedder injects the watermark during generation, and a detector reads it from the output alone — no original model access needed.

01

Model generates

The LLM or diffusion model runs normal inference, producing raw logits or latent activations.

02

Watermark injected

A logit processor or embedder network biases token selection (or pixel values) toward a secret cryptographic pattern — without degrading perceptible quality.

03

Content is the watermark

The final output looks and reads normally. The watermark is distributed across the content redundantly — no single region carries the full signal.

04

Detector scores

A Bayesian detector or neural network reconstructs the expected pattern using the same secret keys and computes a confidence score. Output: Watermarked, Uncertain, or Not watermarked.

See watermark embedding live

Press Embed to watch the embedder network inject a signal pixel-by-pixel. Then run the detector — it reconstructs the expected pattern from the secret key and scores each region.

Embedder Idle
Detector Waiting

How each modality is watermarked

🖼️ Images

A dual neural network: an embedder modifies pixel values in imperceptible high-frequency regions; a detector reads the distributed signal. Both are trained adversarially against JPEG compression, cropping, resizing, rotation, and noise. Even a 30% fragment retains a detectable signal.

📝 Text

A logit processor applied after Top-K/Top-P sampling. Tokens compete in elimination rounds (tournament sampling) where a pseudorandom g-function, keyed to a developer secret and a context hash, biases selection. The output distribution is mathematically bounded — quality impact is negligible. Detection uses a Bayesian scorer; no model access needed.

# Hugging Face Transformers 4.46+
from transformers import SynthIDTextWatermarkingConfig

config = SynthIDTextWatermarkingConfig(
    keys=[654, 400, 836, 123, 340],
    ngram_len=5,            # context window; 5 is default
    sampling_table_size=2**16,
)
🎵 Audio

Exploits psychoacoustic masking — embedding signal in frequency ranges below human hearing sensitivity and during louder segments. Deployed in Lyria (AI music) and NotebookLM podcasts. Survives MP3 compression, speed/pitch shifts, and the analog hole (speaker → re-record).

🎬 Video

Embedded at the frame segment level in Veo-generated content. Designed to survive frame rate changes, H.264/HEVC re-encoding, color grading, and cropping.

Tools to detect AI-generated content

Google's official SynthID detector is built into the Gemini app; OpenAI's is at openai.com/research/verify. Third-party tools below extend detection to content not covered by first-party watermarks.

Google's unified SynthID Detector portal — currently in early access for media professionals — verifies signals across all four modalities from a single upload. The Gemini app's verification has been used 50 million times globally.

Detection performance

Image metrics are from the SynthID-Image paper (arXiv:2510.09263), calibrated at FPR = 0.1% across worst-case transformations. Text metrics are configurable via threshold tuning.

Images — True Positive Rate (random transforms)
99.98%
Images — True Positive Rate (worst-case)
99.72%
Images — False Positive Rate (capped)
≤ 0.1%
Text — Quality impact (non-distortionary mode)
≈ 0%

What SynthID cannot do

✂️

Aggressive paraphrasing (text)

Full semantic rewriting or machine translation substantially degrades text watermark confidence. SynthID complicates misuse — it does not prevent a determined adversary.

🌐

Not a universal AI detector

It only identifies its own signature. Content from Midjourney, Stable Diffusion, DALL·E, or any non-Google system returns "not watermarked" — not "not AI."

📊

Low-entropy text

Factual, highly constrained outputs (structured data, code snippets) leave little sampling flexibility — watermark detectability degrades significantly.

🔑

Key secrecy is critical

If the watermarking keys are exposed, the scheme can be targeted. Enterprise deployments use semi-private key distribution to mitigate this.

🔄

Generative re-processing (images)

Passing a watermarked image through a strong img2img pipeline can destroy the embedded signal, depending on denoising strength.

SynthID adoption: Google, OpenAI, and beyond

SynthID started as a Google DeepMind internal tool. Since then it has become a cross-industry provenance standard.

🔵

Google (origin)

SynthID is active in Imagen, Veo, Lyria, NotebookLM, and the Gemini app. The Gemini app's verification feature has been used 50 million times globally. Google's Pixel 10 became the first smartphone to embed C2PA Content Credentials in its native camera app.

🟢

OpenAI / ChatGPT

In May 2026, OpenAI announced it adopted SynthID watermarking for images generated by ChatGPT, the OpenAI API, and Codex. OpenAI also joined the C2PA steering committee. You can verify any OpenAI-generated image at openai.com/research/verify — it checks both the C2PA manifest and the SynthID watermark.

🟠

Other adopters

NVIDIA integrated SynthID into its Cosmos world models. Kakao and ElevenLabs are also adopting the technology. C2PA founding members — Adobe, BBC, Intel, Microsoft — add a complementary metadata layer: C2PA embeds a signed manifest in file headers; SynthID embeds in the content. Both together form a resilient provenance stack.

Checking if content was made by AI

There are two official first-party verification tools — one for Google AI content, one for OpenAI content.

Gemini app (Google AI content)

Go to gemini.google.com, upload your image, video, or audio, then ask a natural language question like "Was this created with Google AI?" or "Is this AI-generated?" Gemini checks for the SynthID watermark and explains the result. Video support: up to 100 MB / 90 seconds. Audio: up to 1 hour. Daily quota applies.

OpenAI Verify (OpenAI-generated images)

Visit openai.com/research/verify and upload an image. The tool checks for both C2PA metadata and the embedded SynthID watermark in images generated by ChatGPT, the API, or Codex.

Important caveat

No watermark found ≠ real photo. These tools only confirm whether their own watermark is present. Content generated by Midjourney, Stable Diffusion, or any non-watermarked tool will simply return "not detected."

Frequently asked questions

Common questions about the SynthID watermark, the SynthID detector, and AI content detection.

What is SynthID? +
SynthID is Google DeepMind's AI watermarking and detection technology. It embeds an invisible, tamper-resistant signal directly into AI-generated images, text, audio, and video at the moment of creation — not as metadata, but into the content itself. It was first launched in August 2023 for Imagen-generated images and has since expanded to cover all four content types.
How does the SynthID detector work? +
The SynthID detector reconstructs the expected watermark pattern using the same secret cryptographic keys that were used during generation. For images, a neural network analyzes pixel value distributions. For text, a Bayesian scorer checks how often observed tokens statistically match the expected tournament-sampling outcomes. The detector outputs one of three verdicts: Watermarked, Uncertain, or Not watermarked. No access to the original model is needed.
Does ChatGPT use SynthID watermarking? +
Yes. In May 2026, OpenAI announced it adopted Google DeepMind's SynthID watermarking for images generated by ChatGPT, the OpenAI API, and Codex. OpenAI also joined the C2PA steering committee. You can verify any OpenAI-generated image at openai.com/research/verify.
How do I use the SynthID detector in Gemini? +
Go to gemini.google.com, click the file upload icon, and attach your image, video, or audio. Then ask a question like "Was this created with Google AI?" Gemini checks for the SynthID watermark and responds with context. No special command or add-on is needed — it works in the standard Gemini interface. Daily usage limits apply (approximately 10 image checks and 10 video checks per day).
Is SynthID open source? +
The text watermarking algorithm — SynthID-Text — was open-sourced in October 2024. It is available on GitHub (google-deepmind/synthid-text) and integrated into Hugging Face Transformers v4.46.0+ as part of a joint Google DeepMind and Hugging Face launch. The image, audio, and video watermarking systems are proprietary and not publicly released.
Can a SynthID watermark be removed? +
It is difficult but not impossible. Image watermarks survive JPEG compression, resizing, cropping, and color grading — but strong generative re-processing (running the image through an img2img pipeline) can destroy the signal. Text watermarks degrade significantly after aggressive paraphrasing or machine translation. Google acknowledges SynthID is designed to complicate misuse, not prevent a determined adversary.
What is SynthID's false positive rate? +
The SynthID image detector is calibrated to a maximum false positive rate of 0.1% across worst-case transformations. This means at most 1 in 1,000 non-watermarked images tested would be incorrectly flagged as watermarked. The true positive rate (correctly identifying watermarked images) is 99.98% for typical transformations and 99.72% for worst-case scenarios. — Source: arXiv:2510.09263
What is the difference between SynthID and C2PA? +
C2PA (Coalition for Content Provenance and Authenticity) embeds a cryptographically signed manifest inside a file's metadata, recording who created it, when, and with what tools. SynthID embeds an invisible signal into the content pixels or token probabilities themselves. Strip the file metadata — C2PA is gone. Crop and re-encode the image — SynthID survives. They are complementary; Google and OpenAI now use both together on generated content.
Does SynthID affect image or text quality? +
No perceptible difference. Image watermarks modify pixel values by only a few least-significant bits — indistinguishable to the human eye and with SSIM scores near 1.0. Text watermarks in non-distortionary mode have no measurable impact on quality; a distortionary mode also exists that increases watermark strength at a slight quality cost.
Can SynthID detect AI-generated content from other companies? +
No. SynthID is a first-party provenance tool, not a universal AI detector. It can only confirm whether its own watermark is present. Images from Midjourney, Stable Diffusion, DALL·E (before the 2026 SynthID adoption), or any other non-Google tool will return "not watermarked." For broader detection, third-party tools like Was It AI Generated? combine SynthID with independent classification models.

References