AI Audio Detector
Select an audio file for a private, in-browser synthetic-voice, voice-clone, and deepfake-audio risk check.
How AI Audio Detection Works
EyeSift decodes the selected audio locally when your browser supports the format, then checks duration, bitrate, waveform energy, silence ratio, clipping, dynamic range, zero-crossing variation, and micro-variation. These signals help screen synthetic-voice risk, but they are not forensic proof.
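As a rough illustration of the kind of waveform signals listed above, the sketch below computes RMS energy, silence ratio, clipping ratio, and zero-crossing rate from decoded mono samples. The function name `summarizeWaveform`, the `WaveformStats` shape, and both thresholds are assumptions made for this example; they are not EyeSift's actual implementation.

```typescript
// Hypothetical helper: summarizes mono PCM samples in the range [-1, 1].
// Names and thresholds are illustrative assumptions, not EyeSift's code.
interface WaveformStats {
  rms: number;              // root-mean-square energy
  silenceRatio: number;     // fraction of samples below the silence threshold
  clippingRatio: number;    // fraction of samples at or near full scale
  zeroCrossingRate: number; // sign changes per adjacent sample pair
}

function summarizeWaveform(
  samples: Float32Array,
  silenceThreshold = 0.01, // assumed value
  clipThreshold = 0.999,   // assumed value
): WaveformStats {
  const n = samples.length;
  if (n === 0) {
    return { rms: 0, silenceRatio: 1, clippingRatio: 0, zeroCrossingRate: 0 };
  }
  let sumSquares = 0;
  let silent = 0;
  let clipped = 0;
  let crossings = 0;
  for (let i = 0; i < n; i++) {
    const s = samples[i];
    const magnitude = Math.abs(s);
    sumSquares += s * s;
    if (magnitude < silenceThreshold) silent++;
    if (magnitude >= clipThreshold) clipped++;
    // Count a crossing when the sign differs from the previous sample.
    if (i > 0 && (samples[i - 1] >= 0) !== (s >= 0)) crossings++;
  }
  return {
    rms: Math.sqrt(sumSquares / n),
    silenceRatio: silent / n,
    clippingRatio: clipped / n,
    zeroCrossingRate: crossings / Math.max(n - 1, 1),
  };
}
```

In a browser, the samples could come from `AudioBuffer.getChannelData(0)` after the Web Audio API's `decodeAudioData` has decoded the selected file. None of these numbers is meaningful on its own; they are inputs to a screening judgment.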
Voice Clone Review Checklist
| Check | Why it matters | Best next action |
|---|---|---|
| Original file | Messenger downloads, screen recordings, and reposts can strip or mask useful artifacts. | Use the earliest available WAV, FLAC, M4A, or high-quality MP3. |
| Known speaker sample | A believable clone can sound natural without matching the real speaker perfectly. | Compare cadence, breathing, pauses, and repeated phrases against a trusted sample. |
| Provenance | C2PA Content Credentials can preserve signed history for audio and other media when present. | Check whether the file has verifiable provenance before relying on waveform clues. |
| Context | FTC guidance treats voice cloning as a deception risk, especially for scams and impersonation. | Verify the source through a separate channel before acting on urgent requests. |
What We Screen For
- Very smooth waveform transitions that can appear in synthetic or heavily processed voices
- Low dynamic variation, silence-heavy clips, hard limiting, clipping, or bitrate limits that reduce confidence
- Voice cloning, text-to-speech, and AI-generated audio risk signals for manual review
- Format and quality limits that can make a result inconclusive
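One possible way to quantify "low dynamic variation" is to measure how much loudness changes from frame to frame. The sketch below splits the samples into fixed-size frames and reports the coefficient of variation of per-frame RMS. The function names, the frame size of 1024 samples, and the choice of this particular statistic are assumptions for illustration; the page does not specify how EyeSift measures it.

```typescript
// Illustrative only: frame size and the statistic are assumptions.
// Returns the RMS energy of each complete frame; a trailing partial frame is ignored.
function frameRmsValues(samples: Float32Array, frameSize = 1024): number[] {
  const values: number[] = [];
  for (let start = 0; start + frameSize <= samples.length; start += frameSize) {
    let sumSquares = 0;
    for (let i = start; i < start + frameSize; i++) {
      sumSquares += samples[i] * samples[i];
    }
    values.push(Math.sqrt(sumSquares / frameSize));
  }
  return values;
}

// Coefficient of variation (standard deviation / mean) of per-frame RMS.
// 0 means perfectly flat energy; larger values mean more loudness movement.
function energyVariation(samples: Float32Array, frameSize = 1024): number {
  const rms = frameRmsValues(samples, frameSize);
  if (rms.length === 0) return 0;
  const mean = rms.reduce((acc, v) => acc + v, 0) / rms.length;
  if (mean === 0) return 0;
  const variance =
    rms.reduce((acc, v) => acc + (v - mean) ** 2, 0) / rms.length;
  return Math.sqrt(variance) / mean;
}
```

A value near zero on a speech clip would be one hint of heavy limiting or unusually even delivery, but ordinary compression and mastering can produce the same effect, so it is a prompt for manual review and not a verdict.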
Supported Audio Formats
- WAV, FLAC: Raw or lossless files preserve waveform detail.
- M4A, MP3, OGG: Useful for screening when bitrate is high enough.
- Voice-note exports, reposts: Heavy compression can hide or create artifacts.
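Whether the bitrate is "high enough" can be roughly gauged from file size and duration. The sketch below computes an average bitrate in kbps; the function name `estimateBitrateKbps` is hypothetical, and the figure includes container and metadata overhead, so it should be read as an approximation.

```typescript
// Hypothetical helper: average bitrate from file size and decoded duration.
// Includes container and metadata overhead, so treat the result as approximate.
function estimateBitrateKbps(
  fileSizeBytes: number,
  durationSeconds: number,
): number {
  if (durationSeconds <= 0) return 0; // avoid division by zero on unreadable files
  const bits = fileSizeBytes * 8;
  return bits / durationSeconds / 1000;
}

// Example: a 1,200,000-byte file lasting 60 seconds averages 160 kbps.
const exampleKbps = estimateBitrateKbps(1_200_000, 60);
```

In a browser, the inputs could come from `File.size` and `AudioBuffer.duration`.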
Frequently Asked Questions
What types of AI audio can this detector identify?
EyeSift screens audio for signs that can appear in AI text-to-speech, voice cloning, synthetic narration, and heavily processed audio. It checks browser-readable waveform and metadata signals such as duration, bitrate, energy, silence ratio, zero-crossing variation, and micro-variation.
How does voice clone detection work?
Voice-clone triage looks for file and waveform patterns that differ from natural recordings, such as overly smooth transitions, low dynamic variation, unusual silence ratios, or quality constraints that hide detail. The result is a screening signal and should be combined with source verification for high-stakes decisions.
What audio formats produce the best results?
WAV and FLAC files provide the most data for analysis because they are uncompressed or lossless. MP3 and OGG files work well at high bitrates, but lossy compression can mask some spectral artifacts. For best results, use the highest-quality audio available. Avoid files that have been converted multiple times or re-recorded from speakers with a phone microphone, as this degrades the signals the detector relies on.
Can this detect AI-generated podcast or audiobook content?
It can screen podcast or audiobook samples when the browser can decode the file. Longer clips usually provide stronger waveform context than very short clips. Premium AI voices and post-production processing can reduce reliability, so results should be considered probabilistic rather than definitive.
How reliable is AI audio detection for legal or forensic use?
AI audio detection tools provide probability-based assessments, not forensic certainty. For legal proceedings or formal investigations, automated detection should be supplemented with expert analysis by audio forensics professionals. Our tool is best suited for preliminary screening, content moderation workflows, and awareness of synthetic audio characteristics.
Is my uploaded audio file stored or transmitted?
No. All audio analysis runs entirely in your browser. Your audio file is never sent to any server, stored in any database, or accessible to anyone else. The file data exists only in your local browser memory for the duration of your session.