Real Voice or OpenAI's Voice Engine?
The first audio clip is authentic audio, i.e. spoken by a human. The second audio clip is a synthetic clip generated by OpenAI’s Voice Engine. Voice Engine requires only a 15-second reference audio file to replicate a speaker’s voice.
Reference audio:
OpenAI Generated audio:
See more examples of Voice Engine HERE.



