macOS 14 (Sonoma) introduced Personal Voice, the ability to create a model of one's voice to be used with Live Speech accessibility feature (text-to-speech synthesis).
The training involves recording oneself speaking 150 separate sentences, which the software then chews on for six hours or so to generate a model of one's voice. This can then be used the way the built-in voices (Alex, etc.) are for Live Speech.
The feature has an export function that writes a folder containing .caf
files of the 150 spoken sentences and a .json
metadata file.
However, there's no corresponding import function, making the export currently pretty useless.
This is clearly a work-in-progress, but if someone is familiar with the internals I'd be grateful knowing:
- How to import the exported data to a new voice and tell the Mac to create a voice model?
- How to export and import a voice model?
.caf
files? ... the import may be a simple copy&paste