Played with speech to speech a bit today, its pretty fking good.
Just like with the voices you want to clone, your audio clips you want for S2S, need to be clear and have good stability, understandable words, and for the moment, only ENG is available.
One of the issues ive come across is that most thots that delve in JOI and similar things, have shit quality mics, you get a lot of feedback (white noise) on the back and foreground audio, among other noises. If you can get yourself some good quality audio clip, then you're good for the most part.
E.L. takes the audio file, and "listens" to it, writes it down and adjusts your cloned so that they basically will be, like a glove over your sample audio.
I took a sample from an ASMR audio file (since audio is the whole shiz for ASMRtist, they have some of the best quality for samples for testing, if you dont mind the whispering), and used a cloned PotasticP voice, and it sounds something like this:
i got it to work after twerking it a bit, reducing noise, audio spikes and weird voice effects that Labs sometimes does. But since its "whispered" labs had a hard time reading it, the log coming out like this for individual tests:
Oh fuck, oh fuck, oh fuck, oh fuck, I'm fucking out of my mind. | |
r a v v a теперь r a v v a s r a v ve s r a v v h r a v we r a v r a v s h r a v s | |
The second test coming out the closest (that last part should be "Im cumming out of my fucking mind!")
Imma try it later with audio samples from hentai since those have amazing audio quality.
If your guys got any audio samples that have been working for you, please feel free to share them, it could be helpful and i would appreciate it. cheers
BTW, speech to speech is a very good way to get voices with accents, if you got a voice in another language, you force it to ENG with this tool, which is pretty cool if youre in to that stuff