You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm using the long-form inference example@jpc provided to process longer text blocks. I tried to process Donald Knuth's short essay on the hyphen in "e-mail" with my own voice as a sample. At first, it sounds fine, but the voice gradually fades away and eventually is just noise. Am I doing something wrong? Here's the resulting audio and the code below, which is just slightly adapted from the example.
Any suggestions for how to improve the result? Am I doing something wrong?
I do get a linter error old_atoks is not subscriptable and I don't entirely understand the logic there since I copied it from the inference example. But it seems to work.
Actually, I spoke too soon. It's still trailing off and dying when a longer text is provided.
I understand what the code is supposed to be doing: it takes the last bit of each generated tokens to use as the prompt for the next bit, to achieve better flow/continuity. But somehow it isn't actually working over several iterations.
I'm hoping @jpc can chime in and point us in the right direction.
I'm using the long-form inference example @jpc provided to process longer text blocks. I tried to process Donald Knuth's short essay on the hyphen in "e-mail" with my own voice as a sample. At first, it sounds fine, but the voice gradually fades away and eventually is just noise. Am I doing something wrong? Here's the resulting audio and the code below, which is just slightly adapted from the example.
Any suggestions for how to improve the result? Am I doing something wrong?
Set up code:
functions:
The text was updated successfully, but these errors were encountered: