I'm not sure how to write text prompts or CoT instructions to generate consistent sound effects.
Has anyone else encountered this? I'm trying to figure out whether my prompts aren't good enough, or if the model's output simply isn't capable of consistently adhering to the text prompts.
I'm not sure how to write text prompts or CoT instructions to generate consistent sound effects.
Has anyone else encountered this? I'm trying to figure out whether my prompts aren't good enough, or if the model's output simply isn't capable of consistently adhering to the text prompts.