D New SOTA Text to Audio model using rectified flow and FLUX architecture
A new TTA model trained with rectified flow matching followed by preference optimisation is released! Fully open sourced. Inference on a GPU takes about 3 seconds.
https://redd.it/1hq9hx1
@artificialintelligence24x7
A new TTA model trained with rectified flow matching followed by preference optimisation is released! Fully open sourced. Inference on a GPU takes about 3 seconds.
https://redd.it/1hq9hx1
@artificialintelligence24x7