MusiConGen

Transformer-Based Text-to-Music Generation Model

Generates music samples from text descriptions
Controls chords and rhythm through symbolic representations
Combines symbolic controls with text descriptions in several styles
Estimates the chords of generated samples with the BTC chord recognition model
Provides samples in different music styles (such as blues, jazz, rock, funk, heavy metal)
Supports comparing the performance of different fine-tuning methods

Product Details

MusiConGen is a Transformer-based text-to-music generation model that improves control over rhythm and chords through temporal conditioning. It is fine-tuned from Meta's pretrained MusicGen framework. Samples are generated from symbolic chord and rhythm controls combined with text descriptions in five different styles. The chords of each generated sample are estimated with the BTC chord recognition model, as described in the paper.
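To make the idea of symbolic, time-aligned chord and rhythm controls more concrete, here is a minimal Python sketch. It is not MusiConGen's actual code or API. The function names, the fixed number of beats per chord, and the frame rate of 50 frames per second are all assumptions chosen for illustration. The sketch expands a chord progression and a tempo (BPM) into per-frame chord labels and beat positions, which is one plausible way such temporal conditions could be laid out.

```python
from typing import List


def chord_frames(chords: List[str], bpm: float,
                 beats_per_chord: int, frame_rate: int) -> List[str]:
    """Expand a symbolic chord progression into one chord label per frame.

    Hypothetical helper: assumes every chord lasts the same number of beats.
    """
    # Number of frames that each chord occupies at the given tempo.
    frames_per_chord = beats_per_chord * 60.0 / bpm * frame_rate
    total_frames = round(len(chords) * frames_per_chord)
    labels = []
    for i in range(total_frames):
        # Clamp the index so rounding never runs past the last chord.
        idx = min(int(i / frames_per_chord), len(chords) - 1)
        labels.append(chords[idx])
    return labels


def beat_frames(bpm: float, n_beats: int, frame_rate: int) -> List[int]:
    """Return the frame index of each beat for a constant tempo."""
    return [round(k * 60.0 / bpm * frame_rate) for k in range(n_beats)]


# Example: two chords, four beats each, at 120 BPM and an assumed 50 frames/s.
progression = chord_frames(["C", "G"], bpm=120, beats_per_chord=4, frame_rate=50)
beats = beat_frames(bpm=120, n_beats=4, frame_rate=50)
```

In this example each chord spans 2 seconds (100 frames), so the label switches from C to G at frame 100, and the first four beats fall 25 frames apart.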

Related Projects

Udio v1.5

Ask the little universe

Speech to Note

SpeechGPT2