>>101757415
400k tracks, added special token for artist and lyrics so that it doesnt need a starting line
<|artist|>Taylor Swift<|lyrics|>
gpt2 arch because you dont need billions of parameters to generate lyrics but increased max length, initialized from scratch, tokenizer is trained on the dataset too
exciting desu