ported my python-fastapi tts server to go, same onnx runtime.
710 MB to start python
141 MB to start go
Mid generation
+ 312 MB used by go
+ 67 MB used by python
thought python was the issue but probably my shitty python coding.
but cli lag is the main thing pissing me off with python
python app --help
real 0m2.604s
user 0m5.190s
sys 0m0.228s
go app --help
real 0m0.028s
user 0m0.016s
sys 0m0.015s
idk, with the model running doesn't measure so different, just the start-up time