Thanks for the suggestion! That makes a lot of sense. You're right, most publishers are very particular about format of their content in these generated videos. Focusing on a single client would help nail down at least one use case, and bring in some much needed cash flow :)
The text-to-speech software was ~$1500, and it's been running on a $100/month dedicated server since February.
The text-to-speech software was ~$1500, and it's been running on a $100/month dedicated server since February.