even less aggressive silence filtering and more data
1 minute read
Tensorboard data can be found here. This version used even less aggressive silence filtering (set to aggressiveness value to 0 this time) as well as data from two additional books in the anne of green gables series:
I had originally missed those books since they didn’t start with “Anne”
This model is definitely an improvement from the previous one, being much more consistent in its output. It is also more sensitive to punctuation, comparitively.
The best example of this model at work is probably its attempt at reading a page from Harry Potter and the Order of the Phoenix as part of the entire autoread system (camera and ocr included).
More audio samples
The birch canoe slid on the smooth planks.Glue the sheet to the dark blue background.It's easy to tell the depth of a well.These days a chicken leg is a rare dish.Rice is often served in round bowls.The juice of lemons makes fine punch.The box was thrown beside the parked truck.The hogs were fed chopped corn and garbage.Four hours of steady work faced us.Large size in stockings is hard to sell.however creating audiobooks can be expensivehowever, creating audiobooks can be expensivehowever, creating audiobooks can be expensive! Twitter Facebook LinkedIn
I modified the aggressiveness from 3 to 1. Training history can be found here for a run which used the LJSpeech mode I trained, and here for a model trained ...
Cheryl and I spent some time taking a bit of a break from training models to try and figure out what it is. One big thing could be an issue with long sentenc...
I decided to try yet another dataset, this time the Blizzard dataset. The tensorboard results can be seen here. With this dataset, it is again successful at ...
I suspect the most significant issue is the forced alignment being slightly off on two accounts:
It might sometimes just be plain wrong
It is common for...