even less aggressive silence filtering and more data


TensorBoard data can be found here. This version used even less aggressive silence filtering (the aggressiveness value was set to 0 this time), as well as data from two additional books in the Anne of Green Gables series.

I had originally missed those books since their titles didn’t start with “Anne”.
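For a rough idea of what the aggressiveness setting controls, here is a minimal sketch of frame-based silence filtering. It is not the actual filter used for this model: the post does not name the tool, so the energy-based approach, the `filter_silence` function, and the threshold table are all assumptions made for illustration. The only thing carried over is the idea that level 0 is the least aggressive setting, so it removes the least audio.

```python
import math

# Hypothetical mapping from aggressiveness level (0 = least aggressive) to an
# RMS threshold. These numbers are illustrative, not taken from the real filter.
RMS_THRESHOLDS = {0: 0.01, 1: 0.02, 2: 0.04, 3: 0.08}


def frame_rms(frame):
    """Root-mean-square amplitude of one frame of float samples in [-1, 1]."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def filter_silence(samples, sample_rate=16000, frame_ms=30, aggressiveness=0):
    """Drop every frame whose RMS is below the threshold for the given level."""
    threshold = RMS_THRESHOLDS[aggressiveness]
    frame_len = sample_rate * frame_ms // 1000
    kept = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if frame and frame_rms(frame) >= threshold:
            kept.extend(frame)
    return kept


# Three 30 ms frames at 16 kHz: pure silence, quiet speech, loud speech.
audio = [0.0] * 480 + [0.03] * 480 + [0.5] * 480
gentle = filter_silence(audio, aggressiveness=0)  # keeps the quiet frame
harsh = filter_silence(audio, aggressiveness=2)   # drops the quiet frame too
```

With a lower level, quiet passages such as soft word endings and short pauses are more likely to survive in the training clips, at the cost of keeping more background noise.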

This model is definitely an improvement over the previous one: its output is much more consistent. It is also comparatively more sensitive to punctuation.

The best example of this model at work is probably its attempt at reading a page from Harry Potter and the Order of the Phoenix as part of the full autoread system (camera and OCR included).

More audio samples

The birch canoe slid on the smooth planks.
Glue the sheet to the dark blue background.
It's easy to tell the depth of a well.
These days a chicken leg is a rare dish.
Rice is often served in round bowls.
The juice of lemons makes fine punch.
The box was thrown beside the parked truck.
The hogs were fed chopped corn and garbage.
Four hours of steady work faced us.
Large size in stockings is hard to sell.
however creating audiobooks can be expensive
however, creating audiobooks can be expensive
however, creating audiobooks can be expensive!