Less aggressive silence filtering

I lowered the silence-filtering aggressiveness from 3 to 1. Training history can be found here for a run that started from the LJSpeech model I trained, and here for a model trained from scratch. Both models are starting to read out words consistently. Interestingly, the model trained from scratch seems to be performing better.
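
To make the effect of that setting concrete, here is a minimal sketch of frame-based silence filtering. It is not the filter used for these runs: the post doesn't name one (a 0 to 3 scale matches the modes of the WebRTC voice activity detector, but that is a guess). This stand-in uses a plain RMS energy threshold, and the names `RMS_THRESHOLDS`, `filter_silence`, and `make_test_signal`, as well as the threshold values, are made up for illustration.

```python
import math

# Hypothetical mapping from aggressiveness (0-3) to an RMS threshold.
# Illustrative values only; not taken from any library or from the post.
RMS_THRESHOLDS = {0: 0.005, 1: 0.01, 2: 0.02, 3: 0.04}


def frame_rms(frame):
    """Root-mean-square amplitude of one frame of float samples."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def filter_silence(samples, aggressiveness, frame_len=480):
    """Drop every frame whose RMS falls below the threshold for this level.

    Higher aggressiveness means a higher threshold, so more frames
    (including quiet speech) are treated as silence and removed.
    """
    if aggressiveness not in RMS_THRESHOLDS:
        raise ValueError("aggressiveness must be 0, 1, 2 or 3")
    threshold = RMS_THRESHOLDS[aggressiveness]
    kept = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if frame and frame_rms(frame) >= threshold:
            kept.extend(frame)
    return kept


def make_test_signal(frame_len=480, sample_rate=16000, freq=200.0):
    """One loud frame, one quiet frame, one frame of digital silence."""
    def tone(amplitude):
        return [amplitude * math.sin(2 * math.pi * freq * n / sample_rate)
                for n in range(frame_len)]
    return tone(0.5) + tone(0.02) + [0.0] * frame_len


signal = make_test_signal()
kept_at_3 = filter_silence(signal, aggressiveness=3)
kept_at_1 = filter_silence(signal, aggressiveness=1)
print(len(signal), len(kept_at_3), len(kept_at_1))
```

In this toy signal the quiet middle frame survives at level 1 but is cut at level 3, which is the kind of trade-off the setting controls: a stricter filter removes more dead air but can also clip soft speech.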

Unfortunately, the model doesn't seem to differentiate much between punctuation marks. The last three samples in each list differ only in their punctuation.

Transfer-learned model

The birch canoe slid on the smooth planks.
Glue the sheet to the dark blue background.
It's easy to tell the depth of a well.
These days a chicken leg is a rare dish.
Rice is often served in round bowls.
The juice of lemons makes fine punch.
The box was thrown beside the parked truck.
The hogs were fed chopped corn and garbage.
Four hours of steady work faced us.
Large size in stockings is hard to sell.
however creating audiobooks can be expensive
however, creating audiobooks can be expensive
however, creating audiobooks can be expensive!

Model from scratch

The birch canoe slid on the smooth planks.
Glue the sheet to the dark blue background.
It's easy to tell the depth of a well.
These days a chicken leg is a rare dish.
Rice is often served in round bowls.
The juice of lemons makes fine punch.
The box was thrown beside the parked truck.
The hogs were fed chopped corn and garbage.
Four hours of steady work faced us.
Large size in stockings is hard to sell.
however creating audiobooks can be expensive
however, creating audiobooks can be expensive
however, creating audiobooks can be expensive!