Posts by Category

blog

less aggressive silence filtering

2 minute read

I modified the aggressiveness from 3 to 1. Training history can be found here for a run which used the LJSpeech mode I trained, and here for a model trained ...

webrtcvad and Shorter Sections

1 minute read

Cheryl and I spent some time taking a bit of a break from training models to try and figure out what it is. One big thing could be an issue with long sentenc...

Blizzard

less than 1 minute read

I decided to try yet another dataset, this time the Blizzard dataset. The tensorboard results can be seen here. With this dataset, it is again successful at ...

Transfer Learning and Hand-Labelling

1 minute read

I suspect the most significant issue is the forced alignment being slightly off on two accounts: It might sometimes just be plain wrong It is common for...

2021-03-06

less than 1 minute read

I tried to retrain again with the Karen Savage data after changing each line to be separated by each sentence. The goal here is to allow more empty space bet...

LJSpeech

1 minute read

To narrow down the problem further, I trained another model using the LJSpeech dataset but my own text preprocessing pipeline. Very annoyingly enough, the mo...

Camera Issues

less than 1 minute read

I got around to testing our webcam, and the results were dissapointing to say the least.

New Dataset

1 minute read

I investigated the previous preprocessing pipeline and there didn’t seem to be any issues. However, just in case, I decided to switch to using the mel spectr...

Camera stand design review

less than 1 minute read

The ongoing pandemic situation has led to a reconsideration of the previous camera stand design from fall 2020. At this point we are almost certainly abandon...

Model Selection

1 minute read

The original plan was to train first with flowtron since it offers multi-voice training. However, the data for that would be more difficult to gather as it r...

Data Verification

less than 1 minute read

I wanted to check the accuracy of aeneas, but definitely did not want to be manually looking at the json results and checking each word and its timestamp to ...

Data Cleaning

1 minute read

The main dataset we’ll use to start off with is The Adventures of Tom Sawyer read by John Greeman.

Back to Top ↑