aboutsummaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'README.md')
-rw-r--r--README.md11
1 files changed, 9 insertions, 2 deletions
diff --git a/README.md b/README.md
index c4c9da8..72301ac 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,11 @@
-# pgm
-Projet PGM
+# CTC implementation for Blocks and Theano
Thomas Mesnard, Alex Auvolat
+
+This repository contains an implementation of the CTC cost function (Graves et al., 2006). To avoid numerical underflow, two solutions are implemented:
+
+- Normalization of the alphas at each timestep
+- Calculations in the logarithmic domain
+
+This repository also contains sample code for applying CTC to two datasets, a simple dummy dataset constituted of artificial data, and code to use the TIMIT dataset. The model on the TIMIT dataset is able to learn up to 50% phoneme accuracy using no handcrafted processing of the signal, but instead uses an end-to-end model composed of convolutions, LSTMs, and the CTC cost function.
+