Speech:Scripts


 * Home
 * Semesters - Project Work by Semester
 * Information
 * Experiments - List of speech experiments

Project Notes

 * Unix Notes
 * Active Directory
 * Backups
 * Network Bridge
 * Speech S/W Installation
 * Speech Corpus Setup
 * Switchboard Data Notes
 * Experiment Setup
 * [Scripts Page]
 * Model Building
 * Step 1: Run a Train
 * Step 2: Create the Language Model
 * Step 3: Run the Decode

Scripts Page

 * checkTrain - This Script checks the training trans against the .sph files to see if they match.
 * clone_exp - This script will clone an experiment.
 * convert - This script makes symbolic links to all required sph files from a transcript file located in a corpus directory.
 * copySph - This script will make symbolic links to all the required sph files that are noted in a transcript file located in a particular corpus directory.
 * createTranscript - This script will create a transcript where the spoken dialog lasts for the amount of time specified by length_of_time.
 * createSubTranscript - This script extends createTranscript further by using the same algorithm we used to calculate the corpus size. It takes three arguments, the base transcript, the duration in hours and the start time in hours.
 * dictionary - Compares list of words in file to words in a dictionary and outputs words available with pronunciations.
 * find - Looks like this just searches for a term in the cmudict.0.6d dictionary specifically.
 * gen_errors - This script is used when training the acoustic model for an experiment.
 * genFileIDs - This script will generate wave file ID's for transcripts.
 * GenTrans - This script generates transcripts and wave files (several versions exists)
 * lm_create - This script creates a Sphinx Language Model from a text file.
 * master_run_train - ( new for 2014 ) Training Master script
 * parseDecode - This scripts generates a list of hypothesis (decoded) transcripts from decode output.
 * pruneDictionary - This script prunes master dictionary, creating a new dictionary with only words we are interested in.
 * train_01 - This script sets up the experiment directory.
 * train_02 - This script sets up the experiment configuration file.
 * trans_time - This script counts the lines and durations of a transcript file.
 * updateDict - This script takes a list of words and the according pronunciations, adds them in sorted order to dictionary.