From Cricket's Garden

Nonfic: Shorthand Dictation

Shorthand is a hobby of mine. Unfortunately, dictating at the slow speeds I need is not a hobby of anyone I know.

Hubby has written an online utility which, combined with a (free) voice synthesizer, will dictate text at the speed you choose.

It does this by adding time between the words rather than slowing down the words. (Slowing down the words too much becomes incomprehensible.)

http://onebit.ca/cgi-bin/short.py

Functional rather than fancy, but reliable. (I notice he's even put in code for invalid entries, but haven't tested it.)

Quick Instructions

Enter the text on the page as the form says. Hit "Translate".

Copy the resulting text into a text-to-speech program such as Cepstral. The program has to recognize SSML (Speech Synthesis Markup Language, similar to HTML and XML). More here: http://www.cepstral.com/cgi-bin/support?page=ssml

Press "Play."

More Details

The program replaces spaces with the "break" command. Not sure what he did with other whitespace or what happens if there is more than one space in a row. (He also did dishes that night, so I didn't get too picky.)

It adds more time after periods and other punctuation. When typing, leave out the period after "Mr" so it doesn't get the longer pause. (That follows modern British punctuation.)
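For the curious, here is a minimal sketch of the idea in Python. It is not the actual code behind the utility: the function name, the default delay, and the rule of doubling the pause after punctuation are all assumptions made for illustration, and it treats any run of whitespace as a single gap. It also writes the tag in the self-closing form, <break time="500ms"/>, which is what strict XML expects.

```python
from xml.sax.saxutils import escape

def add_breaks(text, delay_ms=500, punctuation_factor=2):
    """Put an SSML <break> tag in every gap between words.

    delay_ms and punctuation_factor are illustrative guesses,
    not the values the real utility uses.
    """
    words = text.split()  # assumption: any run of whitespace is one gap
    pieces = []
    for i, word in enumerate(words):
        pieces.append(escape(word))  # keep stray & or < from breaking the markup
        if i == len(words) - 1:
            break  # no pause after the last word
        # assumption: a word ending in punctuation gets a longer pause
        ends_clause = word[-1] in ".,;:!?"
        pause = delay_ms * punctuation_factor if ends_clause else delay_ms
        pieces.append(f'<break time="{pause}ms"/>')
    return " ".join(pieces)

print(add_breaks("Dear Mr Smith, thank you."))
```

With the defaults, "Smith," is followed by a 1000ms break and every other gap by a 500ms break. Because "Mr" has no period, it gets only the ordinary pause.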

I use Cepstral's SwiftTalker. http://www.cepstral.com/downloads/

They have an online demo at http://www.cepstral.com/demos/ that sends you a wav file.

To get SwiftTalker, download and install any of their free voices; it gets installed with the voice. If you use a free voice, it sticks a "buy me" message into the audio every now and then.

SwiftTalker will play it immediately or save it to a wav file (File / Export). Audacity or Nero should be able to convert it to MP3 or a CD or whatever.

You may have to go into "tools / options / text handling" so it handles SSML. You can also play around with the speed in WPM. Lots of things to play with.

Note: The utility's form asks for a "delay", but Cepstral uses "delay" to mean something else. The SSML code is actually <break time="500ms">. "Break" means something else in the programming language, so he chose "delay". I love him anyways.

Speeds

It's impossible to say which delay value will give what speed. There are too many variables.

I chose a voice, and set my goals in "delay" rather than wpm. I only convert back to wpm when talking to other humans, or when I want to increase my speed by a given amount.

This is the chart I use for converting:

wpm    delay
40     700
50     450
60     300
70     200
80     125
90     60
100    10, 7, 5, 1
140    without delay codes

The chart was made as follows:

Paragraph 7-54 from Gregg Shorthand Manual Simplified, Second Edition. It claims to have 52 words. Yes, not a very large sample.

Lawrence voice at 170 wpm.

I ran it through the program to put in breaks, then exported the audio file. I then removed the "register me" messages (using Audacity), and did the math.

You'll notice that, although I told it 170wpm, it gave me 140. Other voices gave other results.
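The math is simply words divided by minutes. A small sketch in Python, with made-up function names, shows how long the 52-word paragraph should take at the two speeds mentioned here:

```python
def wpm(words, seconds):
    """Words per minute for a passage of `words` words lasting `seconds`."""
    return words * 60 / seconds

def seconds_for(words, target_wpm):
    """How long `words` words last when spoken at `target_wpm`."""
    return words * 60 / target_wpm

# The 52-word paragraph at the speed requested and the speed delivered
print(round(seconds_for(52, 170), 1))  # 18.4
print(round(seconds_for(52, 140), 1))  # 22.3
```

To measure a file of your own, time the trimmed audio and pass the word count and the seconds to wpm().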

Musings

Schools count 1.4 syllables as one word, because a passage with many long words will take longer to write.

However, that doesn't give a good indication of your speed in the field.

If you're familiar with the field, you will have short forms for the longest words, so a 5-syllable word would be easier than 5 single-syllable words.

Swem himself said that cases with lots of numbers were difficult to record. You can't phrase them.

The championship speeds for literary material are much slower than for Q&A, which are slower than court proceedings. Same writers, same days. Proof that the material makes a huge difference.

Natural human speech phrases things differently than machine speech. Trust me on this one.

So, while it's useful for comparing your progress, an accurate wpm isn't really necessary with this program.

If you're preparing for an exam, it's best to spend the money on the examiners' own practice material for your final preparation.

Phrasing

You can omit the break between two words by replacing the space with a dash. The two words will be said together. This will reduce the total dictation time.
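As a rough illustration of the saving, the sketch below counts one break per gap between whitespace-separated chunks, so a dashed pair counts as a single chunk. The function names are invented for the example, and it ignores the extra time added after punctuation.

```python
def break_count(text):
    """One pause per gap between whitespace-separated chunks."""
    return max(len(text.split()) - 1, 0)

def pause_seconds(text, delay_ms):
    """Total silence added to the passage, ignoring punctuation extras."""
    return break_count(text) * delay_ms / 1000

plain = "thank you for your letter"
phrased = "thank-you for-your letter"

print(pause_seconds(plain, 500))    # 2.0 (four breaks)
print(pause_seconds(phrased, 500))  # 1.0 (two breaks)
```

Each dash removes exactly one delay's worth of silence, so heavy phrasing at a long delay shortens the dictation noticeably.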

Sample .WAV, Summary

Here are two sample .wav files, using Cepstral's David. They are the same passage. The slower file was created at a lower frequency/quality, hence the drastic difference in file size.

rabbit300.wav
Delay of 300ms between words. File is over 3MB.
rabbit1000.wav
Delay of 1000ms between words. File is just under 1MB.

Speed Results, Summary

Cepstral's own wpm seems to use 1.4 syllables/word. (Thanks sidhetaba for those experiments.)

The free David voice at a delay of 750 and wpm of 170 (the default) gives 45-50 wpm.

If you experiment, please add your observations in the comment field. I will merge them into the text as I have time.

Speeds, Early Experiments

My first experiments used a piece 70 words long, 89 syllables, for an SI of 1.27. Cepstral was set at 170wpm.
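Here SI is taken to mean syllabic intensity, that is, syllables per word. A quick check of the arithmetic in Python, with illustrative function names, also shows how many 1.4-syllable "standard words" the same piece contains:

```python
def syllabic_intensity(words, syllables):
    """Average syllables per word."""
    return syllables / words

def standard_words(syllables, syllables_per_word=1.4):
    """Word count when every 1.4 syllables is counted as one word."""
    return syllables / syllables_per_word

print(round(syllabic_intensity(70, 89), 2))  # 1.27
print(round(standard_words(89), 1))          # 63.6
```

So the 70 actual words count as fewer than 64 standard words, which is why a wpm figure based on 1.4 syllables per word comes out lower for this piece than a plain word count.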

Any speed over 100wpm can be done through Cepstral rather than this program, which saves a step.

delay    net time    wpm    spm
1        24          175    223
100      40          111    141
150      43          100    127
200      54          93     119
250      49          88     111
500      73          65     82
700      87          53     68
750      84          51     64
1000     102         42     53
1250     119         36     45
1500     140         32     40
1750     164         27     34

If anyone can fit a formula showing words, syllables (or SI), and delay to get a given WPM, I'd appreciate it.
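One possible starting point, offered as a sketch rather than an answer: assume the length of the audio grows in a straight line with the delay, since each gap adds roughly one delay's worth of silence. The Python below turns each wpm in the table above back into seconds for the 70-word piece, fits a line by least squares, and inverts it. The straight-line model and all the names are assumptions, and the fit only describes this piece with this voice at this SwiftTalker setting.

```python
WORDS = 70  # length of the test piece

# (delay, wpm) pairs copied from the table above
DATA = [(1, 175), (100, 111), (150, 100), (200, 93), (250, 88), (500, 65),
        (700, 53), (750, 51), (1000, 42), (1250, 36), (1500, 32), (1750, 27)]

def fit_line(points):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    b = sxy / sxx
    return mean_y - b * mean_x, b

# Convert each wpm back to seconds of audio, then fit seconds against delay.
A, B = fit_line([(d, 60 * WORDS / w) for d, w in DATA])

def predicted_wpm(delay):
    """Speed the fitted line predicts for a given delay."""
    return 60 * WORDS / (A + B * delay)

def delay_for_wpm(target_wpm):
    """Delay the fitted line suggests for a target speed."""
    return (60 * WORDS / target_wpm - A) / B

print(round(delay_for_wpm(60)))
```

For 60 wpm the suggested delay falls between the 500 and 700 rows, as the table would lead you to expect. A passage with a different word count or SI would need its own fit, because both the speaking time and the number of gaps change.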


Feedback is wanted. If you want to email rather than comment here, see the sidebar.

Comments

Pat O - 23 May 2009, 09:14

Hi Cricket, Is your 'other half's' program a free utility or a paid-for utility? I'm finding that I am running out of online dictation material [of the 'free' variety] anyway. There's quite a bit at the slower speeds and then some at the 180 wpm plus end but I'm at about 120-130 w.p.m. [which as I put on another shorthand site is precisely where I was a quarter of a century ago - but my excuse is I don't use it every day in my job anymore].

Cricket - 21 January 2008, 11:31

We've updated it using several comments from Frances. It now does three speeds at once, and leaves longer gaps for punctuation.

frances28 - 29 December 2007, 17:41

Hi Cricket,

I love your little program and have used it with Cepstral voice 'Lawrence' (being British) purchased, so without the inserted nags. My results are a bit different from yours but maybe Lawrence speaks more slowly(!). I went for an SI of 1.30 and experimented with various delays to get a 2 minute speech at different wpm. Results were as follows:

Delay    wpm
10       110
50       100
120      90
200      80
300      70
440      60

The formula to calculate any speed using Lawrence based on these results is y = 107.9107 * e^(-0.0013902x), where y = wpm and x = delay in milliseconds, so you need to be pretty hot at maths to get other speeds, I'm afraid.

Good luck with it - it's a really useful tool for shorthand learners.

Cricket - 10 October 2007, 12:06

More test results:

Again using the free David voice, not including the time for the initial "buy me" bit. Including the time for the ads after that, but not counting those words. SwiftTalker set for 170 wpm, the default. Using all the text in chapter 2 Anni, about 768 words. A word is a word; I didn't count syllables.

Delay    WPM
1600     20
1250     33
900      40
650      50
450      60
300      70

Retrieved from http://www.cricket.onebit.ca/Nonfic/DictProg
Page last modified on May 23, 2009, at 09:14 AM