This multiple component set consists of a text-to-speech
components to make your applications talk, and grammar, speech recognition,
voice command, and dictation components to make your applications listen and
understand. There is also a number of supporting components dialog box
and UI components to make speech
programming a snap.
The low level speech components offer the
greatest flexibility in design while the high level components take the
more RAD approach for fast and easy development. NOTE - The following
is not a complete list of features, it is only some highlights. For
a complete description see the DTalk.hlp file which is available in the demonstration
version downloads. DTALK
FEATURE HIGHLIGHTS |
| TTTS |
Low level text to speech
- This component offers the most flexibility for speech synthesis. |
 | Easy selection of the speech engine and audio
devices. |
 | Send wave output to speakers, file, direct x or a
custom audio output device. |
 | Support for all engine dialog boxes |
 | Complete control of all attributes including;
pitch, realtime, speed and volume |
 | Support for embedded speech tags |
 | Access to the internal phoneme strings |
|
| TVTxt |
High level text to speech
- Voice text adds higher level functionality including a text to speech
"site" for central control of TTS attributes. |
 | Centralized control of all Voice text TTS applications on the
system through the voice text site.. |
 | Easily pause, resume, rewind, and fast forward
output. |
 | Includes may published events including a visual
event for animating the TLip component. |
 | Control of speech priority to resolve conflicts
when multiple applications are trying to speak at once. |
 | Runtime control of the voice's speed and pitch |
 | Support for embedded speech tags |
|
| TSR |
Low level speech
recognition - Provides low level access to the SR engine. |
 | One engine instance can be associated with many
TGrammar components |
 | Audio input can be from a microphone, a wave file,
Direct X or a custom audio source. |
 | Easy selection of speech engine and audio source. |
 | Control over the various engine attributes of
AutoGain, Echo, EnergyFloor, RealTime, Threshold and TimeOut. |
 | Support for Speaker and Microphone selection and
creation. |
 | Support for all engine dialog boxes |
|
| TGrammar |
Low level speech recognition
grammar- use with TSR to define the grammars for any kind of speech
recognition, command and control or dictation. |
 | Built in grammar complier |
 | Design time grammar tester |
 | Runtime lists so grammars can be modified at
runtime without out recompiling |
 | Used for any type of grammar; context free, command
and control, continuous dictation, or limited domain. |
 | Individual grammar rules can be activated or
deactivated at runtime. |
 | Supports both native engine and SAPI compiled
grammars |
 | Support for speaker training |
 | Complete recognition results objects are returned
in each recognition so, for example, the audio wav of what was
spoken can be retained. |
|
| TVCmd |
High level speech
recognition for command and control - the fastest way to create
a command and control application |
 | Voice menus that can be easily created at either
design time or run time. |
 | Runtime lists so voice commands can be modified at
runtime without recompiling |
 | Site support for central control of all command and
control attributes |
 | Command menus can be global to the system, only
active for a specified window or only listening when the speech
engine is asleep |
|
| TVDct |
High level speech
recognition for continuous dictation - the fastest way to
implement continuous dictation. |
 | Invisible virtual edit box lets you easily keep
track of what was said |
 | Visual correction floating dialog |
 | Ability to play back what was said |
 | Supports custom topic and command and control
grammars |
 | Supports glossary entries |
 | Inverse text normalization |
 | Dictation site for central control of all dictation
applications |
|
| TGramComp |
Grammar
Compiler - for runtime grammar creation and modification. |
 | Fast compiles |
 | Supports runtime lists |
 | Output to memory, file or OLE stream |
 | Input from memory, file, resource or OLE stream |
 | Phrase parser for special features |
|
| TMicWizard |
Microphone
Setup Wizard - implements the standard SAPI wizard. |
|
| TLexicon |
Allows
editing of the lexicon for any of the SR or TTS speech
components |
 | Add a word |
 | Get a word |
 | Remove a word |
|
| TModeDlg
|
A dialog box component to allow the user easy
engine mode enumeration and selection. |
|
| TAuDevDlg |
A dialog box component to allow the user easy
audio device enumeration and selection. |
|
| TLip |
Animated
graphic lips that can respond to text to speech events. |
 | Adjustable lip, tongue, throat, and teeth color |
 | Male or female mouth shape |
 | High and low resolution lips |
|