This appendix contains several listings of SAPI-related resources, including books, Web links, other online resources, and software and hardware vendors.
Tip |
For the most recent list of SAPI-related resources, including updated books, Web links, other online resources, and SAPI software and hardware, use your Web browser to connect to the Communications Developer's Guide home page at www.iac.net/~mamund/mstdg/index.htm |
Table B.1 provides a sample list of books available at most large
libraries. It is not a definitive list, but it is a good representation
of writings on the subject of SAPI.
Title | Author | Notes |
Electronically Hearing: Computer Speech Recognition | Cater, John P. | Speech processing systems. |
Electronically Speaking: Computer Speech Generation | Cater, John P. | Speech processing systems. Speech synthesis. |
Computer Speech Processing | Fallside, Frank | Speech processing systems. Automatic speech recognition. Speech synthesis. |
Hidden Markov Models for Speech Recognition | Huang, X. D. | Automatic speech recognition. Speech recognition by computer systems. |
Designing with Speech Processing Chips | Jimenez, Ricardo | Integrated circuits, design and construction, data processing. Speech processing systems. Computer-aided design. |
Artificial Neural Networks for Speech and Vision, 1st Ed. | Mammone, Richard J. | Neural networks (computer science). Automatic speech recognition. |
Neural Networks and Speech Processing | Morgan, David P. | |
Principles of Computer Speech | Witten, Ian H. | Speech processing systems. Speech synthesis. |
Note |
A more complete listing of books, including publisher data, can be found in the BOOKLIST table of the CDGLISTS.MDB database. You can use Microsoft Access, Microsoft Query, or any Access/Visual Basic database product to read this data file. |
Table B.2 lists World Wide Web links related to SAPI. This list
is constantly changing. An updated list is contained in the SAPIWEB.htm
data file in the RESOURCE
folder on the companion CD-ROM. This file can be loaded into almost
any Web browser and used as a launch document to connect to the
associated links.
Web Title | Web Address | Description |
Access First | www.inforamp.net/~access/af1.html | We service the visually impaired and blind community in Canada. We deal with text-to-speech, screen-magnification, and voice-recognition systems. We also train in these areas. |
AudioWav | www.audiowav.com/ | Now you can create and publish audio files to any destination on the Net by simply using your touch-tone phone with RealAudio, IWave, and TrueSpeech. |
Cobotyx | www.cobotyx.com/ | PC-based systems for voice processing, computer telephony (CTI), voice mail, auto attendant, text to speech, voice networking, interactive voice response (IVR), and fax mail technologies. |
Colibri | colibri.let.ruu.nl | An electronic newsletter and WWW service for people interested in the fields of language, speech, logic, and information. |
Command Corp | www.commandcorp.com/incube_welcome.html | Speech recognition on the Web. |
comp.speech WWW site [Australia] | fortis.speech.su.oz.au/comp.speech/ | Information on speech technology products and software. |
comp.speech WWW site [UK] | svr-www.eng.cam.ac.uk/comp.speech/ | Information on speech technology products and software. |
Digital Dreams | emf.net/~dreams/ | Offers speech-recognition plug-ins for multimedia developers. Current support for Macromedia Director and Hypercard. Demos available online. |
Dragon Systems, Inc. | www.dragonsys.com/ | DragonDictate for Windows and DOS. Discrete and continuous speech dictation and command/control products for the PC. Also links to other SR sites. |
DragonDictate | www.waterw.com/~jkornit/ | A speech-recognition software package that allows you to dictate directly into your DOS or Windows applications! |
Eloquent Technology, Inc. (Eloquence) | www.fcinet.com/eti/index.html | Natural-sounding Text To Speech synthesis engine with Developer's Kit for several computer platforms. |
First Byte | www.firstbyte.davd.com/ | ProVoice Developers Tool Kit, Text To Speech synthesis for a wide variety of computers used in popular programs such as Monologue. |
GT Technology, Inc. | www.portal.com/~gt-tech/ | Provides consulting services in the field of error correction, image, and data and speech compression for computer and communication companies. |
Gus Communications, Inc. | www.gusinc.com | Speech disorders, augmentative communication. |
Index - Speech | mambo.ucsc.edu/psl/speech.html | |
Kolvox Communications Inc. | www.kolvox.com/ | Applications for speech input, recognition, and voice control, based on Kurzweil, IBM, Dragon, and other engines. Reseller inquiries welcome. |
Kurzweil Applied Intelligence | www.kurz-ai.com/ | Develops, markets, and supports automated speech-recognition systems used to create documents and interact with personal computers by voice. |
Lakewood Desktop Publishing | accessone.com/~mrbones/ | A licensed reseller of Kolvox Speech Systems. Control your Windows applications with your voice. Dictate up to 60 WPM. |
MIT - Lyon Speech Transcript | sap.mit.edu/projects/mit-lyon/project.html | A multimedia collaborative project about communication between Lyon, France and MIT, Cambridge. Speech recognition, translation, speech synthesis, and video are parts of the exhibition. |
PureSpeech, Inc. | www.speech.com/ | Speaker-independent recognition with natural language processing for personal computers and computer telephony applications. |
Speech Systems, Inc. | www.speechsys.com/ | Speech-recognition products, services, and technology. |
Talk Technology, Inc. | www.usbusiness.com/talk/ | Provides speech- and voice-recognition systems to doctors, lawyers, and people with repetitive strain injury. Wide range of software, including DragonDictate. |
Toolz 2000 | www.earthlink.net/~webwizard/toolz.html | Speech-recording software and pro audio sampler/hard disk recording software for Macintosh. |
UltraMedia Systems International | www.infi.net/~ums/ | Voice Recognition IBM VoiceType Dictation Technology. |
Verbex Voice Systems | www.txdirect.net/verbex/ | Download a freeware working demonstration of the continuous-speech voice-recognition system. |
Voice Processing Corporation | www.vpro.com/ | A speech-recognition company that provides voice engines for speech-enabling telephony and desktop applications. |
Note |
A more complete listing of these links can also be found in the WEBLIST table of the CDGLISTS.MDB database on the companion CD-ROM. You can use the database table to search for specific vendors, products, and so on. |
The following is a list of SAPI-related topics that you can browse on CompuServe:
Forum or Topic Name |
WinCim Add-on - VOICE-MAIL |
Disabilities Forum |
IBM PSP A Product Forum |
IBM VoiceType Forum |
Windows Third-Party A Forum |
Windows Third-Party App H Forum |
Windows Extensions Forum |
An extensive list of topics for The Microsoft Network can be found in the SAPIMSN folder on the CD-ROM. This contains a list of MSN shortcuts. If you are an MSN subscriber, you can click these icons to connect directly to the topic area on MSN.
Table B.3 lists software and hardware vendors who are currently providing, or have pledged to provide, SAPI-compliant software and/or hardware products. A more complete list of vendors, including contact names, addresses, and phone numbers, can be found in the PRODUCTS table of the CDGLISTS.MDB database on the companion CD-ROM. You can use the data table to perform searches for selected products or vendors.
In the table there are three types of entries: SR (speech-recognition engines), TTS (text-to-speech engines), and API.
Type | Company | Products |
SR | Advanced Recognition Technologies, Inc. | smARTspeak for Windows, a compact speech-recognition engine suitable for command and control applications. Typical applications will be navigation of menus, phone and/or fax dialing, educational software, interactive games, and multimedia titles. |
SR | AT&T | WATSON-Advanced Speech Applications Platform. Available for Beta release in 4Q95, WATSON incorporates AT&T-patented BLASR Speech Recognition and FlexTalk Speech Synthesis technologies in a software product running under Microsoft Windows 95. |
SR | Cambridge Group Research, Ltd. | Cambridge Voice for Windows. The Cambridge Voice for Windows speech-recognition engine supports true speaker-independent recognition of continuous speech in real time. The engine does not require any speaker training. |
SR | IBM Corporation | Source for IBM VoiceType technology information. |
SR | Kurzweil Applied Intelligence, Inc. | Kurzweil VOICE for Windows Release 1.5 is the latest version of Kurzweil AI's award-winning voice-recognition system for Microsoft Windows. |
SR | Lernout & Hauspie Speech Products | ASR SDK for the L&H.asr1000M (computer/multimedia) and the L&H.1000T (telephony/telecommunications). L&H SDK for Automatic Speech Recognition is a speaker-independent recognizer that recognizes natural and fluently spoken words. |
SR | PureSpeech, Inc. | PureSpeech 2.2 Recognizer. The PureSpeech engine permits speaker-independent, continuous speech recognition. No user training is required. Both microphones and recognition over telephone lines are supported. |
SR | Speech Systems, Inc. | VoiceMatch System Development Kit (SDK) SpeechWizard. The VoiceMatch SDK is a suite of development tools for creating speech-aware applications in Microsoft Windows. |
SR | Verbex Voice Systems, Inc. | VISE, FlexVISE. The Verbex Integrated Speech Engine (VISE) is a high-performance, speaker-independent, speaker-trainable, continuous speech-recognition engine widely used in industrial, home, and office environments. |
SR | Voice Control Systems, Inc. | |
SR | Voice Processing Corporation | |
TTS | AT&T | WATSON-Advanced Speech Applications Platform. Available for Beta release in 4Q95, WATSON incorporates AT&T-patented BLASR speech recognition and FlexTalk speech synthesis technologies in a software product running under Microsoft Windows 95. |
TTS | Berkeley Speech Technologies, Inc. | BeSTspeech T-T-S is text-to-speech conversion software. It creates computer-synthesized speech output in a wide variety of languages. |
TTS | Cambridge Group Research, Ltd. | The Cambridge Voice for Windows text-to-speech engine provides synthetic speech with realistic, natural intonation. The engine uses linguistic rules and sentence analysis. |
TTS | Centigram Communications Corporation | TruVoice Text-to-Speech Converter is a premium-quality text-to-speech product that converts any written text into natural-sounding, intelligible, spoken American English, German, Spanish, Italian, or French. |
TTS | Digital Equipment Corporation | DECtalk SDK for Windows 95. The DECtalk text-to-speech engine for Microsoft® Windows 95 provides highly intelligible, natural-sounding, synthesized speech from freeform text input. |
TTS | FirstByte | ProVoice Developers Kit for WIN 32. FirstByte, the OEM sales leader in text-to-speech software, offers development kits for the Microsoft Win32 Application Programming Interface (API), Microsoft Windows 3.1, Macintosh, OS/2, and embedded systems. |
TTS | Lernout & Hauspie Speech Products | TTS SDK for telephony and desktop applications. L&H Text-To-Speech SDK is a multiple-language software- development package that converts any computer-readable text into natural-sounding synthetic speech. |
TTS | SoftVoice, Inc. | SoftVoice, the developers of Apple Computer's original MacinTalk speech synthesizer and the Amiga's Narrator device, has developed a Windows version of its state-of-the-art, multilingual system. |
TTS | Telefonica I+D | Spanish TTS engine for Windows 95. The Telefonica I+D text-to-speech engine for Windows 95 produces Spanish synthetic speech from unrestricted text. The synthetic output is of very high quality: highly intelligible and very natural. |
TTS | Telia Promotor AB | Infovox 230. State-of-the-art text-to-speech conversion software for Microsoft Windows 95. Available for British English, American English, German, French, Spanish, Italian, Norwegian, Swedish, Danish, Finnish, and Icelandic. |
API | AT&T Microelectronics | |
API | Berkeley Speech Technologies, Inc. | |
API | Centigram Communications Corporation | |
API | Creative Technology Ltd. | |
API | Digital Equipment Corporation | |
API | Dragon Systems, Inc. | |
API | Eloquent Technology, Inc. | |
API | First Byte | |
API | Kolvox Communications | |
API | Kurzweil Applied Intelligence, Inc. | |
API | Lernout & Hauspie Speech Products | |
API | Scott Instruments Corporation | |
API | Speech Systems, Inc. | |
API | S Systems, Inc. | |
API | Telia Promotor Infovox AB | |
API | Verbex Voice Systems | |
API | Voice Processing Corporation | |
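
The vendor searches mentioned for the PRODUCTS table can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it loads a handful of rows copied from Table B.3 into an in-memory SQLite table rather than opening CDGLISTS.MDB, and the column names (`type`, `company`, `products`) and the helper names `build_products_db` and `find_vendors` are hypothetical, not the actual schema or tools on the CD-ROM.

```python
import sqlite3

# A few rows copied from Table B.3. The column layout is an assumption,
# not the real schema of the PRODUCTS table in CDGLISTS.MDB.
ROWS = [
    ("SR", "IBM Corporation", "Source for IBM VoiceType technology information."),
    ("SR", "PureSpeech, Inc.", "PureSpeech 2.2 Recognizer."),
    ("TTS", "Digital Equipment Corporation", "DECtalk SDK for Windows 95."),
    ("TTS", "Telefonica I+D", "Spanish TTS engine for Windows 95."),
    ("API", "Dragon Systems, Inc.", ""),
]


def build_products_db(rows):
    """Load vendor rows into an in-memory SQLite table (a stand-in for the MDB)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE products (type TEXT, company TEXT, products TEXT)")
    conn.executemany("INSERT INTO products VALUES (?, ?, ?)", rows)
    return conn


def find_vendors(conn, entry_type=None, keyword=None):
    """Return company names, optionally filtered by entry type and keyword."""
    sql = "SELECT company FROM products WHERE 1=1"
    params = []
    if entry_type is not None:
        sql += " AND type = ?"
        params.append(entry_type)
    if keyword is not None:
        # Match the keyword in either the company name or the product text.
        sql += " AND (company LIKE ? OR products LIKE ?)"
        params.extend([f"%{keyword}%"] * 2)
    sql += " ORDER BY company"
    return [row[0] for row in conn.execute(sql, params)]


if __name__ == "__main__":
    conn = build_products_db(ROWS)
    print(find_vendors(conn, entry_type="TTS"))
    print(find_vendors(conn, keyword="DECtalk"))
```

The same kind of parameterized query could be run against the real database from Microsoft Access or Microsoft Query once the actual column names are known.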