Appendix B

SAPI Resources


This appendix contains several listings of SAPI-related resources, including books, Web links, other online resources, and software and hardware resources.

Tip
For the most recent list of SAPI-related resources, including updated books, Web links, other online resources, and SAPI software and hardware, use your Web browser to connect to the Communications Developer's Guide home page at
www.iac.net/~mamund/mstdg/index.htm

Books

Table B.1 provides a sample list of books available at most large libraries. It is not a definitive list, but it is a good representation of writings on the subject of SAPI.

Table B.1. SAPI-related books.
Title | Author | Notes
Electronically Hearing: Computer Speech Recognition | Cater, John P. | Speech processing systems.
Electronically Speaking: Computer Speech Generation | Cater, John P. | Speech processing systems. Speech synthesis.
Computer Speech Processing | Fallside, Frank | Speech processing systems. Automatic speech recognition. Speech synthesis.
Hidden Markov Models for Speech Recognition | Huang, X. D. | Automatic speech recognition. Speech recognition by computer systems.
Designing with Speech Processing Chips | Jimenez, Ricardo | Integrated circuits, design and construction, data processing. Speech processing systems. Computer-aided design.
Artificial Neural Networks for Speech and Vision, 1st Ed. | Mammone, Richard J. | Neural networks (computer science). Automatic speech recognition.
Neural Networks and Speech Processing | Morgan, David P. |
Principles of Computer Speech | Witten, Ian H. | Speech processing systems. Speech synthesis.

Note
A more complete listing of books, including publisher data, can be found in the BOOKLIST table of the CDGLISTS.MDB database. You can use Microsoft Access, Microsoft Query, or any Access/Visual Basic database product to read this data file.

Web Links

Table B.2 lists World Wide Web links related to SAPI. This list is constantly changing. An updated list is contained in the SAPIWEB.htm data file in the RESOURCE folder on the companion CD-ROM. This file can be loaded into almost any Web browser and used as a launch document to connect to the associated links.

Table B.2. Computer speech links for SAPI.
Web Title | Web Address | Description
Access First | www.inforamp.net/~access/af1.html | We service the visually impaired and blind community in Canada. We deal with text-to-speech, screen-magnification, and voice-recognition systems. We also train in these areas.
AudioWav | www.audiowav.com/ | Now you can create and publish audio files to any destination on the Net by simply using your touch-tone phone with RealAudio, IWave, and TrueSpeech.
Cobotyx | www.cobotyx.com/ | PC-based systems for voice processing, computer telephony (CTI), voice mail, auto attendant, text to speech, voice networking, interactive voice response (IVR), and fax mail technologies.
Colibri | colibri.let.ruu.nl | An electronic newsletter and WWW service for people interested in the fields of language, speech, logic, and information.
Command Corp | www.commandcorp.com/incube_welcome.html | Speech recognition on the Web.
comp.speech WWW site [Australia] | fortis.speech.su.oz.au/comp.speech/ | Information on speech technology products and software.
comp.speech WWW site [UK] | svr-www.eng.cam.ac.uk/comp.speech/ | Information on speech technology products and software.
Digital Dreams | emf.net/~dreams/ | Offers speech-recognition plug-ins for multimedia developers. Current support for Macromedia Director and HyperCard. Demos available online.
Dragon Systems, Inc. | www.dragonsys.com/ | DragonDictate for Windows and DOS. Discrete and continuous speech dictation and command/control products for the PC. Also links to other SR sites.
DragonDictate | www.waterw.com/~jkornit/ | A speech-recognition software package that allows you to dictate directly into your DOS or Windows applications!
Eloquent Technology, Inc. (Eloquence) | www.fcinet.com/eti/index.html | Natural-sounding Text To Speech synthesis engine with Developer's Kit for several computer platforms.
First Byte | www.firstbyte.davd.com/ | ProVoice Developers Tool Kit, Text To Speech synthesis for a wide variety of computers used in popular programs such as Monologue.
GT Technology, Inc. | www.portal.com/~gt-tech/ | Provides consulting services in the field of error correction, image, and data and speech compression for computer and communication companies.
Gus Communications, Inc. | www.gusinc.com | Speech disorders, augmentative communication.
Index - Speech | mambo.ucsc.edu/psl/speech.html |
Kolvox Communications Inc. | www.kolvox.com/ | Applications for speech input, recognition, and voice control, based on Kurzweil, IBM, Dragon, and other engines. Reseller inquiries welcome.
Kurzweil Applied Intelligence | www.kurz-ai.com/ | Develops, markets, and supports automated speech-recognition systems used to create documents and interact with personal computers by voice.
Lakewood Desktop Publishing | accessone.com/~mrbones/ | A licensed reseller of Kolvox Speech Systems. Control your Windows applications with your voice. Dictate up to 60 WPM.
MIT - Lyon Speech Transcript | sap.mit.edu/projects/mit-lyon/project.html | A multimedia collaborative project about communication between Lyon, France and MIT, Cambridge. Speech recognition, translation, speech synthesis, and video are parts of the exhibition.
PureSpeech, Inc. | www.speech.com/ | Speaker-independent recognition with natural language processing for personal computers and computer telephony applications.
Speech Systems, Inc. | www.speechsys.com/ | Speech-recognition products, services, and technology.
Talk Technology, Inc. | www.usbusiness.com/talk/ | Provides speech- and voice-recognition systems to doctors, lawyers, and people with repetitive strain injury. Wide range of software, including DragonDictate.
Toolz 2000 | www.earthlink.net/~webwizard/toolz.html | Speech-recording software and pro audio sampler/hard disk recording software for Macintosh.
UltraMedia Systems International | www.infi.net/~ums/ | Voice Recognition IBM VoiceType Dictation Technology.
Verbex Voice Systems | www.txdirect.net/verbex/ | Download a freeware working demonstration of the continuous-speech voice-recognition system.
Voice Processing Corporation | www.vpro.com/ | A speech-recognition company that provides voice engines for speech-enabling telephony and desktop applications.

Note
A more complete listing of these links can also be found in the WEBLIST table of the CDGLISTS.MDB database on the companion CD-ROM. You can use the database table to search for specific vendors, products, and so on.

Other Online Resources

The following is a list of SAPI-related topics that you can browse on CompuServe:

Forum or Topic Name
WinCim Add-on - VOICE-MAIL
Disabilities Forum
IBM PSP A Product Forum
IBM VoiceType Forum
Windows Third-Party A Forum
Windows Third-Party App H Forum
Windows Extensions Forum

An extensive list of topics for The Microsoft Network can be found in the SAPIMSN folder on the CD-ROM. This contains a list of MSN shortcuts. If you are an MSN subscriber, you can click these icons to connect directly to the topic area on MSN.

Software and Hardware Resources

Table B.3 lists software and hardware vendors who are currently providing, or have pledged to provide, SAPI-compliant software and/or hardware products. A more complete list of vendors, including contact names, addresses, and phone numbers, can be found in the PRODUCTS table of the CDGLISTS.MDB database on the companion CD-ROM. You can use the data table to perform searches for selected products or vendors.

In the table there are three types of entries, identified in the Type column as SR (speech recognition), TTS (text-to-speech), and API.

Table B.3. SAPI-related software and hardware.
Type | Company | Products
SR | Advanced Recognition Technologies, Inc. | smARTspeak for Windows, a compact speech-recognition engine suitable for command and control applications. Typical applications will be navigation of menus, phone and/or fax dialing, educational software, interactive games, and multimedia titles.
SR | AT&T | WATSON: Advanced Speech Applications Platform. Available for Beta release in 4Q95, WATSON incorporates AT&T-patented BLASR Speech Recognition and FlexTalk Speech Synthesis technologies in a software product running under Microsoft Windows 95.
SR | Cambridge Group Research, Ltd. | Cambridge Voice for Windows. The Cambridge Voice for Windows speech-recognition engine supports true speaker-independent recognition of continuous speech in real time. The engine does not require any speaker training.
SR | IBM Corporation | Source for IBM VoiceType technology information.
SR | Kurzweil Applied Intelligence, Inc. | Kurzweil VOICE for Windows Release 1.5 is the latest version of Kurzweil AI's award-winning voice-recognition system for Microsoft Windows.
SR | Lernout & Hauspie Speech Products | ASR SDK for the L&H.asr1000M (computer/multimedia) and the L&H.1000T (telephony/telecommunications). L&H SDK for Automatic Speech Recognition is a speaker-independent recognizer that recognizes natural and fluently spoken words.
SR | PureSpeech, Inc. | PureSpeech 2.2 Recognizer. The PureSpeech engine permits speaker-independent, continuous speech recognition. No user training is required. Both microphones and recognition over telephone lines are supported.
SR | Speech Systems, Inc. | VoiceMatch System Development Kit (SDK) SpeechWizard. The VoiceMatch SDK is a suite of development tools for creating speech-aware applications in Microsoft Windows.
SR | Verbex Voice Systems, Inc. | VISE, FlexVISE. The Verbex Integrated Speech Engine (VISE) is a high-performance, speaker-independent, speaker-trainable, continuous speech-recognition engine widely used in industrial, home, and office environments.
SR | Voice Control Systems, Inc. |
SR | Voice Processing Corporation |
TTS | AT&T | WATSON: Advanced Speech Applications Platform. Available for Beta release in 4Q95, WATSON incorporates AT&T-patented BLASR speech recognition and FlexTalk speech synthesis technologies in a software product running under Microsoft Windows 95.
TTS | Berkeley Speech Technologies, Inc. | BeSTspeech T-T-S is text-to-speech conversion software. It creates computer-synthesized speech output in a wide variety of languages.
TTS | Cambridge Group Research, Ltd. | The Cambridge Voice for Windows text-to-speech engine provides synthetic speech with realistic, natural intonation. The engine uses linguistic rules and sentence analysis.
TTS | Centigram Communications Corporation | TruVoice Text-to-Speech Converter is a premium-quality text-to-speech product that converts any written text into natural-sounding, intelligible, spoken American English, German, Spanish, Italian, or French.
TTS | Digital Equipment Corporation | DECtalk SDK for Windows 95. The DECtalk text-to-speech engine for Microsoft® Windows 95 provides highly intelligible, natural-sounding, synthesized speech from freeform text input.
TTS | FirstByte | ProVoice Developers Kit for WIN 32. FirstByte, the OEM sales leader in text-to-speech software, offers development kits for the Microsoft Win32 Application Programming Interface (API), Microsoft Windows 3.1, Macintosh, OS/2, and embedded systems.
TTS | Lernout & Hauspie Speech Products | TTS SDK for telephony and desktop applications. L&H Text-To-Speech SDK is a multiple-language software-development package that converts any computer-readable text into natural-sounding synthetic speech.
TTS | SoftVoice, Inc. | SoftVoice, the developers of Apple Computer's original MacinTalk speech synthesizer and the Amiga's Narrator device, has developed a Windows version of its state-of-the-art, multilingual system.
TTS | Telefonica I+D | Spanish TTS engine for Windows 95. The Telefonica I+D text-to-speech engine for Windows 95 produces Spanish synthetic speech from unrestricted text. The synthetic output is of very high quality: highly intelligible and very natural.
TTS | Telia Promotor AB | Infovox 230: state-of-the-art text-to-speech conversion software for Microsoft Windows 95. Available for British English, American English, German, French, Spanish, Italian, Norwegian, Swedish, Danish, Finnish, and Icelandic.
API | AT&T Microelectronics |
API | Berkeley Speech Technologies, Inc. |
API | Centigram Communications Corporation |
API | Creative Technology Ltd. |
API | Digital Equipment Corporation |
API | Dragon Systems, Inc. |
API | Eloquent Technology, Inc. |
API | First Byte |
API | Kolvox Communications |
API | Kurzweil Applied Intelligence, Inc. |
API | Lernout & Hauspie Speech Products |
API | Scott Instruments Corporation |
API | Speech Systems, Inc. |
API | S Systems, Inc. |
API | Telia Promotor Infovox AB |
API | Verbex Voice Systems |
API | Voice Processing Corporation |