RSC-4128
Speech Recognition Processor
Data Sheet
2006 Sensory Inc.
P/N 80-0206-R
General Description
The RSC-4128 represents Sensory's next generation
speech and analog I/O mixed signal processor. The
RSC-4128 is designed to bring high performance
speech I/O features to cost sensitive embedded and
consumer products. Based on an 8-bit
microcontroller, the RSC-4128 integrates speech-
optimized digital and analog processing blocks into a
single chip solution capable of accurate speech
recognition; high quality, low data-rate compressed
speech; and advanced music. Products can use one
or all features in a single application.

The RSC-4128 operates in tandem with the radically
new FluentChip™ technology, offering the best
speech recognition technologies in the industry.
FluentChip™ includes Hidden Markov Model-Neural
Net hybrid speech recognition. Accuracy in all kinds
of noise is dramatically improved. New Speaker
Verification technology is perfect for voice password
security applications that must work in noisy
environments. New high quality compressed speech
technology reduces data rates by 5 times. New 8-
voice MIDI-compatible music includes drum tracks,
effectively increasing instruments beyond 8.
Simultaneous music and speech round out the
FluentChip™ technology.

FluentChip™ technology tools also support the
revolutionary capability of creating speaker
independent recognition sets by simply typing in the
desired recognition vocabulary! A few keystrokes
create a recognition set in seconds without the wait
or cost of recording sessions to train the recognizer,
speeding time to sales.

A new and unique Audio Wakeup feature listens
while the RSC-4128 is in power down mode. When
an audio event such as a clap or whistle occurs,
Audio Wakeup wakes up the RSC-4128 for speech
or application tasks. Audio Wakeup is perfect for
battery applications that require continuous listening
and long battery life.

In addition to best-in-class recognition performance,
the RSC-4128 provides further on-chip integration of
features. A complete speech I/O application can be
built with as few additional parts as a clock crystal,
speaker, microphone, and a few resistors and
capacitors.


Moreover, the RSC-4128 provides an unprecedented
level of cost effective system-on-chip (SOC)
integration, enabling many applications that require
DSP and/or audio processing. The RSC-4128 may
be used as a general-purpose mixed signal
processor platform for custom algorithms,
technologies and applications.
Features
Full Range of FluentChip™ Capabilities
- Noise-robust Speaker Independent and Speaker Dependent recognition
- Many languages now available for international use
- Speaker Verification voice password biometric security
- Word Spotting and Continuous Listening recognition options
- High quality, 2.4-7.8 kbps speech synthesis & sound effects, with Sensory SX™ synthesis technology
- 8 voice MIDI-compatible music synthesis coincident with speech; drum track feature enables additional voices
- Voice Record & Playback (voice memo)
- Audio Wake Up from sleep with whistles or claps
- Touch Tone (DTMF) output
Integrated Single-Chip Solution
- 8-bit microcontroller
- 128K bytes ROM
- 16 bit ADC, 10 bit DAC & PWM, and microphone pre-amplifier
- Independent, programmable Digital Filter engine
- 4.8 KBytes total RAM (262 bytes "user" application RAM)
- Five timers (3 GP, 1 Watchdog, 1 Multi Tasking)
- Twin-DMA, Vector Math accelerator, and Multiplier
- Built-in Analog Comparator Unit (4 inputs)
- External memory bus: 20-bit Address (1 Mbyte), 8-bit Data
- On chip storage for SD, SV, templates (10 templates)
- 24 configurable I/O lines with 10 mA (typical) outputs
- Code security - no ROM dump capability
- Uses low cost 3.58 MHz crystal (internal PLL)
- Low EMI design for FCC and CE requirements
- Fully nested interrupt structure with up to 8 sources
- Optional Real Time Clock
Long Battery Life
- 2.4 - 3.6 V operation
- 12 mA (typical) operating current at 3 V
- 2 low power modes; 1 µA typical sleep current
Full Suite of Quick & Powerful Tools
- Quick Text-to-SI (T2SI™) text entry to build noise robust SI recognition sets: low cost & push-button, no recording!
- QuickSynthesis™ 4 for push-button speech compression
- Integrated Development Environment, C Compiler, Debugger & In Circuit Emulator from Phyton, Inc.
Table of Contents
General Description
RSC-4128 Overview
Speech Technologies
    Speech Recognition
    Speech and Music Synthesis
    Record and Playback
RSC-4128 Architecture
Reference Schematics
Using the RSC-4128
    Instruction Set
    Flags
    Stack
    Register and User RAM
    L1 Vector Accelerator/Multiplier
    Digital Filter
    Power and Wakeup Control
    General Purpose I/O
    Memory Addressing
    Wait States
    On-Chip ROM
    Oscillators
    Clocks
    Timers/Counters
    Interrupts
    Analog Input
    Audio Wakeup
    Microphones
    Reset
    Digital-to-Analog-Converter (DAC) Output
    Pulse Width Modulator (PWM) Analog Output
    Comparator Unit
Instruction Set Opcodes and Timing Details
    MOVE Group Instructions
    ROTATE Group Instructions
    BRANCH Group Instructions
    ARITHMETIC/LOGICAL Group Instructions
    MISCELLANEOUS Group Instructions
Special Functions Registers (SFRs) Summary
DC Characteristics
A.C. Characteristics (External memory accesses)
Timing Diagrams
Absolute Maximum Ratings
Package Options
Die Pad Ring
RSC-4128 Die Bonding Pad Locations
Mechanical Data
RSC-4128 Overview
The RSC-4128 is a member of the Interactive Speech™ line of products from Sensory. It features a high-
performance 8-bit microcontroller with on-chip ADC, DAC, preamplifier, RAM, ROM, and optimized audio
processing blocks. The RSC-4128 is designed to bring a high degree of integration and versatility into low-cost,
power-sensitive applications. Various functional units have been integrated onto the CPU core in order to reduce
total system cost and increase system reliability.

The RSC-4128 operates in tandem with FluentChip™ firmware, an ultra-compact suite of recognition and synthesis
technologies. This reduced software footprint enables, for example, products with 3 minutes of compressed
speech, multiple speaker-dependent and speaker-independent vocabularies, speaker verification, and all
application code built into the RSC-4128 as a single-chip solution. Revolutionary Text-to-Speaker-Independent
(T2SI™) technology allows the creation of SI recognition sets by simply entering text.

The CPU core embedded in the RSC-4128 is an 8-bit, variable-length-instruction microcontroller. The instruction
set is similar to the 8051 microcontroller, and has a variety of addressing modes, MOV, and 16-bit instructions. The
RSC-4128 processor avoids the limitations of dedicated A, B, and DPTR registers by having completely
symmetrical sources and destinations for all instructions.

The RSC-4128 provides a high level of on-chip features and special DSP engines, providing a very cost effective
mixed signal platform for general-purpose applications and development of custom algorithms. The full suite of
industry standard tools for easy product development makes the RSC-4128 an ideal platform for consumer
electronics.
Speech Technologies
Speech Recognition
The RSC-4128 is designed to operate in tandem with the FluentChip™ technology library, including speaker
independent (SI), speaker dependent (SD), and speaker verification (SV) speech recognition. Combinations of
these technologies may be used to create applications that are rich in features. These are described below:
- Speaker Independent recognition requires no user training. The RSC-4128 can recognize up to 30 words in an
  active set (number of sets is limited only by internal ROM size). Text-to-SI (T2SI™), based on a hybrid of Hidden
  Markov Modeling and Neural Net technologies, allows creation of accurate SI recognition sets in seconds. SI
  requires on-chip ROM or off-chip parallel bus ROM, EPROM, or Flash to store the words to be recognized.
- Speaker Dependent recognition allows the user to create names for products or customize recognition sets. SD
  is implemented with DTW (dynamic time warping) pattern matching technology. SD requires programmable
  memory to store the personalized speech templates (trained patterns); this may be on-chip SRAM, or off-chip
  serial or parallel bus EEPROM, Flash Memory, or SRAM. Up to 100 templates can be recognized in an active
  set (the number of unique sets is limited only by programmable memory capacity). The RSC-4128 can store up
  to 7 SD templates in on-chip SRAM.
- Speaker Verification enables the RSC-4128 to authenticate when a previously trained password is spoken by
  the target user. SV is also implemented with DTW technology. 5 SV templates can be stored in on-chip SRAM,
  or more with external programmable memory such as that described for SD above.
- Word Spotting enables the RSC-4128 to spot a specific word surrounded by other speech within a phrase. This
  can be quite effective when the user's response may vary (e.g. spotting "telephone" in the phrases "ummm
  telephone", or "telephone call"). This option is available for SI and SD.
- Continuous Listening allows the chip to continuously listen for a specific word. This may be used as a trigger
  word to request a device to listen for a command. This option is available for SI and SD.
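SD and SV are described above as DTW pattern matching. As a rough illustration of that general technique (not Sensory's implementation, whose features, distance measure and path constraints are not described here), the following Python sketch computes a textbook DTW cost between two sequences of feature vectors and picks the closest stored template. All function names and the toy one-dimensional features are assumptions made for the example.

```python
import math


def dtw_distance(a, b):
    """Return the cumulative DTW cost of aligning sequences a and b.

    a and b are lists of equal-dimension feature vectors (tuples of floats).
    This is the unconstrained textbook recurrence, used here for illustration.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the best cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            frame_cost = math.dist(a[i - 1], b[j - 1])  # Euclidean distance
            cost[i][j] = frame_cost + min(
                cost[i - 1][j],      # advance in a only (b frame is reused)
                cost[i][j - 1],      # advance in b only (a frame is reused)
                cost[i - 1][j - 1],  # advance in both sequences
            )
    return cost[n][m]


def best_template(utterance, templates):
    """Return the name of the stored template with the lowest DTW cost."""
    return min(templates, key=lambda name: dtw_distance(utterance, templates[name]))


if __name__ == "__main__":
    # Hypothetical trained templates with one feature per frame.
    templates = {
        "lights": [(0.0,), (1.0,), (2.0,), (3.0,)],
        "radio": [(3.0,), (2.0,), (1.0,), (0.0,)],
    }
    # The same pattern as "lights", spoken more slowly.
    utterance = [(0.0,), (0.0,), (1.0,), (2.0,), (2.0,), (3.0,)]
    print(best_template(utterance, templates))
```

Because the warping path may reuse frames, a slower or faster rendition of a trained word can still align closely with its template, which is the property that makes DTW suitable for user-trained vocabularies.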

Speech and Music Synthesis
The RSC-4128 provides high-quality speech compression using Sensory SX™ technology. One may select
various data rates from approximately 2.4 to 10.8 Kbps to manage speech quality versus allotted memory. The
highest data rates use 16 kHz sample rates to provide high quality reproduction of high pitched voices. Speech and
sound effects may also be compressed using 8-bit PCM (64 Kbps) or 4-bit ADPCM (32 Kbps) technologies.
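
To make the quality-versus-memory trade-off concrete, the following Python sketch estimates the storage needed for a given duration at the data rates quoted above. It assumes that 1 Kbps means 1000 bits per second and ignores any header or framing overhead; the function and table names are illustrative and are not part of any Sensory tool.

```python
# Data rates quoted in the text, in Kbps. The SX figures are approximate.
RATES_KBPS = {
    "SX lowest (approx.)": 2.4,
    "SX highest (approx.)": 10.8,
    "4-bit ADPCM": 32,
    "8-bit PCM": 64,
}


def storage_bytes(seconds, kbps):
    """Bytes needed for `seconds` of audio at `kbps` (assumes 1 Kbps = 1000 bit/s)."""
    return seconds * kbps * 1000 / 8


def storage_table(seconds):
    """Map each quoted data rate to the bytes needed for `seconds` of audio."""
    return {name: storage_bytes(seconds, rate) for name, rate in RATES_KBPS.items()}


if __name__ == "__main__":
    for name, size in storage_table(60).items():
        print(f"{name:22s} {size / 1024:8.1f} KBytes per minute")
```

Under these assumptions, one minute of 8-bit PCM needs 480,000 bytes, while one minute at the lowest SX rate needs about 18,000 bytes.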

The RSC-4128 also provides high-quality, eight-voice, wave table music synthesis which allows multiple,
simultaneous instruments for harmonizing. The RSC-4128 uses a MIDI-like system to generate music. One or
more of the eight voices may be speech playback instead of music. One or more of the eight voices may be a drum
track comprising multiple drums. In effect, drum tracks allow the number of simultaneous instruments to exceed 8.

Speech and Music data may be stored in on-chip ROM or off-chip parallel bus ROM, EPROM or Flash. Speech
data may alternatively be stored in off-chip serial data ROM or serial data Flash for extended durations.

Easy-to-use tools allow developers to record their own voice talent and compress the recordings with the push of a
button, or to create their own MIDI scores and instruments.

Record and Playback
The RSC-4128 can perform speech record and playback (sometimes called "voice memo") using either 8 bits
(64Kbps) or 4 bits (32Kbps) per sample, depending on the quantity and quality of playback desired. The record and
playback technology also optionally performs silence removal to reduce memory requirements.

External parallel or serial bus Flash or SRAM is required to store the compressed speech.
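
As a rough sizing aid, the Python sketch below estimates how much recording fits in a given amount of external memory. The 8 kHz sample rate is an inference from the quoted figures (64 Kbps at 8 bits per sample, taking 1 Kbps as 1000 bits per second) rather than a stated specification, and silence removal is not modeled, so real capacity may be higher. The function name is hypothetical.

```python
def record_seconds(capacity_bytes, bits_per_sample, sample_rate_hz=8000):
    """Seconds of recording that fit in capacity_bytes, without silence removal.

    sample_rate_hz defaults to 8000, inferred from 64 Kbps / 8 bits per sample.
    """
    if bits_per_sample not in (4, 8):
        raise ValueError("the text describes only 4-bit and 8-bit recording")
    bytes_per_second = sample_rate_hz * bits_per_sample / 8
    return capacity_bytes / bytes_per_second


if __name__ == "__main__":
    one_mbyte = 2 ** 20  # size of the 20-bit external address space
    for bits in (8, 4):
        print(f"{bits}-bit: {record_seconds(one_mbyte, bits):.1f} s per Mbyte")
```

Halving the sample width doubles the recording time, which is the quantity-versus-quality choice described above.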

RSC-4128 Architecture
The RSC-4128 is a highly integrated speech and analog I/O mixed signal processor that combines:
- 8-bit microcontroller with enhanced instructions and interrupt control, superior register architecture, independent
  Digital Filter engine and "L1" Vector Math Accelerator
- On-chip ROM and RAM (4.8 Kbytes), and the ability to address off-chip RAM, ROM, EPROM or Flash
- Input microphone preamp and 16 bit Analog-to-Digital Converter (ADC) for speech and audio/analog input
- 10 bit Digital-to-Analog Converter (DAC), and 10 bit Pulse Width Modulator (PWM) to directly drive a speaker or
  other analog device
- Low power Audio Wakeup from power down mode, when a selected audio event, such as a clap or whistle, occurs

The RSC-4128 has 20-bit address and 8-bit data buses for interfacing with external memory. It includes an
-XM input pin capable of enabling or disabling the internal ROM.

NOTE: Neither the -XM input pin nor the extended memory buses are available on 64-lead LQFP packaged versions
of the RSC-4128 with internal ROM. These are available on the die and 100-lead LQFP versions.

Three bi-directional ports provide 24 configurable, general-purpose I/O pins to communicate with or control
external devices with a variety of source and sink currents. Up to 4 of these I/O may be used as programmable
Analog Comparator inputs. 16 may be used as I/O wakeup.

The RSC-4128 has a high frequency (14.32 MHz) clock as well as a low frequency (32,768 Hz) clock. The
processor clock can be selected from either source, with a selectable divider value. The device performs
speech recognition when running at 14.32 MHz. The RSC-4128 also supports programmable wait states to allow
the use of slower memory.

OSC1 is a very low-cost 3.58 MHz crystal oscillator that is used by a 4X PLL to generate the 14.32 MHz
clock. The OSC2 oscillator provides the options of using an external crystal or its own internal RC devices (no
external components required for the internal RC mode).
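
The clock relationships described above can be checked with a short calculation. The Python sketch below uses the nominal 3.58 MHz figure from the text; the source labels "fast" and "slow" and the divider values in the usage example are hypothetical, since this section does not list the divider settings the device supports.

```python
OSC1_CRYSTAL_HZ = 3_580_000  # nominal crystal frequency quoted in the text
PLL_MULTIPLIER = 4           # the "4X PLL"
OSC2_HZ = 32_768             # low frequency clock


def processor_clock_hz(source, divider=1):
    """Return the processor clock for a source ("fast" or "slow") and divider.

    The source labels and any particular divider value are illustrative only.
    """
    if divider < 1:
        raise ValueError("divider must be at least 1")
    if source == "fast":
        base_hz = OSC1_CRYSTAL_HZ * PLL_MULTIPLIER
    elif source == "slow":
        base_hz = OSC2_HZ
    else:
        raise ValueError("source must be 'fast' or 'slow'")
    return base_hz / divider


if __name__ == "__main__":
    for source, divider in (("fast", 1), ("fast", 2), ("slow", 1)):
        hz = processor_clock_hz(source, divider)
        print(f"{source} / {divider}: {hz:,.0f} Hz")
```

With a divider of 1, the fast source gives the 14.32 MHz rate at which, according to the text, speech recognition is performed.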
RSC-4128 Internal Block Diagram