VocalFusion™ 4-Mic Kit for AVS

Part #: XK-VF3000-L33-AVS
Silicon on Board: XVF3000-TQ128-C

The VocalFusion™ 4-Mic Kit for Amazon AVS features a small four-microphone linear array, that enables developers and OEMs to add far-field voice capture to consumer electronics and IoT products. The linear design is optimised for integration into flat screens, white goods and kitchen equipment, and other consumer electronics.

The voice signals are captured by the microphone array even in noisy environments, enabling commands to be accurately processed and passed to the Alexa client running on a system processor.

The kit provides direct interfacing to four PDM (Pulse Density Modulation) microphones, with I2S interface (USB optional) to connect to application or host processors. Voice sources are isolated from unwanted noise with the integration of advance DSP techniques including beamforming, echo cancellation and noise suppression. A rich set of optimisation parameters are available to ensure that the best results are achieved for the individual acoustics of the end product. These parameters include adjustment to noise attenuation and gain control as well as numerous optimisations for echo cancellation.

VocalFusion AVS GENERAL FEATURES


VocalFusion XVF3000

- High performance smart microphone for voice interfaces
- Integrated microphone and voice DSP
- Integrated keyword detection
- Integrated USB 2.0 PHY for high and full-speed host and device operation
- 2048KB on-chip flash
- 128-pin TQFP package 0.4mm pitch

Audio output

- I2S output to DAC
- 48kHz PCM

Microphone array

- 4-mic linear array, with IFX mics
- Inter-mic spacing: 1033.33mm

Host processor interface options

- I2S interface to Raspberry Pi sub-system with I2C for control
- High speed USB2.0 compliant device

 

VocalFusion AVS PRE-PROCESSING DSP


Beamformer

- Adaptive de-reverberation beamformer
- Tracks the loudest voice
- Adaption time 100ms (typically)
- 180° coverage
- Up to 5m operating distance

Acoustic Echo Canceller

- Mono AEC with barge-in support
- Up to 50dB suppression
- Initial convergence <2s
- Adaption time 100ms (typically)

Noise Suppression

- Up to 15dB stationary noise suppression
- Up to 15dB diffuse noise suppression