Platform & tool to integrate TTS and/or ASR in applications

Main Features


MMBerry is a framework for developing multimodal applications rapidly, without needing to know the low-level details of the underlying voice technologies. 

MMBerry allows multimodal applications to be developed independently of the underlying TTS and ASR engines. 

Scheme


With MMBerry, a classic application can be turned into a multimodal application by writing just a few lines of code. 

MMBerry consists of a set of DLLs and header files (.h). The DLLs contain the implementation of all the logic of the voice technologies used, together with the Vui Manager. The header file exposes the Vui Manager's interaction interface to the programmer. 

The library is structured so as to hide all implementation details from the end user.
The header file exposes only standard data types and makes no reference to low-level objects. 
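
The idea of a header that shows only standard data types can be illustrated with a minimal C++ sketch. This is not the MMBerry header: the class name `VuiSession` and every method on it are hypothetical, chosen only to show how an opaque implementation pointer keeps engine objects out of the public interface. The "engine" here is a stub that records text and replays a simulated utterance.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// --- What a public header could look like (all names are hypothetical) ---
// Only standard types appear here; engine objects never leak out.
class VuiSession {
public:
    using ResultHandler =
        std::function<void(const std::string& text, double confidence)>;

    VuiSession();
    ~VuiSession();

    void speak(const std::string& text);        // hand text to the TTS side
    void onResult(ResultHandler handler);       // register a recognition callback
    // Stand-in for a real ASR engine delivering a recognition result.
    void simulateUtterance(const std::string& text, double confidence);
    std::vector<std::string> spokenLog() const; // texts passed to speak() so far

private:
    struct Impl;                 // defined only on the implementation side
    std::unique_ptr<Impl> impl_;
};

// --- What would live inside the DLL (stubbed for this sketch) ---
struct VuiSession::Impl {
    ResultHandler handler;
    std::vector<std::string> spoken;
};

VuiSession::VuiSession() : impl_(std::make_unique<Impl>()) {}
VuiSession::~VuiSession() = default;

void VuiSession::speak(const std::string& text) {
    impl_->spoken.push_back(text);
}

void VuiSession::onResult(ResultHandler handler) {
    impl_->handler = std::move(handler);
}

void VuiSession::simulateUtterance(const std::string& text, double confidence) {
    assert(confidence >= 0.0 && confidence <= 1.0); // confidence is a 0..1 score here
    if (impl_->handler) impl_->handler(text, confidence);
}

std::vector<std::string> VuiSession::spokenLog() const {
    return impl_->spoken;
}
```

An application using such an interface would create a `VuiSession`, register a callback with `onResult`, and call `speak`; it would only ever handle `std::string` and `double` values, while the engine-specific code could change behind `Impl` without the application being recompiled against new types.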

Thanks to MMBerry, multimodal applications can be written in C++, Java and .NET: complete implementations of MMBerry exist for both Java and the .NET environment.


MMBerry is made up of: 


1. A core library, made up of the following parts:

    a. Voice Command Gateway: for writing multimodal applications in C++, including:

               - voice recognition;

               - speech synthesis;

               - log file generation;

               - catching events from the TTS engine;

               - catching events from the ASR engine (match, no match, no input, timeout).

 

    b. JNI interface for writing Java applications with MMBerry in native mode.

    c. .NET interface for writing applications in any .NET Framework language.

    d. VAD (Voice Activity Detection) to optimise the use of the ASR engine in continuous mode without burdening the recogniser itself.

    e. Audio Dump: records the audio buffer that is sent to the ASR. This is especially recommended for debugging and application tuning.

    f. Advanced logging: highlights every single event within the library and, above all, gives control over the recognition events (grammar loaded, utterance, recognition, confidence, etc.).


2. An interface library to the ASR and Speaker Verification engine (Berry ASR.dll).


3. An interface library to the TTS engine (Berry TTS.dll).
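
To illustrate how a VAD stage (item 1.d) can spare the recogniser in continuous mode, here is a minimal C++ sketch of a simple energy-threshold gate. It is an assumption for illustration only: the text does not say which detection method MMBerry uses, and the function names `frameRms` and `selectSpeechFrames`, the 16-bit PCM frame format and the threshold parameter are all hypothetical. Only frames whose RMS energy reaches the threshold would be forwarded to the ASR engine.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Root-mean-square energy of one frame of 16-bit PCM samples.
// An empty frame is treated as silence (energy 0).
double frameRms(const std::vector<std::int16_t>& frame) {
    if (frame.empty()) return 0.0;
    double sumSquares = 0.0;
    for (std::int16_t sample : frame) {
        sumSquares += static_cast<double>(sample) * static_cast<double>(sample);
    }
    return std::sqrt(sumSquares / static_cast<double>(frame.size()));
}

// Returns the indices of the frames whose RMS energy reaches the threshold.
// In a real pipeline only these frames would be passed on to the recogniser.
std::vector<std::size_t> selectSpeechFrames(
        const std::vector<std::vector<std::int16_t>>& frames,
        double rmsThreshold) {
    assert(rmsThreshold >= 0.0); // a negative energy threshold makes no sense
    std::vector<std::size_t> speechFrames;
    for (std::size_t i = 0; i < frames.size(); ++i) {
        if (frameRms(frames[i]) >= rmsThreshold) {
            speechFrames.push_back(i);
        }
    }
    return speechFrames;
}
```

A plain energy gate like this is cheap but sensitive to background noise; production detectors usually add smoothing over several frames or an adaptive threshold, which this sketch deliberately leaves out.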




    Copyright © 2008 ByteWay Srl