Monday, December 23, 2013

Developing Android Applications with Voice Recognition Features

Developing Android Applications with Voice Recognition Features


Android can’t recognize speech, so a typical Android device cannot recognize speech either.
 
Or, is there a way it can?  The easiest way is to ask another application to do the recognition for us. Asking another application to do something in Android is called using intents.
 
Our target device must have at least one application that can process the Intent for speech recognition, which is called by theRecognizerIntent.ACTION_RECOGNIZE_SPEECH action.
 
One such app is Google Voice Search. It is one of the best recognizers available for Android and supports a lot of languages. This service requires Internet connection because the voice recognition occurs on Google servers. This app has a very simple Activity that informs users they can speak. The moment the user stops talking, the dialog is closed and our application (intent caller) receives an array of strings with the recognized speech.
 
A voice recognition sample
Let’s write a little sample app that demonstrates using voice search in applications.
Our application needs to do these things:
  • Receive a request for voice recognition
  • Check the availability of application for speech recognizing
  • If speech recognizing is available, then call the intent for it and receive the results
  • If speech recognizing is not available, then show the dialog for installing Google Voice Search and redirect the user to Google Play, if he wants
First, we create a class that implements the logic for speech recognition. Call this class SpeechRecognitionHelper where we declare a static, public function run() that will receive a request for launching a recognition:
01/**
02 * A helper class for speech recognition
03 */
04public class SpeechRecognitionHelper {
05 
06/**
07     * Running the recognition process. Checks availability of recognition Activity,
08     * If Activity is absent, send user to Google Play to install Google Voice Search.
09    * If Activity is available, send Intent for running.
10     *
11     * @param callingActivity = Activity, that initializing recognition process
12     */
13    public static void run(Activity callingActivity) {
14        // check if there is recognition Activity
15        if (isSpeechRecognitionActivityPresented(callingActivity) == true) {
16            // if yes – running recognition
17            startRecognition(callingActivity);
18        else {
19            // if no, then showing notification to install Voice Search
20            Toast.makeText(callingActivity, "In order to activate speech recognition you must install "Google Voice Search"", Toast.LENGTH_LONG).show();
21            // start installing process
22            installGoogleVoiceSearch(callingActivity);
23        }
24    }
25}
As you can see, besides the run() function we need to implement three other functions:
  • isSpeechRecognitionActivityPresented – checks if the speech recognition application is present on the system
  • installGoogleVoiceSearch – initializes the Google Voice Search installation process
  • startRecognition – prepares the appropriate Intent and runs the recognition
To check if the device has an application for speech recognition, we can use the queryIntentActivities method in class PackageManager. This method gives a list of activities that can process the specified Intent. To receive an instance of the PackageManager, we can use getPackageManager.
Our code is shown below:
isSpeechRecognitionActivityPresented
01/**
02     * Checks availability of speech recognizing Activity
03     *
04     * @param callerActivity – Activity that called the checking
05     * @return true – if Activity there available, false – if Activity is absent
06     */
07    private static boolean isSpeechRecognitionActivityPresented(Activity callerActivity) {
08        try {
09            // getting an instance of package manager
10            PackageManager pm = callerActivity.getPackageManager();
11            // a list of activities, which can process speech recognition Intent
12            List activities = pm.queryIntentActivities(newIntent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH), 0);
13 
14            if (activities.size() != 0) {    // if list not empty
15                return true;                // then we can recognize the speech
16            }
17        catch (Exception e) {
18 
19        }
20 
21        return false// we have no activities to recognize the speech
22    }
Now implement the startRecognition function. This function will form the appropriate Intent for launching the speech recognition Activity. You can find detailed information for how to do it on documentation page.
Source code:
01   /**
02     * Send an Intent with request on speech
03     * @param callerActivity  - Activity, that initiated a request
04     */
05    private static void startRecognitionActivity(Activity callerActivity) {
06 
07        // creating an Intent with “RecognizerIntent.ACTION_RECOGNIZE_SPEECH” action
08        Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
09 
10        // giving additional parameters:
11        intent.putExtra(RecognizerIntent.EXTRA_PROMPT, "Select an application");    // user hint
12        intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, RecognizerIntent.LANGUAGE_MODEL_WEB_SEARCH);    // setting recognition model, optimized for short phrases – search queries
13        intent.putExtra(RecognizerIntent.EXTRA_MAX_RESULTS, 1);    // quantity of results we want to receive
14//choosing only 1st -  the most relevant
15 
16        // start Activity ant waiting the result
17        ownerActivity.startActivityForResult(intent, SystemData.VOICE_RECOGNITION_REQUEST_CODE);
18    }
And last, we’ll implement the installGoogleVoiceSearch. This function will show the dialog, asking the user if he wants to install Google Voice Search and send him to Google Play, if he does.
01/**
02     * Asking the permission for installing Google Voice Search.
03     * If permission granted – sent user to Google Play
04     * @param callerActivity – Activity, that initialized installing
05     */
06    private static void installGoogleVoiceSearch(final Activity ownerActivity) {
07 
08        // creating a dialog asking user if he want
09        // to install the Voice Search
10        Dialog dialog = new AlertDialog.Builder(ownerActivity)
11            .setMessage("For recognition it’s necessary to install "Google Voice Search"")    // dialog message
12            .setTitle("Install Voice Search from Google Play?")    // dialog header
13            .setPositiveButton("Install"new DialogInterface.OnClickListener() {    // confirm button
14 
15                // Install Button click handler
16                @Override
17                public void onClick(DialogInterface dialog, int which) {
18                    try {
19                        // creating an Intent for opening applications page in Google Play
20                        // Voice Search package name: com.google.android.voicesearch
21                        Intent intent = new Intent(Intent.ACTION_VIEW, Uri.parse("market://details?id=com.google.android.voicesearch"));
22                        // setting flags to avoid going in application history (Activity call stack)
23                        intent.setFlags(Intent.FLAG_ACTIVITY_NO_HISTORY | Intent.FLAG_ACTIVITY_CLEAR_WHEN_TASK_RESET);
24                        // sending an Intent
25                        ownerActivity.startActivity(intent);
26                     catch (Exception ex) {
27                         // if something going wrong
28                         // doing nothing
29                     }
30                }})
31 
32            .setNegativeButton("Cancel", null)    // cancel button
33            .create();
34 
35        dialog.show();    // showing dialog
36    }
That’s about it. We run the speech recognition Activity. Then request the user’s permission to install Voice Search and send him to Google Play if he consents. One thing we still need to do and that is gather the voice recognition results.
We send a request using the startActivityForResult function to gather results of the launched Activity. We also need to redefine a OnActivityResultmethod in our intent caller Activity. This can be done this way:
01// Activity Results handler
02    @Override
03    public void onActivityResult(int requestCode, int resultCode, Intent data) {
04 
05        // if it’s speech recognition results
06        // and process finished ok
07        if (requestCode == SystemData.VOICE_RECOGNITION_REQUEST_CODE && resultCode == RESULT_OK) {
08 
09            // receiving a result in string array
10            // there can be some strings because sometimes speech recognizing inaccurate
11            // more relevant results in the beginning of the list
12            ArrayList matches = data.getStringArrayListExtra(RecognizerIntent.EXTRA_RESULTS);
13 
14            // in “matches” array we holding a results... let’s show the most relevant
15            if (matches.size() > 0) Toast.makeText(this, matches.get(0), Toast.LENGTH_LONG).show();
16        }
17 
18        super.onActivityResult(requestCode, resultCode, data);
19    }
Now we’re ready
The created class SpeechRecognitionHelper allows us to perform a speech recognition request by calling only one function run().
All that is needed for adding a recognition feature is to add this class in our project and call the run function in needed place. And then implement processing text results by redefining the onActivityResult method for the Activity that initiated the recognition call.
 
For additional information you can look at the Android Developers website. Here, you’ll find good examples showing how to do voice recognition, and importantly, how to get the available language list. You will need this list if you want to recognize a language other than the user’s default locale.
 
For fast integration of voice input in to your app, you can download and use this code for the SpeechRecognitionHelper class. 
 
For more information please visit: http://intel.ly/17Zd51G

No comments:

Post a Comment