Rumor is rampant that Google is feverishly working on releasing its answer to Siri in the next couple of months — if not indeed in the next couple of weeks! The project, if not the product itself, is called Majel (as in Mabel), named after Majel Barrett-Roddenberry, the name of the actress behind the voice of the Federation Computer from Star Trek.
This is exciting news, especially for a voice assistant enthusiast like myself. Indeed, having several assistants (let’s not forget Microsoft) get into a Rockem-Sockem kind of a match will only strengthen everyone involved, and can only spell good news for the user base. But a few remarks are in order.
First, let’s all pause to remember — before it is erased from our collective memory — that Google’s Andy Rubin, the man in charge of Android’s development, famously said that smartphones should not be perceived as assistants, but rather simply as tools for communicating with OTHER PEOPLE (my loud caps):
Your phone is a tool for communicating. You shouldn’t be communicating with the phone; you should be communicating with somebody on the other side of the phone.
Clearly, the man was totally clueless and didn’t know what he was talking about, and Siri completely blindsided him and Google. And now, he and Google are eating their words and feverishly working on developing nothing less than an assistant that will help Android users to talk to their phones!
However, according Matias Duarte, Senior Director, Android User Experience at Google, Google is pursuing a different tack from Apple:
Out approach is different. The metaphor I like to take is – if it’s Star Wars, you have these robot personalities like C-3PO who runs around and he tries to do stuff for you, messes up and makes jokes, he’s kind of a comic relief guy. Our approach is more like Star Trek, right, starship Enterprise; every piece of computing surface, everything is voice-aware. It’s not that there’s a personality, it doesn’t have a name, it’s just “Computer.” And you can talk to it and you can touch it, you can interact with it at the same time as you talk with it. It’s just another way to interface with the computer.
Interesting, but probably telling in how Google, even as it dashes to give birth to Siri’s rival, still doesn’t fully get it: You cannot take the human out of human language. People cannot speak to a machine and listen to a machine without injecting into the interaction all of the rules of conversation that they have learned — because the only context within which they have learned those rules is the context of talking to another human being. Apple’s breakthrough move was to give Siri a voice, so that you can hold a conversation with your assistant. Google still thinks that it’s about the discrete actions and tellingly, thinks that voice interaction “is just another way to interface with the computer.” It really is not. It’s a special interaction because it borrows a medium that is at the core of what makes us human. All this time, with typing, and clicking, and swiping, and tapping, we have been using a language that we had to learn from scratch and that was dictated to us by the machine and the people who built those machines. In the case of human, natural language, it is radically the other way around: the computer is learning our language and how we use that language, and will have to adapt to it or fail.
Two interesting things to watch for as Majel emerges are: (1) What type of Natural Language will Majel use? and (2) What types of API enhancements will they provide to the developer base? On the first, the recent purchase by Google of Clever Sense, the company behind Alfred, a personalized restaurant and bars recommendations app, is a strong indication that Google is retracing Apple’s exact steps, except two years later. The engine behind Alfred will probably be broadened to provide assistance on several fronts, just as Siri’s was broadened to handle Calendar, SMS, etc. On the second — the APIs — I expect that Google will have a leg up over Apple, since Google already has published hooks to the Speech engine and the Text to Speech, and will only have to figure out how to expose its Natural Language processing to empower developers. If it does, it would be Apple’s turn to play catch up….
Last thing — the obvious: this is not the first time that Google will be playing catch up. It played catch up by releasing its Android phone a couple years after Apple released the iPhone; it is trying to play catch up to Facebook with its Google+ initiative; and now it is playing catch up with Majel. Indeed, one could say that Google is now the new Microsoft: a company so blinded by its business model that it can’t see beyond the next step, and so devoid of the capacity to generate its own strategic thinking that its “strategy” is now reduced to sitting back and letting the innovators innovate and then, thanks to the deep pockets, step in and start building stuff. Which, let us all note, is something that Microsoft has gotten real good at doing very smoothly and without a hint of embarrassment. Just watch how Microsoft will soon come out and reveal the name of its assistant, and not feel one bit sheepish about doing so.