The breakthroughs in natural language processing and machine translation brought by deep learning might enable us to build a trope of science-fiction books — a universal real-time translator that fits within the human ear. Geoff Hinton, one of the godfathers of deep learning and neural networks, explained how it could be done at the Association for the Advancement of Artificial Intelligence conference held in Austin, Texas, on Wednesday at the tail end of a talk he gave about the history and future of artificial intelligence.
He wasn’t clear in his timeline, although he did say that he only could only anticipate the future about five years out, so perhaps we’re closer than we think to this concept. Here’s how he explained it in his talk for a translation from English to French.
You start with recurrent neural networks, which excel at text analysis and natural language processing. Recurrent neural networks have been responsible for some of the significant improvements in language understanding, including the machine translation that powers Microsoft’s Skype Translate and Google’s word2vec libraries.
Essentially, for each language you have multiple recurrent neural networks that will take your English sentence and parse it word by word. It will then take the entire sentence and move that over to the French recurrent neural network for decoding. There, it will take the concept represented by the sentence and start with the first word to be translated. Once it has translated that, it will match that word against both the statistically probability of the likeliest word that would follow that first word and also against a distribution of the likeliest translation of the second word to come up with a match.
It continues to do this until you get a translation. Hinton explained that the neural networks are trained using random words, and after training the recurrent neural networks for one man-year, which equated to a few students working for about three months, the Hinton recurrent neural network translator matched state-of-the-art databases.
Hinton added that the more languages one adds, the better it makes the neural network, because it helps the computer narrow the probabilities it has to look at. Hinton concluded, “In few years time we will put it on a chip that fits into someone’s ear and have an English-decoding chip that’s just like a real Babel fish.”
For those who aren’t Douglas Adams fans, the Babel fish was an alien fish that the hero of his Hitchhiker’s Guide to the Galaxy books slipped into his ear at the beginning of his journey so he could instantly understand all of the alien languages he encountered.