Tuesday, June 21, 2011

Official Google Blog

Official Google Blog


Google Translate welcomes you to the Indic web

Posted: 21 Jun 2011 09:18 AM PDT



 

Beginning today, you can explore the linguistic diversity of the Indian sub-continent with Google Translate, which now supports five new experimental alpha languages: Bengali, Gujarati, Kannada, Tamil and Telugu. In India and Bangladesh alone, more than 500 million people speak these five languages. Since 2009, we've launched a total of 11 alpha languages, bringing the current number of languages supported by Google Translate to 63.

Indic languages differ from English in many ways, presenting several exciting challenges when developing their respective translation systems. Indian languages often use the Subject Object Verb (SOV) ordering to form sentences, unlike English, which uses Subject Verb Object (SVO) ordering. This difference in sentence structure makes it harder to produce fluent translations; the more words that need to be reordered, the more chance there is to make mistakes when moving them. Tamil, Telugu and Kannada are also highly agglutinative, meaning a single word often includes affixes that represent additional meaning, like tense or number. Fortunately, our research to improve Japanese (an SOV language) translation helped us with the word order challenge, while our work translating languages like German, Turkish and Russian provided insight into the agglutination problem.

You can expect translations for these new alpha languages to be less fluent and include many more untranslated words than some of our more mature languages—like Spanish or Chinese—which have much more of the web content that powers our statistical machine translation approach. Despite these challenges, we release alpha languages when we believe that they help people better access the multilingual web. If you notice incorrect or missing translations for any of our languages, please correct us; we enjoy learning from our mistakes and your feedback helps us graduate new languages from alpha status. If you're a translator, you'll also be able to take advantage of our machine translated output when using the Google Translator Toolkit.

Since these languages each have their own unique scripts, we've enabled a transliterated input method for those of you without Indian language keyboards. For example, if you type in the word "nandri," it will generate the Tamil word நன்றி (see what it means). To see all these beautiful scripts in action, you'll need to install fonts* for each language.

We hope that the launch of these new alpha languages will help you better understand the Indic web and encourage the publication of new content in Indic languages, taking us five alpha steps closer to a web without language barriers.

*Download the fonts for each language: Tamil, Telugu, Bengali, Gujarati and Kannada.

Thousands of “hackers for good” build applications for humanity

Posted: 20 Jun 2011 11:53 AM PDT

(Cross-posted on the Google.org Blog)

Earlier this month, thousands of "hackers for good" gathered in more than 19 different global locations—from Berlin to Nairobi, and Sydney to Sao Paulo—to participate in Random Hacks of Kindness #3. These teams are now off and running, working with NGO and government advisors to finish their applications for humanity.

In partnership with Microsoft, Yahoo!, NASA and the World Bank, we founded RHoK in 2009 to build and support a community creating open source technology for crisis response. At RHoK #3, we expanded the mandate to include climate change, and we also recently announced that we're broadening the scope in the future to tackle any development challenges.

Of the more than 75 solutions submitted for judging at this year's global events, many are already on their way to making a difference around the world. The UN, in partnership with the Colombia government, is considering adopting the shelter management system developed at RHoK Bogota to aid the 3 million victims of winter flooding in South America. Of the nine hacks submitted for judging at RHoK Sao Paulo, two are already in use and two others may be further developed and incorporated into the restructuring of the National Weather Service. The winning application at RHoK Philadelphia, developed in response to a problem proposed by the World Bank Water group, is set for further development at the WaterHackathon, RHoK's first community-sponsored event, later this year.

At the RHoK Silicon Valley event at Google's Mountain View campus, we selected three winners:
  • SMS Person Finder enables anyone with a phone to interact with Person Finder, a software application that Google built to help people connect with their loved ones following a disaster. The Google Crisis Response team is working with this group to integrate their application into future Google Person Finder deployments
  • Hey Cycle makes it easier for people to reuse and recycle items by setting up email alerts when free items that they're looking for are entered on freecycle.org
  • FoodMovr connects people with excess food to others who need it through a simple live application
We're proud to be one of the founding partners and ongoing sponsors of Random Hacks of Kindness and look forward to seeing these application make a difference. Stay tuned for future RHoK events, and follow the progress of the community at RHoK.org.

No comments:

Post a Comment