![]() |
|||
![]() |
|||

|
|
|||||||||
TERAGRAM ADDS HUNGARIAN TO ITS LINGUISTIC SUITETeragram Supports All Major Western and Eastern European Languages, Korean, Japanese, Chinese and Arabic CAMBRIDGE, Mass. - May 2, 2005 - Teragram, the leading provider of multilingual natural language processing technologies, today announced the addition of Hungarian to its Linguistic Suite, bringing the total number of supported global languages to more than 30, including all major Western and Eastern European languages, Korean, Japanese, Chinese and Arabic. Support for additional languages is forthcoming. The strategic addition of Hungarian supports Teragram's morphological stemming, spelling correction, part-of-speech tagging, word and sentence tokenization, automatic categorization and entity extraction software products, which are essential in building multilingual information retrieval and text mining applications. Teragram's Linguistic Suite is primarily targeted at OEM customers who embed the technology to enhance their own products, especially Internet or enterprise search engines, and are looking for comprehensive language support, superior performance and quick time-to-market. Other targeted end users include companies selling information retrieval or content management software. "The admission of Hungary to the European Union last May has been a significant demand catalyst for linguistic technologies that support the Hungarian language," said Dr. Yves Schabes, co-founder and president of Teragram. "Our customers have made it clear that they wish to partner with one, highly-reliable vendor who can support the growing EU family of languages, rather than a patchwork of vendors specializing in individual languages." "Hungarian is a highly complex language and it is vexingly difficult to perform intelligent searches in the language without support from software like Teragram's Linguistic Suite," continues Dr. Schabes. "Our wholly-owned and developed Hungarian dictionary, which boasts more than five million entries, took a team of Teragram software developers and lexicographers over 36 months to build. The comprehensiveness and quality of our solution goes far beyond traditional, paper dictionaries, and includes coverage of tens of thousands of challenging new words, word contexts and meanings used with increasing frequency on the Internet and in large enterprises." According to Dr. Schabes, "Closely related to Finnish, another language Teragram supports, Hungarian is difficult to break down linguistically because it is highly inflectional and agglutinative, where words combine with other words, word parts and cases to form new words." Hungarian's linguistic challenges are easily handled by Teragram's dictionary as it breaks apart and parses meaning from highly agglutinative words. For example, the Hungarian word "mostohagyerekeidhez," meaning "to your step children" is actually composed of many smaller pieces and can be deconstructed into "mostoha + gyerek + e + i + d + hez." In this example, Teragram's software breaks the word down into its basic elements to derive meaning: "mostoha" (meaning 'step' in English), gyerek ('child'), "gyereke" (the possessive marker 'e' turns the meaning to 'child belonging to'), "gyerekei" (the plural marker 'i' turns the meaning to 'children belonging to'), "gyerekeid" (the second person marker 'd' turns the meaning to 'the children belonging to you' i.e. 'your children'), "gyerekeidhez" (the inflectional "hez" turns the meaning to 'to your children'). "The ability of Teragram's powerful linguistic engine to deconstruct words into meaningful parts is critical to improving the precision of information retrieval applications and search accuracy," says Dr. Schabes. "That's what our customers look to us to uniquely provide."
About Teragram |
|
||||||||
Copyright © Teragram Corporation