Lu Tong's Seven Bowls of Tea 七碗詩 ---LuTong (Tang. 790~835)盧仝(唐. 790~835)
Cedict(MDBG.NET) will parse the correct polysyllabic combinations if
they exist in its dictionary. Nciku does so equally. If its not in
one it is in the other. You pick a character in a Chinese string you
dont know if it is mono or polysyllabic. Sites like these know how to
parse. If the Chinese can do it a computer can.
Jim
PS There are lists of STOP words in Chinese that help with sentence
parsing. I will say Google translate is better at this than word
parsing.
On Feb 25, 3:55 pm, Lewis Perin > wrote:
> Space Cowboy > writes:
> >Google translate would be better if it recognized more polysyllabic
> >Chinese characters for words you find in dictionaries like Cedict.
>
> Yeah, but Chinese text doesnt have spaces between words. And a given
> character in a sentence might be the last character in one word or the
> first in another or a middle character in yet another. Teaching a
> computer program to figure out this stuff reliably would not be easy.
>
> /Lew
> ---
> Lew Perin /
|