blog

Posts Tagged ‘Chinese’

The 北京话儿 Beijing Pirate T-shirt

Saturday, June 14th, 2008

Speaking of t-shirts, I’d been toying with a t-shirt idea for the past year or two: a Beijing Pirate t-shirt. Let me explain…

A distinctive feature of Beijing dialect of Mandarin (and, indeed, most northern Chinese dialects) is the very frequent rhoticization (adding to or replacing the end of a word with “arr”) whose function is often glossed as a diminutive suffix. This phenomenon is called 儿化 (érhùa) in Chinese. Here are some examples, blatantly stolen from Wikipedia:

  • 公园(gōngyuán)(public garden) → 公园儿(gōngyuánr), pronounced “gōngyuár”
  • 小孩(xiǎohái) (small child) → 小孩儿(xiǎoháir), pronounced “xǐaohár”
  • 事 (shì) (thing) → 事儿(shìr), pronounced “shèr”

The result of this variation is that it makes you sound like a pirate… and thus my t-shirt idea was born:

Beijing Pirate shirt

(more…)

Testing Google’s Language Detection

Saturday, May 17th, 2008

google code

As Google adds ten more languages to its machine translation service, it seems to be on its way to becoming the most convenient universal translator of the world’s popular languages. Google’s handling of languages of course isn’t perfect, however—in particular, I’ve been complaining to friends for a while about the weaknesses of Google’s handling of queries in Chinese character (漢字/汉字) scripts. In this post, I run some tests using Google’s Language Detection service to try to better understand its handling of Chinese character queries.

Background

Chinese characters have been used all across East Asia, most notably in Chinese, Japanese, Korean, and Vietnamese (the “CJKV”). Prescriptivist writing reforms in Communist China and Japan have simplified many characters, though. Some characters were simplified in the same way, some in different ways, and some in only one country but not the other. For more information, there’s Wikipedia or Ken Lunde’s CJKV Information Processing.

The problem

The issue comes up when you try to search for a word in Chinese characters which clearly came from one Chinese character-using language. From my experience, Google doesn’t consider which language you are a user of, based on the query, and returns many results in other Chinese character-using languages as well.[^1]

(more…)

Taipei find: a dictionary of Chinese-Japanese false cognates

Saturday, March 22nd, 2008

The fact that Japanese and Chinese both share the use of Chinese characters. The connection goes beyond simply sharing characters, though: many two- and four-character expressions in Japanese come from older Chinese (these are known as Sino-Japanese items in the biz). This is how I can often “cheat” and use my knowledge of Japanese to guess what some Chinese words are saying, even if I have no idea how to pronounce them.

There are, however, many Chinese-Japanese false cognates—words which look the same and often do indeed have a shared etymology, but have quite different contemporary meanings.1 As such, I’ve often lamented to friends, especially learners of Japanese or Chinese, the lack of a dictionary highlighting these false cognates and how their usage differs between the Japanese and Chinese. A couple weekends ago I was browsing dictionaries in the Page One bookstore in Taipei 101 and I found exactly that: 誤用度100%日語漢子.

Each spread shows the three sets of cognates, with an explanation of the Japanese use, in Chinese, on the left, and vice versa on the right. It’s a godsend.

By the way, here’s my favorite Chinese-Japanese false cognate:

勉強 (べんきょう)

one’s study (N), to study (V) ~する

勉強 (miǎnqiǎng)

  • V
    1. force sb. to do sth. | ¹Bié ∼ tā. Don’t force him to do it.
    2. do with difficulty
  • S.V.
    1. unconvincing; strained | Zhège jìhuà ¹hěn ∼. This plan may not work.
  • Adv
    1. reluctantly; grudgingly | Tā ∼ xiàole yīxià. He forced a smile.
    2. barely enough | Tā ∼ néng shuō jǐ jù Fǎyǔ. She can speak only a little French.

  1. In French, they’re “faux amis,” but I think that sounds more like a spy. 


© 2006-2008 mitcho (Michael 芳貴 Erlewine).
Proudly powered by WordPress.
Entries (RSS) and Comments (RSS).
The views expressed on these pages are mine alone and do not
reflect those of my employers and clients, past and present.