Attachment Ambiguity—or—when is the gyudon cheap?

Every day on the way to work I walk by a fine establishment known as Yoshinoya (吉野家), Japan’s largest gyudon (牛丼) chain restaurant. For those of you whose lives have yet to be graced by gyudon, it’s a bowl of rice topped with beef and onions stewed in a sweet-savory soy-based sauce. Loving gyudon and being a cheapskate, I naturally noticed the recent 50 yen off gyudon promotion at Yoshinoya. The above photo is a photo of part of that sign.
Part of this sign, though, made me think about our new Ubiquity parser. In particular, it was the attachment ambiguity in the end date of the promotion. The text in the photo above literally is “April 15th (Wed.) 8PM until”. (Note that Japanese is a strongly head-final language, and that the “until” is a postposition.) There are two possible readings for this expression, as illustrated by the two composition trees below.

The first tree, on the left, represents the reading “until (April 15th 8PM)”, while the second represents two arguments: “on April 15th” and “until 8PM”. In other words, in the first reading, the promotion begins at some earlier date and extends until April 15th at 8PM while, in the second reading, the promotion is one day only, on April 15th, until 8pm. Such syntactic ambiguities are called “attachment ambiguities” in linguistics as it is an ambiguity of where different arguments “attach” in a tree representation.
This attachment ambiguity was possible because there was no clear marker on “April 15th,” which may have disambiguated it as “on April 15th”. In fact, in many languages this time position argument comes with no case marker or preposition, or it’s optional, making parsing for them difficult. If such a sentence is entered with spaces, the Ubiquity Parser: The Next Generation would try a parse where “8PM” is the “until” or goal argument and “April 15th” is an object argument, but it will only check its noun type, not put it in the correct semantic role (position). Perhaps this is something to think about in the future.
These types of situations will surely come up as we continue work on the Ubiquity parser, making it essential to look at different languages. Are there certain kinds of arguments in your language that do not have any word-external markers such as case or prepositions/postpositions?
Related posts:
- User-Aided Disambiguation: a demo
- Talking Ubiquity in Japan: 拡張機能勉強会にて発表
- Solving a Romantic Problem: Portmanteau’ed Prepositions
- Inside the Argument
- Ubiquity in Firefox: Focus on Japanese
Related posts brought to you by Yet Another Related Posts Plugin.
Tags: arguments, attachment ambiguity, food, Japanese culture, Japanese language, linguistics, Mozilla Planet, parser, syntax, Tokyo, ubiquity
If you enjoyed this post, make sure you subscribe to my RSS feed (optionally with tweets from my Twitter)!
April 15th, 2009 at 7:51 am
[…] cheap? VA:F [1.1.8_518]Rating: 0.0/5 (0 votes cast) This article was found on Planet Mozilla. Click here to visit the full article on the original website.Every day on the way to work I walk by a fine establishment known as Yoshinoya (吉野家), […]
April 15th, 2009 at 12:20 pm
普通は、日本語の文法では、左ツリーの「{4月15(水) 夜8時} まで」という解釈をします。 特別な表現でない限り、一まとまりの名詞として扱います。
まともな広告なら「4月2日朝10時から」と入れて期間を明示すべきところですが、これだと右ツリーの解釈をされて吉野家が損するだけ :-p
April 30th, 2009 at 3:09 pm
そうですか?私は最初、narrow scopeの解釈をしましたけど。「きのう3時まで飲んだ」が「普通」のみっぱなしだったとは解釈し辛いと思いますが?
Incidentally I bumped into this blog when I was thinking about why the negation of 'until' is ambiguous. (I didn't work until yesterday can mean you worked or did not work yesterday). I always naively thought it would be simply due to scope, but I am beginning to think i may be wrong, as the 'work' bit is not usually negated. Let me know if you know of some literature on this….
余談ですが「まで」を否定するとなぜ曖昧さが出るのかを考えていてこのサイトにぶちあたりました。「昨日まで働いた」はunambiguousなのに何で「昨日まで働かなかった」は昨日働いたかについて曖昧なんでしょう。スコープだとばっかり思ってましたが、働いたことは確かな以上、そうでもないような。ご存知でしたらお教えください。
May 1st, 2009 at 2:37 am
英語で応えるべきか日本語で応えるべきか… yosatoさん、どうしましょう。取りあえず英語で…
In the case of "I didn't work until yesterday," the syntax gives you two possible readings but the semantics of "until" makes one more preferred than another. You'll want to start by checking out Lauri Karttunen's classic paper "Until" from CLS 10 which I found a link for:
http://www2.parc.com/istl/members/karttune/public...
August 10th, 2009 at 7:39 am
Looks quite Yummy …..