blog

Ubiquity i18n: questions to ask

I recently have traveled a fair deal and have met many people excited about the Ubiquity project and its localization efforts. “I want to help,” say the people, but many are unsure where to start.

As a linguist, studying a language involves looking at instances of that language as data. To this end, we as Ubiquity internationalizers need to get at some examples of target utterances. Here’s an example survey which could be a good starting point for native speakers who want to contribute information on their language, based on Blair’s list of common Ubiquity verbs.


A survey for Ubiquity localization

Instructions

How would you express the following commands in your language? The words in CAPITAL LETTERS do not need to be translated. Feel free to give multiple possible answers for each command.

Try to express the same command rather than forcing a “literal translation”; for example, if there’s no “map” verb in your language, you could translate example (8) as lookup a map of PLACE. Please keep in mind that the addressee is a computer.

Basic word order / argument structure

  1. search HELLO
  2. search HELLO with google
  3. translate HELLO from English to French
  4. lookup the weather for PLACE
  5. shop for SHOES with Amazon
  6. email HELLO to Bill
  7. email HELLO to ADDRESS
  8. map PLACE
  9. find HELLO
  10. tab to HELLO or switch to HELLO tab

Pronominal/deictic arguments (aka “magic words”)

  1. search this with google
  2. translate this to French
  3. bookmark this tab


How this data is used

Responses to these surveys would be used to identify certain salient features of the language, such as how the language codes for its arguments (for example using adpositions, case marking, or word order), whether the commands tend to be verb-inital or -final. Individual case markings, for example, can be identified by comparing minimal pairs—for example, by comparing item (1) and (2), we can learn how google in an instrumental role is marked, or by comparing example (2) and the “magic word” example (1), we can identify the appropriate “magic word” and determine whether the language uses any clitics or not.

Data collection

In the future we ideally could build a web-based system to collect these “utterances.” We could also use such a system to automatically test our parsers in different languages against the sentences in the command-bank, or ultimately even generate parser parameters based on those sentences. That would essentially reduce the parser-construction process to a more run-of-the-mill string translation process.

Related posts:

  1. Contribute: how your language identifies its arguments
  2. Exploring Command Chaining in Ubiquity: Part 1
  3. The Aliens Aliases Have Landed
  4. Where’s The Verb?
  5. Automating the Linguist’s Job

Related posts brought to you by Yet Another Related Posts Plugin.

Tags: , , , , , , , ,

If you enjoyed this post, make sure you subscribe to my RSS feed (optionally with tweets from my Twitter)!

21 Responses to “Ubiquity i18n: questions to ask”

  1. Jan! Says:

    For Dutch (nl):

    Basic word order / argument structure 1. zoek HELLO 1. zoek naar HELLO 2. zoek HELLO met Google 2. zoek naar HELLO met Google 3. vertaal HELLO van Engels naar Frans 3. vertaal HELLO van het Engels naar het Frans 4. toon het weer voor PLACE 5. koop SHOES bij Amazon 6. e-mail HELLO naar Bill 6. email HELLO naar Bill 6. mail HELLO naar Bill 7. e-mail HELLO naar ADDRESS 7. email HELLO naar ADDRESS 7. mail HELLO naar ADDRESS 8. toon kaart van PLACE 8. toon kaart voor PLACE 8. toon map van PLACE 8. toon map voor PLACE 9. zoek HELLO 9. vind HELLO 10. tab naar HELLO 10. verander naar tab HELLO 10. verander naar HELLO tab

    Prepositional arguments 1. zoek dit met Google 2. vertaal dit naar het Frans 2. vertaal dit naar Frans 3. bookmark deze tab 3. plaats een bladwijzer op deze tab 3. plaats een bladwijzer voor deze tab

  2. Alberto Santini Says:

    Italian version.

    1. search HELLO
    • cerca HELLO
    • trova HELLO

      1. search HELLO with google
    • cerca HELLO con google

    • trova HELLO con google

      1. translate HELLO from English to French
    • traduci HELLO dall'inglese al francese

    • traduci HELLO da inglese a francese

      1. lookup the weather for PLACE
    • guarda il tempo per PLACE

    • dammi le previsioni per PLACE
    • dimmi le previsioni per PLACE

      1. shop for SHOES with Amazon
    • compra le scarpe con Amazon

    • compra le scarpe attraverso Amazon

      1. email HELLO to Bill
    • invia HELLO a Bill

    • spedisci HELLO a Bill

      1. email HELLO to ADDRESS
    • invia HELLO a ADDRESS

    • spedisci HELLO a ADDRESS

      1. map PLACE
    • trova PLACE

    • indica PLACE

      1. find HELLO
    • definisci HELLO

    • ricerca HELLO

      1. tab to HELLO or switch to HELLO tab
    • cambia a HELLO tab

    • vai a HELLO tab

      1. search this with google
    • cerca this con google

      1. translate this to French
    • traduci this in francese

      1. bookmark this tab
    • segna this tab

  3. NGUYỄN-Mạnh Hùng Says:

    In Vietnamese:

    Basic: 1. tìm HELLO 2. tìm HELLO với google 3. dịch HELLO từ tiếng Anh sang tiếng Pháp 4. tra cứu thời tiết cho PLACE 5. mua sắm SHOES với Amazon 6. email HELLO tới Bill 7. email HELLO tới ADDRESS 8. bản đồ PLACE 9. tìm HELLO 10. thẻ sang HELLO or chuyển sang thẻ HELLO

    Prepositional: 1. tìm cụm này với google 2. dịch cụm này sang tiếng Pháp 3. đánh dấu thẻ này

  4. philiKON Says:

    One thing that still trips me up with Ubiquity is its verb-object philosophy. More than often I tend to think about something, type it in and then realize I forgot to tell Ubiquity what to do with it. So now I have to go to the beginning of the line and add a command. In these instances, I wish Ubiquity was object-verb. In fact, I believe Jef Raskin recommends this philosophy in his book :) . I do realize that Ubiquity can be object-verb, viz. when you select a text in the web page and then enter a command.

    What this has to do with i18n: In German for instance, one could translate "search HELLO" as an imperative ("suche HELLO") or as an infinitive ("HELLO suchen"). The latter seems more natural to me because it'd be the answer to "What do you want to do?" — "HELLO suchen" or "HELLO an Bill mailen". German (and other Germanic languages) might be unique in this respect, though as I've tried to point out above, I don't think my preference for object-verb isn't just language-related. It happens to match my train of thought much better.

  5. Survey for Ubiquity localization @ Yet Another Tech Blog Says:

    […] can we localize this set of commands in Italian (see Mitcho’s post)? 1. search HELLO 2. search HELLO with google 3. translate HELLO from English to French 4. lookup […]

  6. Robert Kaiser Says:

    German computer commands (like in menus, etc.) are usually put in passive voice, but some people might want to enter them in active voice in a command-line interface, so I'll list passive first, then active.

    Basic word order / argument structure

    1. search HELLO

    HELLO suchen

    Suche HELLO

    1. search HELLO with google

    HELLO mit Google suchen

    Suche HELLO mit Google

    1. translate HELLO from English to French

    HELLO von Englisch auf Französisch übersetzen

    Übersetze HELLO von Englisch auf Französisch

    1. lookup the weather for PLACE

    Wetter in PLACE anzeigen

    Zeige Wetter in PLACE an

    ("anzeigen" is "show", could use lots of different verbs here, intuitively I probably would try it even without a verb and enter just "Wetter in PLACE")

    1. shop for SHOES with Amazon

    SHOES bei Amazon kaufen

    Kaufe SHOES bei Amazon

    1. email HELLO to Bill

    HELLO per E-Mail an Bill senden

    Sende HELLO per E-Mail an Bill

    (those are "send via email", the "to email" Variant is a Germanized English word and not that popular, but would be as follows:)

    HELLO an Bill mailen

    Maile HELLO an Bill

    1. email HELLO to ADDRESS

    HELLO per E-Mail an ADDRESS senden

    Sende HELLO per E-Mail an ADDRESS

    (same as above)

    1. map PLACE

    Karte von PLACE anzeigen

    Zeige Karte von PLACE an

    (as with weather, I've used "show map" here but intuitively I'd also probably leave out the verb completely and go for "Karte von PLACE")

    1. find HELLO

    HELLO suchen

    Suche HELLO

    (note that we usually use the same word for search and find, though "finden" would exist)

    1. tab to HELLO or switch to HELLO tab

    Auf HELLO-Tab umschalten /or/ Auf Tab "HELLO" umschalten

    Schalte auf HELLO-Tab um /or/ Schalte auf Tab "HELLO" um

    Prepositional arguments (aka “magic words”)

    1. search this with google

    Dies mit Google suchen

    Suche dies mit Google

    (instead of "Dies"/"dies" a probably equal number of people would use "Das"/"das")

    1. translate this to French

    Dies auf Französisch übersetzen

    Übersetze dies auf Französisch

    (same as above)

    1. bookmark this tab

    Dieses Tab als Lesezeichen ablegen

    Lege dieses Tab als Lesezeichen ab

    (uses "file as bookmark" as there is no "to bookmark" word in German, there are also discussions if "Tab" has male or neutral gender, I used neutral, if male is used, it's "Diesen Tab"/"diesen Tab")

    note also that in some cases, multiple different prepositions should be accepted, as some people would probably enter "Wetter für PLACE" or even "Wetter auf PLACE" if it's an island, similar things for maps, and the services to shop and search with.

  7. Automating the Linguist’s Job Says:

    […] blog blog « Ubiquity i18n: questions to ask […]

  8. mitcho Says:

    @philiKON - thanks for the reply. We're definitely looking at supporting verb-final languages (including optionally verb-final ones, like German) and also applying these innovations to other languages as well, in case the verb is not entered first. I have some initial thoughts on verb-final languages and their advantages here on an article about Japanese. This design has been incorporated into the latest parser design proposal as well, and you can play around with a demo of the upcoming parser here.

  9. Steve Yu Says:

    In Korean.

    1. search HELLO HELLO 검색하기

    2. search HELLO with google HELLO를 구글로(에서) 검색하기 구글로(에서) HELLO 검색하기

    3. translate HELLO from English to French HELLO를 영어에서 프랑스어로 번역하기

    4. lookup the weather for PLACE PLACE 날씨 알아보기

    5. shop for SHOES with Amazon Amazon에서 SHOES를 구매하기

    6. email HELLO to Bill Bill에게 HELLO 이메일을 보내기

    7. email HELLO to ADDRESS HELLO 이메일을 ADDRESS로 보내기

    8. map PLACE 지도에서 PLACE 위치 찾기 지도에 PLACE 위치 출력하기

    9. find HELLO HELLO를 찾기

      1. tab to HELLO or switch to HELLO tab HELLO 탭으로 변경하기
  10. flod Says:

    (explanation here)

    List of commands in Italian.

    1. cerca HELLO
    2. cerca HELLO su google
    3. traduci HELLO da inglese a francese
    4. controlla meteo di PLACE
    5. compra SHOES su Amazon
    6. invia HELLO per e-mail a Bill
    7. invia HELLO per e-mail a ADDRESS
    8. cerca mappa di PLACE
    9. trova HELLO
    10. passa alla scheda HELLO
  11. Ville Says:

    Finnish here:

    1. etsi HELLO
    2. etsi HELLO googlesta
    3. käännä HELLO englannista ranskaksi
    4. tarkista sää PLACE
    5. osta SHOES Amazonista
    6. lähetä sähköpostissa HELLO Billille (send in an email X to Bill)
    7. lähetä sähköpostissa HELLO (osoitteeseen) ADDRESS (without the word in parenthesis "to address" the command's pretty odd)
    8. näytä kartalla PLACE (show on map)
    9. etsi HELLO
      1. siirry HELLO välilehteen (the other version doesn't make sense to me.. even in english :) )

    Pronominal/deictic arguments (aka “magic words”)

    1. etsi tätä googlella
    2. käännä tämä ranskaksi
    3. lisää tämä välilehti kirjanmerkkeihin
  12. Ricardo Panaggio Says:

    Translations to Portuguese:

    1. search HELLO

    buscar HELLO

    1. search HELLO with google

    buscar HELLO com Google

    1. translate HELLO from English to French

    traduzir HELL do Inglês para Francês

    1. lookup the weather for PLACE

    Procurar a previsão do tempo para PLACE

    1. shop for SHOES with Amazon

    comprar SHOES com Amazon

    1. email HELLO to Bill

    enviar email* HELLO para Bill

    1. email HELLO to ADDRESS

    enviar email* HELL para ADDRESS

    1. map PLACE

    mapear PLACE

    1. find HELLO

    encontrar HELLO

    1. tab to HELLO or switch to HELLO tab

    guiar/mudar* para ou trocar para aba HELLO

    1. search this with google

    procurar isto/isso** com google

    1. translate this to French

    traduzir isto/isso** para Francês

    1. bookmark this tab

    favoritar* esta/essa** aba

    • There doesn't exist a good word that translates these words to Portuguese. We generally use them in English. ** Some words are commonly misused in Portuguese (at least in Brazil, where I live). And there should be some misinterpretation of what to use, since isto/este/esta are used with those things that are closer to the speaker and isso/esse/essa are used with things closer to the listener (I don't know if "this" on those phrases are closer to the user or to the machine). But there's another grammatical rule that says that when we're referencing things that were said or that occurred previously we use esse/essa/isso. I do think this last rule is the most applicable, but I'm not completely sure of it.
  13. Where’s The Verb? Says:

    […] I received a great comment by PhiliKON on German and associated data by Robert Kaiser on my blog post […]

  14. Benoit Says:

    Here it is for French. Like I said in your other post, mostly infinitive forms:

    Basic word order / argument structure

    1. search HELLO

    chercher HELLO /
    rechercher HELLO

    1. search HELLO with google

    (re)chercher HELLO avec/dans google (infinitive form) maybe: recherche google de HELLO (no verb!)

    1. translate HELLO from English to French

    traduire HELLO de l'anglais vers le français maybe: traduire HELLO d'anglais en français (not really correct but shorter and still understandable)

    1. lookup the weather for PLACE

    trouver la météo pour PLACE maybe: météo à PLACE (no verb!)

    1. shop for SHOES with Amazon

    acheter des SHOES sur Amazon

    1. email HELLO to Bill

    There is no verb to say "send an email". This could work but there would be an ambiguity with IM/SMS: écrire HELLO à Bill ( =write to) / envoyer HELLO à Bill ( =send to)

    Longer but not ambiguous:
    envoyer HELLO par email/courriel à Bill

    1. email HELLO to ADDRESS

    Probably less ambiguous due to "ADDRESS" envoyer HELLO à ADDRESS / écrire HELLO à ADDRESS

    1. map PLACE

    trouver la carte/le plan de PLACE / carte/plan de PLACE (no verb)

    1. find HELLO

    trouver HELLO / rechercher HELLO dans la page (same as "search HELLO")

    1. tab to HELLO or switch to HELLO tab

    passer à l'onglet HELLO

    … Pronominal/deictic arguments (aka “magic words”)

    [1 and 2 don't feel natural but would work, 3 is good because "this" is qualified: you kow it is a tab.]

    1. search this with google

    (re)chercher ceci/ça avec/dans google

    Would work better with:
    - (re)chercher cette phrase avec/dans google (search for this sentence…) - (re)chercher ce mot avec/dans google (search for this word…)

    1. translate this to French

    Traduire ceci/ça en français

    Would work better with:
    - traduire cette phrase en français (translate this sentence…) - traduire ce mot en français (translate this word…)

    1. bookmark this tab

    Marquer cet onglet also: Ajouter cet onglet aux marque-pages (for people not familiar with "mark" as a verb to add a bookmark)

  15. mitcho Says:

    Alberto, is there an Italian translation for "this" in the last three examples?

  16. Alberto Santini Says:

    I didn't translate "this" because it is replaced with the selection. Literally I would translate it with "questo".

    For instance, cerca questo con Google

  17. Alejandro Moreno Says:

    SPANISH.

    I toyed with translating the verbs in the imperative form, but infitive is much cleaner ("make a map" in imperative could be "haz un mapa" or "hazme un mapa", literally, "make a map for me", but quite common in speech). So here goes:

    1. buscar HELLO (identical to #9; assuming this is "search on web with default engine", then it could turn to "buscar HELLO en la web")
    2. buscar HELLO con google
    3. traducir HELLO de Inglés a Francés
    4. buscar [el] clima de PLACE
    5. comprar SHOES con Amazon (lit. "buy SHOES with Amazon", there's no "shop" verb; could also simply be "search SHOES with <some e-store>")
    6. mandar correo con HELLO a Bill
    7. mandar correo con HELLO a ADDRESS
    8. mostrar mapa de PLACE
    9. buscar HELLO (identical to #1; if this is "find on page", then it could turn to "buscar HELLO en ésta página")
    10. pestaña HELLO / ir a pestaña HELLO

    11. buscar esto con google

    12. traducir esto a Francés
    13. agregar esta pestaña a marcadores (lit. add this tab to bookmarks) OR crear un marcador para esta pestaña (lit. create a bookmark for this tab) OR marcar esta pestaña (mark this tab)

    Unfotunately, not one of my suggestions for magic words #3 seems better than the other. At least to me. Other spanish speakers, opine! :)

  18. Zhang Yi Jiang Says:

    Translating these commands into Simplified Chinese is quite an exercise in itself, and highlighted the differences between the two language. I'm can't claim to be a linguist, but I am bilingual, so although these may not be perfect they should be natural, at least.

    1.(找,搜索,寻找) —- / 寻找有关 —- 的讯息 (找,搜索,寻找) = Search/Find

    2.用(谷歌/百度)搜索 —- 百度 (Baidu) is the most popular search engine in mainland China 3.将 —- 从英文翻译成法文 4.找 —- [地区]的天气

    5.在卓越网上找 —- 买 This one is very awkward - Direct trans. : "On Amazon.com find —- to buy"

    Maybe this would be better: 在卓越网上买 —- But it could be misunderstood: "shop" in English in this case means to look up at a e-store, while 买 simply means buy. Therefore, the sentence above could be read as "On Amazon.com buy (things)"

    6.电邮 —- 给比尔,寄 —- 给比尔,将 —- 发给比尔 Lit. Trans. : "Email (this) to Bill" / "Send (this) to Bill"
    7.电邮 —- 给 —-,寄 —- 给 —-,将 —- 发给 —-

    8.找 —- 的地图 Lit. Trans. : "Search for/Find (place)'s map"

    9.(找,搜索,寻找)—- 10.换到 —- 的分页 Lit Trans. : "Switch to (tab name)'s tab"

    1.用(谷歌)搜索(这个) 2.将(这个)翻译成法文 3.将(此页)加入书签 此页=This page

    Some of these would literally not be directly translatable to English, or sound extremely awkward in English because some of the words have no English counterparts. Also, certain word would vary a lot based on context, words like "this" in commands, would have these variations in Chinese:

    这个 - This, almost meaningless even with context 这[段/句][话/字] - This sentence/paragraph 这个字 - This word 这个[object]

    And so on… Some of these are more natural than others in many contexts.

  19. Sergio_Majluf Says:

    In spanish we are somewhat used to adopt international words, and make them our own. So, in a very informal scenario, one could translate "email this to John" as "mailear esto a John", as the ending "ar" (as well as "ir" and "er") most of the times denotates the word is a verb. A more formal situation would be to translate the action, rather than the protocol, so one could type "escribir esto a John", wich in turns, tranlsates to "Write this to John"

  20. Traveller Says:

    It really depends whether you want correct sentences or something that passes for correct sentences in "daily computer life". Some examples:

    1. In German, a gramatically correct use of translate (as in, not making you fail a German exam most horribly, unlike the examples above) would be

    HELLO aus dem Englischen ins Französische übersetzen

    or

    Übersetze HELLO aus dem Englischen ins Französische

    (( or maybe ))

    HELLO vom Englischen ins Französische übersetzen

    Übersetze HELLO vom Englischen ins Französische

    However, computer users (especially in countries where translated software is common) are used to horrible mistreatment of their language, and do not even notice. One of the reasons is that, while a lot of software and localization libraries are written in english, most european languages are more complicated than english. Allow me to demonstrate:

    Übersetze HELLO vom Französischen ins Englische

    Note how the language names changed this way around, and remember the discussion about romanian in one of your last posts. The from-language is in the dative case, while the to-language is in the accusative case. The english-based localisation templates are not case-aware, so both language names would come from the same list, which is generally consists of unmodified (nominative) language names, so it would end up being "vom Französisch ins Englisch" which is relatively jarring. The end result might look like the examples from Robert Kaiser above.

    Another example that demonstrates "computerized natural language", this time in english:

    If you think about it, search and find are not actually synonymous in english. You do not search Hello, you search (the Web) for hello, unless of course you search Hello for Ls, or search your apartement for beer. A similar thing applies to "map": "to map London" does not actually mean "to show a map of London" but instead "to explore London and draw a map of it".

    So in effect, both commands in ubiquity are not actually natural language commands, but rather a short-hand that stands for something else.

    Therefore if you actually want to translate

    search for HELLO

    into german it would be

    nach HELLO suchen

    or

    suche nach HELLO

    BUT since "suche" is actually "seek" or "find" rather then search, you also can say "suche HELLO [auf http://example.com". Unless I fell victim to the same computerized-language-blindness as my predecessor. To misquote Bones — "I am a programmer, not a linguist!".

  21. Research papers Says:

    I must say this article "Exploring Command Chaining in Ubiquity: Part 1" helped me a week ago with writing my research paper. Thanks for a link.


© 2006-2010 mitcho (Michael 芳貴 Erlewine).
Proudly powered by WordPress.
Entries (RSS) and Comments (RSS).
The views expressed on these pages are mine alone and do not
reflect those of my employers and clients, past and present.