<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>mitcho.com &#187; survey</title>
	<atom:link href="http://mitcho.com/blog/tag/survey/feed/" rel="self" type="application/rss+xml" />
	<link>http://mitcho.com</link>
	<description></description>
	<lastBuildDate>Fri, 10 Feb 2012 23:24:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4-alpha-19719</generator>
		<item>
		<title>Ubiquity i18n: questions to ask</title>
		<link>http://mitcho.com/blog/projects/ubiquity-i18n-questions-to-ask/</link>
		<comments>http://mitcho.com/blog/projects/ubiquity-i18n-questions-to-ask/#comments</comments>
		<pubDate>Mon, 23 Mar 2009 10:13:37 +0000</pubDate>
		<dc:creator>mitcho</dc:creator>
				<category><![CDATA[projects]]></category>
		<category><![CDATA[collaboration]]></category>
		<category><![CDATA[commands]]></category>
		<category><![CDATA[contribute]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[Mozilla Planet]]></category>
		<category><![CDATA[parser]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[ubiquity]]></category>

		<guid isPermaLink="false">http://mitcho.com/blog/?p=1611</guid>
		<description><![CDATA[I recently have traveled a fair deal and have met many people excited about the Ubiquity project and its localization efforts. &#8220;I want to help,&#8221; say the people, but many are unsure where to start. As a linguist, studying a language involves looking at instances of that language as data. To this end, we as [...]
Related posts:<ol>
<li><a href='http://mitcho.com/blog/projects/contribute-how-your-language-identifies-its-arguments/' rel='bookmark' title='Contribute: how your language identifies its arguments'>Contribute: how your language identifies its arguments</a></li>
<li><a href='http://mitcho.com/blog/projects/writing-commands-with-semantic-roles/' rel='bookmark' title='Writing commands with semantic roles'>Writing commands with semantic roles</a></li>
<li><a href='http://mitcho.com/blog/projects/ubiquity-in-firefox-japanese/' rel='bookmark' title='Ubiquity in Firefox: Focus on Japanese'>Ubiquity in Firefox: Focus on Japanese</a></li>
</ol>

Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<p>I recently have traveled a fair deal and have met many people excited about <a href="http://ubiquity.mozilla.com">the Ubiquity project</a> and <a href="https://wiki.mozilla.org/Labs/Ubiquity/i18n">its localization efforts</a>. &#8220;I want to help,&#8221; say the people, but many are unsure where to start.</p>

<p>As a linguist, studying a language involves looking at instances of that language as data. To this end, we as Ubiquity internationalizers need to get at some examples of <em>target utterances</em>. Here&#8217;s an example survey which could be a good starting point for native speakers who want to contribute information on their language, based on <a href="https://wiki.mozilla.org/Taskfox/Verbs">Blair&#8217;s list of common Ubiquity verbs</a>.</p>

<p><span id="more-1611"></span></p>

<hr style='border-top: 2px gray dashed; height: 0; color: white;'/>

<h2>A survey for Ubiquity localization</h2>

<h3>Instructions</h3>

<p>How would you express the following commands in your language? The words in CAPITAL LETTERS do not need to be translated. Feel free to give multiple possible answers for each command.</p>

<p>Try to express the same command rather than forcing a &#8220;literal translation&#8221;; for example, if there&#8217;s no &#8220;map&#8221; verb in your language, you could translate example (8) as <code>lookup a map of PLACE</code>. Please keep in mind that the <a href="http://en.wikipedia.org/wiki/addressee">addressee</a> is a computer.</p>

<h3>Basic word order / argument structure</h3>

<ol>
<li><code>search HELLO</code></li>
<li><code>search HELLO with google</code></li>
<li><code>translate HELLO from English to French</code></li>
<li><code>lookup the weather for PLACE</code></li>
<li><code>shop for SHOES with Amazon</code></li>
<li><code>email HELLO to Bill</code></li>
<li><code>email HELLO to ADDRESS</code></li>
<li><code>map PLACE</code></li>
<li><code>find HELLO</code></li>
<li><code>tab to HELLO</code> or <code>switch to HELLO tab</code></li>
</ol>

<p>&#8230;</p>

<h3>Pronominal/deictic arguments (aka &#8220;magic words&#8221;)</h3>

<ol>
<li><code>search this with google</code></li>
<li><code>translate this to French</code></li>
<li><code>bookmark this tab</code></li>
</ol>

<p>&#8230;</p>

<hr style='border-top: 2px gray dashed; height: 0; color: white;'/>

<h3>How this data is used</h3>

<p>Responses to these surveys would be used to identify certain salient features of the language, such as <a href="http://mitcho.com/blog/projects/three-ways-to-argue-over-arguments/">how the language codes for its arguments</a> (for example using <a href="http://en.wikipedia.org/wiki/adpositions">adpositions</a>, <a href="http://en.wikipedia.org/wiki/case marking">case marking</a>, or word order), whether the commands tend to be verb-inital or -final. Individual case markings, for example, can be identified by comparing <em><a href="http://en.wikipedia.org/wiki/minimal pairs">minimal pairs</a></em>—for example, by comparing item (1) and (2), we can learn how <code>google</code> in an instrumental role is marked, or by comparing example (2) and the &#8220;magic word&#8221; example (1), we can identify the appropriate &#8220;magic word&#8221; and determine whether the language uses any <a href="http://en.wikipedia.org/wiki/clitics">clitics</a> or not.</p>

<h3>Data collection</h3>

<p>In the future we ideally could build a web-based system to collect these &#8220;utterances.&#8221; We could also use such a system to automatically test our parsers in different languages against the sentences in the <em>command-bank,</em> or ultimately even generate parser parameters based on those sentences. That would essentially reduce the parser-construction process to a more run-of-the-mill string translation process.</p>
<p>Related posts:</p><ol>
<li><a href='http://mitcho.com/blog/projects/contribute-how-your-language-identifies-its-arguments/' rel='bookmark' title='Contribute: how your language identifies its arguments'>Contribute: how your language identifies its arguments</a></li>
<li><a href='http://mitcho.com/blog/projects/writing-commands-with-semantic-roles/' rel='bookmark' title='Writing commands with semantic roles'>Writing commands with semantic roles</a></li>
<li><a href='http://mitcho.com/blog/projects/ubiquity-in-firefox-japanese/' rel='bookmark' title='Ubiquity in Firefox: Focus on Japanese'>Ubiquity in Firefox: Focus on Japanese</a></li>
</ol>
<p>Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://mitcho.com/blog/projects/ubiquity-i18n-questions-to-ask/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
	</channel>
</rss>

