<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>mitcho.com &#187; Italian</title>
	<atom:link href="http://mitcho.com/blog/tag/italian/feed/" rel="self" type="application/rss+xml" />
	<link>http://mitcho.com</link>
	<description></description>
	<lastBuildDate>Fri, 10 Feb 2012 23:24:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4-alpha-19719</generator>
		<item>
		<title>Ubiquity in Italian!</title>
		<link>http://mitcho.com/blog/projects/ubiquity-in-italian-2/</link>
		<comments>http://mitcho.com/blog/projects/ubiquity-in-italian-2/#comments</comments>
		<pubDate>Mon, 18 May 2009 04:32:25 +0000</pubDate>
		<dc:creator>mitcho</dc:creator>
				<category><![CDATA[projects]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[Italian]]></category>
		<category><![CDATA[Mozilla Planet]]></category>
		<category><![CDATA[parser]]></category>
		<category><![CDATA[ubiquity]]></category>

		<guid isPermaLink="false">http://mitcho.com/blog/?p=2075</guid>
		<description><![CDATA[Thanks to the great work of Sandro Della Giustina, we now have a preliminary Italian parser for use with Ubiquity Parser 2. Sandro brought up a good point, however, about Italian prepositions which contract with the article and the head noun. For example, traduci dall'inglese al cinese translate from=the=English to=the Chinese One current solution is [...]
Related posts:<ol>
<li><a href='http://mitcho.com/blog/link/ubiquity-in-italian/' rel='bookmark' title='Ubiquity in Italian'>Ubiquity in Italian</a></li>
<li><a href='http://mitcho.com/blog/how-to/adding-your-language-to-ubiquity-parser-2/' rel='bookmark' title='Adding Your Language to Ubiquity Parser 2'>Adding Your Language to Ubiquity Parser 2</a></li>
<li><a href='http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/' rel='bookmark' title='Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions'>Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions</a></li>
</ol>

Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<p>Thanks to the great work of Sandro Della Giustina, we now have a preliminary Italian parser for use with Ubiquity Parser 2. Sandro brought up a good point, however, about Italian prepositions which contract with the article <em>and</em> the head noun. For example,</p>


<div class="wp_syntax"><div class="code"><pre class="it" style="font-family:monospace;">traduci   dall'inglese     al     cinese
translate from=the=English to=the Chinese</pre></div></div>


<p>One current solution is to add <a href="http://en.wikipedia.org/wiki/zero-width space">zero-width spaces</a> after these contracted articles, <em>all&#8217;</em> and <em>dall&#8217;</em>.<sup id="fnref:1"><a href="#fn:1" rel="footnote">1</a></sup> The appropriate way to add this to the parser is by defining a custom <code>wordBreaker()</code> method.</p>


<div class="wp_syntax"><div class="code"><pre class="javascript" style="font-family:monospace;">it._patternCache.<span style="color: #660066;">contractionMatcher</span> <span style="color: #339933;">=</span> <span style="color: #003366; font-weight: bold;">new</span> RegExp<span style="color: #009900;">&#40;</span><span style="color: #3366CC;">'(^| )(all<span style="color: #000099; font-weight: bold;">\'</span>|dall<span style="color: #000099; font-weight: bold;">\'</span>)'</span><span style="color: #339933;">,</span><span style="color: #3366CC;">'g'</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
it.<span style="color: #660066;">wordBreaker</span> <span style="color: #339933;">=</span> <span style="color: #003366; font-weight: bold;">function</span><span style="color: #009900;">&#40;</span>input<span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
  <span style="color: #000066; font-weight: bold;">return</span> input.<span style="color: #660066;">replace</span><span style="color: #009900;">&#40;</span><span style="color: #000066; font-weight: bold;">this</span>._patternCache.<span style="color: #660066;">contractionMatcher</span><span style="color: #339933;">,</span><span style="color: #3366CC;">'$1$2<span style="color: #000099; font-weight: bold;">\u</span>200b'</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #009900;">&#125;</span><span style="color: #339933;">;</span></pre></div></div>


<p>Grazie Sandro!</p>

<div class="footnotes">
<hr />
<ol>

<li id="fn:1">
<p>As <a href="http://blog.mozilla.com/nattokirai/">John Daggett</a> pointed out to me, in the future we may have to add an intermediate shallow parse instead of adding characters (in this case, the zero-width space) to the modified input.&#160;<a href="#fnref:1" rev="footnote">&#8617;</a></p>
</li>

</ol>
</div>
<p>Related posts:</p><ol>
<li><a href='http://mitcho.com/blog/link/ubiquity-in-italian/' rel='bookmark' title='Ubiquity in Italian'>Ubiquity in Italian</a></li>
<li><a href='http://mitcho.com/blog/how-to/adding-your-language-to-ubiquity-parser-2/' rel='bookmark' title='Adding Your Language to Ubiquity Parser 2'>Adding Your Language to Ubiquity Parser 2</a></li>
<li><a href='http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/' rel='bookmark' title='Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions'>Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions</a></li>
</ol>
<p>Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://mitcho.com/blog/projects/ubiquity-in-italian-2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Solving Another Romantic Problem: Weak Pronouns</title>
		<link>http://mitcho.com/blog/projects/solving-another-romantic-problem/</link>
		<comments>http://mitcho.com/blog/projects/solving-another-romantic-problem/#comments</comments>
		<pubDate>Tue, 12 May 2009 08:09:31 +0000</pubDate>
		<dc:creator>mitcho</dc:creator>
				<category><![CDATA[projects]]></category>
		<category><![CDATA[Catalan]]></category>
		<category><![CDATA[French]]></category>
		<category><![CDATA[Italian]]></category>
		<category><![CDATA[Modern Greek]]></category>
		<category><![CDATA[Mozilla Planet]]></category>
		<category><![CDATA[parser]]></category>
		<category><![CDATA[Portuguese]]></category>
		<category><![CDATA[romance languages]]></category>
		<category><![CDATA[Spanish]]></category>
		<category><![CDATA[ubiquity]]></category>

		<guid isPermaLink="false">http://mitcho.com/blog/?p=2035</guid>
		<description><![CDATA[Yesterday I blogged on how to deal with portmanteau&#8217;ed prepositions in Ubiquity Parser 2, a common problem in various romance languages. Today I&#8217;ll propose an approach to another romantic problem. The problem: Weak pronouns in romance languages (as well as some other languages) have a special property where they cliticize to the verb, moving from [...]
Related posts:<ol>
<li><a href='http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/' rel='bookmark' title='Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions'>Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions</a></li>
<li><a href='http://mitcho.com/blog/observation/wheres-the-verb/' rel='bookmark' title='Where&#8217;s The Verb?'>Where&#8217;s The Verb?</a></li>
<li><a href='http://mitcho.com/blog/projects/this-week-on-ubiquity-parser-the-next-generation/' rel='bookmark' title='This week on Ubiquity Parser: The Next Generation'>This week on Ubiquity Parser: The Next Generation</a></li>
</ol>

Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<p><em>Yesterday I blogged on <a href="http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/">how to deal with portmanteau&#8217;ed prepositions in Ubiquity Parser 2</a>, a common problem in various romance languages. Today I&#8217;ll propose an approach to another romantic problem.</em></p>

<h3>The problem:</h3>

<p>Weak pronouns in <a href="http://en.wikipedia.org/wiki/romance languages">romance languages</a> (as well as some other languages) have a special property where they <em>cliticize</em> to the verb, moving from its regular argument position to a position next to the verb. For example, in French, we have an imperative like (1) with gloss as (2):</p>


<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
</pre></td><td class="code"><pre class="fr" style="font-family:monospace;">Envoyez  le  lettre à  Pierre!
send.IMP the letter to Pierre</pre></td></tr></table></div>


<p>If we replace <em>le lettre</em> or <em>à Pierre</em> with a preposition (<em>le</em>, &#8220;it&#8221;, or <em>lui</em>, &#8220;to him&#8221;, respectively), those weak pronouns move next to the verb—in particular, (5) exemplifies the change in word order. Replacing both arguments with prepositions creates the stacked clitic form of (7).<sup id="fnref:3"><a href="#fn:3" rel="footnote">1</a></sup></p>


<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>3
4
5
6
7
8
</pre></td><td class="code"><pre class="fr" style="font-family:monospace;">Envoyez-la à  Pierre!
send   -it to Pierre
Envoyez-lui la  lettre!
send   -him the letter
Envoyez-le-lui!
send   -it-him</pre></td></tr></table></div>


<p>The fact that these weak pronouns are attached to the verb and lack separate delimiters mean that we will need a separate mechanism to parse these arguments: indeed, this functionality has been planned in <a href="https://wiki.mozilla.org/Labs/Ubiquity/Parser_2">Ubiquity Parser 2</a> as &#8220;step 3&#8221;. Here I&#8217;ll examine some data and discuss a strategy for the parsing of weak pronouns.</p>

<p><span id="more-2035"></span></p>

<h3>Weak pronouns in Ubiquity</h3>

<p>In Ubiquity the only pronoun we currently deal with is the deictic <code>object</code>-role anaphor, like &#8220;it,&#8221; &#8220;this,&#8221; etc. in English.<sup id="fnref:1"><a href="#fn:1" rel="footnote">2</a></sup> In addition, as these weak pronoun clitics cannot by definition be embedded within a larger noun phrase, its referent would constitute the entire <code>object</code> argument. As such, it is most logical to place clitic handling before argument structure parsing and simply hand the argument parser the argument string without the clitic.</p>

<h3>Marking the clitic</h3>

<p>We can classify languages with cliticized weak pronouns into two cases based on their processing considerations: languages that overtly mark the clitic and those which do not.</p>

<h4>Languages which delimit the clitic</h4>

<p>Some languages such as French (see above) clearly mark the boundary between the verb and the clitic. It will be relatively easy to parse weak pronouns in such languages as we can simply <a href="http://ubiquity.mozilla.com/trac/ticket/665">insert a no-width space</a> between the verb and the clitic. A list of clitics can then be designated in the parser (much like anaphora are now) and these weak pronouns can be interpreted as the selection (or &#8220;this&#8221;-referent).<sup id="fnref:2"><a href="#fn:2" rel="footnote">3</a></sup></p>

<p><strong>Portuguese:</strong> (from <a href="http://email.eva.mpg.de/~cysouw/pdf/cysouwDGFS.pdf">Cysouw 2003</a>)</p>


<div class="wp_syntax"><div class="code"><pre class="pr" style="font-family:monospace;">Come-o
eat -it</pre></div></div>


<p><strong>Catalan:</strong> (from <a href="http://www.cau.cat/blog/">Toni Hermoso Pulido</a>)</p>


<div class="wp_syntax"><div class="code"><pre class="ca" style="font-family:monospace;">Cerca-ho
search-it</pre></div></div>


<p><strong>Modern Greek:</strong> (from <a href="http://aix1.uottawa.ca/~romlab/pubs/RiveroTerzi.1995.pdf">Rivero and Terzi 1995</a>; I know, I know, Greek&#8217;s not a romance language, but it has weak pronoun clitics too&#8230; it&#8217;s all good.)</p>

<p>Modern Greek actually inserts a space between the verb and weak pronouns.</p>


<div class="wp_syntax"><div class="code"><pre class="ca" style="font-family:monospace;">Diavase to
read   -it</pre></div></div>


<h4>Languages which do not delimit the clitic</h4>

<p>Some languages do not insert any delimiter between the verb and the weak pronoun, essentially entering them as a single word (in the string sense, at least). These cases may be more difficult to parse, especially as there may be sound changes to the verb stem itself.</p>

<p><strong>Italian:</strong> (first example from <a href="http://books.google.com/books?id=tnXJVbGpMfEC">Kayne 1994</a>)</p>

<p>Italian is a case where some verbs actually conjoin with the verb in imperatives, much like their prepositions which I noted yesterday have an elaborate system of portmanteau&#8217;ed forms.</p>


<div class="wp_syntax"><div class="code"><pre class="it" style="font-family:monospace;">Fallo
do-it
Mangialo
eat  -it</pre></div></div>


<p><strong>Spanish:</strong> (first example from <a href="http://aix1.uottawa.ca/~romlab/pubs/RiveroTerzi.1995.pdf">Rivero and Terzi 1995</a>, second from <a href="http://www.cau.cat/blog/">Toni Hermoso Pulido</a>)</p>

<p>Spanish is the same way:</p>


<div class="wp_syntax"><div class="code"><pre class="es" style="font-family:monospace;">Léelo
read-it
Búscalo
search-it</pre></div></div>


<h3>Displaying the suggestion</h3>

<p>The current Ubiquity handling of anaphora (aka &#8220;magic words&#8221;) involves a display of the selection (replacement) text in a stylized way. One problem with clitics may be how to visually present this replacement to the user.</p>

<p><center><img src="http://mitcho.com/blog/wp-content/uploads/2009/05/picture-11.png" alt="Picture 1.png" border="0" width="284" height="160" /></center></p>

<p>For languages with a delimiter such as French we could simply present the selection as an object right after the verb, without the hyphen.</p>

<table>
<tr><th>input:</th><td>traduisez-le (translate-it)</td></tr>
<tr><th>suggestion:</th><td>traduisez <span style='  padding: 2px;
  -moz-border-radius: 3px;
  display: inline-block;
  font-variant: small-caps;
  background-color: #BBB;
  color: #333;
  position: relative;
  top: -2px;
  font-size: 8pt;
  font-weight: normal;
  border: 1px solid #777;'>selection</span></td></tr>
</table>

<p>Things may be more complicated, however, in languages where the clitic is not delimited from the verb, or where the verb form itself has changed due to the attachment of the clitic.</p>

<h3>Conclusion</h3>

<p>In this blog post I&#8217;ve tried to lay out some of the weak pronoun phenomena relevant to Ubiquity with some ideas on how to implement its parsing. I believe parsing weak pronouns should be relatively straightforward in languages with delimiters—for those which do not have delimiters, some creativity may be required in how building regular expressions or rules to detect the clitics and in presenting these suggestions to the user.</p>

<p><strong>Does your language have weak pronoun clitics? What do you think will be the challenges in trying to parse these arguments?</strong></p>

<div class="footnotes">
<hr />
<ol>

<li id="fn:3">
<p>Note that the reverse order of &#8220;Envoyez-lui-le&#8221; is ungrammatical&#8230; fortunately we most likely will not have to deal with multiple clitics&#8230; see footnote two below.&#160;<a href="#fnref:3" rev="footnote">&#8617;</a></p>
</li>

<li id="fn:1">
<p>This is not so much an informed decision that we should not do different kinds of anaphors but simply that we haven&#8217;t gotten around to implementing it. I personally am not sure, however, whether there is a real need for parsing for anaphors for roles other than <code>object</code> (for example, French <em>lui</em> as seen above which would be a <code>goal</code> anaphor).&#160;<a href="#fnref:1" rev="footnote">&#8617;</a></p>
</li>

<li id="fn:2">
<p>There is, however, a question of whether weak pronoun replacement should be obligatory or not: that is, if we see a regular anaphor right now such as &#8220;this,&#8221; we make two copies of the parse: one with the replacement, one without. In the case where we detect an anaphor, should the replacement be obligatory? I believe it should be, though, as with many other Parser 2 features, I believe we can continue to parse other options with no replacement but let the scoring system kill those parses off. If a verb has a clitic attached to it but we do not remove it, it most likely will do very poorly in scoring anyway.&#160;<a href="#fnref:2" rev="footnote">&#8617;</a></p>
</li>

</ol>
</div>
<p>Related posts:</p><ol>
<li><a href='http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/' rel='bookmark' title='Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions'>Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions</a></li>
<li><a href='http://mitcho.com/blog/observation/wheres-the-verb/' rel='bookmark' title='Where&#8217;s The Verb?'>Where&#8217;s The Verb?</a></li>
<li><a href='http://mitcho.com/blog/projects/this-week-on-ubiquity-parser-the-next-generation/' rel='bookmark' title='This week on Ubiquity Parser: The Next Generation'>This week on Ubiquity Parser: The Next Generation</a></li>
</ol>
<p>Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://mitcho.com/blog/projects/solving-another-romantic-problem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solving a Romantic Problem: Portmanteau&#8217;ed Prepositions</title>
		<link>http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/</link>
		<comments>http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/#comments</comments>
		<pubDate>Mon, 11 May 2009 05:19:17 +0000</pubDate>
		<dc:creator>mitcho</dc:creator>
				<category><![CDATA[projects]]></category>
		<category><![CDATA[arguments]]></category>
		<category><![CDATA[Catalan]]></category>
		<category><![CDATA[French]]></category>
		<category><![CDATA[Italian]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[Mozilla Planet]]></category>
		<category><![CDATA[parser]]></category>
		<category><![CDATA[portmanteau]]></category>
		<category><![CDATA[romance languages]]></category>
		<category><![CDATA[ubiquity]]></category>

		<guid isPermaLink="false">http://mitcho.com/blog/?p=2019</guid>
		<description><![CDATA[The problem: In many romance languages, prepositions and articles often form portmanteau morphs, combining to form a single word.1 Some examples include (French) à + le > au, de + le > du, (Catalan) a + el > al, de + les > dels, per + el > pel. Italian has a particularly productive system [...]
Related posts:<ol>
<li><a href='http://mitcho.com/blog/observation/wheres-the-verb/' rel='bookmark' title='Where&#8217;s The Verb?'>Where&#8217;s The Verb?</a></li>
<li><a href='http://mitcho.com/blog/how-to/adding-your-language-to-ubiquity-parser-2/' rel='bookmark' title='Adding Your Language to Ubiquity Parser 2'>Adding Your Language to Ubiquity Parser 2</a></li>
<li><a href='http://mitcho.com/blog/observation/scoring-and-ranking-suggestions/' rel='bookmark' title='Scoring and Ranking Suggestions'>Scoring and Ranking Suggestions</a></li>
</ol>

Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<h3>The problem:</h3>

<p>In many <a href="http://en.wikipedia.org/wiki/romance languages">romance languages</a>, prepositions and articles often form <a href="http://en.wikipedia.org/wiki/portmanteau">portmanteau morphs</a>, combining to form a single word.<sup id="fnref:2"><a href="#fn:2" rel="footnote">1</a></sup> Some examples include (French) à + le > au, de + le > du, (Catalan) a + el > al, de + les > dels, per + el > pel. Italian has a particularly productive system of portmanteau&#8217;ed prepositions and articles&#8230; I refer you to the <a href="http://en.wikipedia.org/wiki/Contraction (grammar)#Italian">contraction</a> article on Wikipedia.</p>

<p>As I <a href="http://mitcho.com/blog/how-to/adding-your-language-to-ubiquity-parser-2/">noted a couple weeks ago</a>, however, some combinations do not form portmanteaus.<sup id="fnref:1"><a href="#fn:1" rel="footnote">2</a></sup></p>

<p><span id="more-2019"></span>
<strong>French:</strong></p>

<ol>
<li>à + le > au</li>
<li>à + la > à la</li>
</ol>

<p>The problem with this is that if we use both <em>à</em> and <em>au</em> as delimiters, we may end up passing the definite article to the verb as part of the argument in some cases, but not in other cases.</p>

<ol>
<li>&#8220;<strong>à</strong> la table&#8221; = &#8220;<strong>to</strong> the table&#8221;</li>
<li>&#8220;<strong>au</strong> chat&#8221; = &#8220;<strong>to the</strong> cat&#8221;</li>
</ol>

<h3>The solution:</h3>

<p>The solution is a new step in <a href="https://wiki.mozilla.org/Labs/Ubiquity/Parser_2">the Parser 2 process</a> which normalizes the form of arguments. Each language&#8217;s parser can now optionally define a <code>normalizeArgument()</code> method which takes an argument and returns a list of normalized alternates. Normalized arguments are returned in the form of <code>{prefix: '', newInput: '', suffix: ''}</code>. For example, if you feed &#8220;la table&#8221; to the French <code>normalizeArgument()</code>, it ought to return</p>


<div class="wp_syntax"><div class="code"><pre class="javascript" style="font-family:monospace;"><span style="color: #009900;">&#91;</span><span style="color: #009900;">&#123;</span>prefix<span style="color: #339933;">:</span> <span style="color: #3366CC;">'la '</span><span style="color: #339933;">,</span> newInput<span style="color: #339933;">:</span> <span style="color: #3366CC;">'table'</span><span style="color: #339933;">,</span> suffix<span style="color: #339933;">:</span> <span style="color: #3366CC;">''</span><span style="color: #009900;">&#125;</span><span style="color: #009900;">&#93;</span></pre></div></div>


<p>If there are no possible normalizations, <code>normalizeArgument()</code> should simply return <code>[]</code>. Each alternative returned by <code>normalizeArgument()</code> is substituted into a copy of the possible parses just before nountype detection. The prefixes and suffixes are stored in the argument (as <code>inactivePrefix</code> and <code>inactiveSuffix</code>) so they can be incorporated into the suggestion display.</p>

<p>Here, for example, is how the inactive prefix &#8220;l&#8217;&#8221; is displayed in <a href="chrome://parser-demo/content/index.html">the parser demo</a>. This way the user is told that the &#8220;l&#8217;&#8221; prefix is being ignored, and the nountype detection and verb action can act on the argument &#8220;English&#8221;.<sup id="fnref:3"><a href="#fn:3" rel="footnote">3</a></sup> (In the future, of course, we could teach this nountype to accept the Catalan &#8220;anglès&#8221;.)</p>

<p><center><img src="http://mitcho.com/blog/wp-content/uploads/2009/05/picture-1.png" alt="Picture 1.png" border="0" width="320" height="29" /></center></p>

<p>The easiest way to produce this output is to use the <a href="https://developer.mozilla.org/en/Core_JavaScript_1.5_Reference/Global_Objects/String/match"><code>String.match()</code></a> method. For example <code>normalizeArgument()</code> code, I refer you to the <a href="http://ubiquity.mozilla.com/hg/ubiquity-firefox/file/12f5d9abf011/ubiquity/modules/parser/new/ca.js">Catalan</a> and <a href="http://ubiquity.mozilla.com/hg/ubiquity-firefox/file/12f5d9abf011/ubiquity/modules/parser/new/fr.js">French</a> parser files.</p>

<p>I hope that this solution will help make Ubiquity with Parser 2 feel <a href="http://mitcho.com/blog/projects/how-natural-should-a-natural-interface-be/">more natural</a> for many romance languages.</p>

<div class="footnotes">
<hr />
<ol>

<li id="fn:2">
<p>Thanks to <a href="http://people.ucsc.edu/~jpobrien/">Jeremy O&#8217;Brien</a> for helping me figure out how to refer to this phenomenon.&#160;<a href="#fnref:2" rev="footnote">&#8617;</a></p>
</li>

<li id="fn:1">
<p>This also relates to the issue of <a href="http://ubiquity.mozilla.com/trac/ticket/671">parsing multi-word delimiters</a>, though the argument normalization strategy covered here should reduce the necessity of multi-word delimiters.&#160;<a href="#fnref:1" rev="footnote">&#8617;</a></p>
</li>

<li id="fn:3">
<p>Thank you to contributor <a href="http://www.cau.cat/blog/">Toni Hermoso Pulido</a> for our first attempt at a Catalan parser!&#160;<a href="#fnref:3" rev="footnote">&#8617;</a></p>
</li>

</ol>
</div>
<p>Related posts:</p><ol>
<li><a href='http://mitcho.com/blog/observation/wheres-the-verb/' rel='bookmark' title='Where&#8217;s The Verb?'>Where&#8217;s The Verb?</a></li>
<li><a href='http://mitcho.com/blog/how-to/adding-your-language-to-ubiquity-parser-2/' rel='bookmark' title='Adding Your Language to Ubiquity Parser 2'>Adding Your Language to Ubiquity Parser 2</a></li>
<li><a href='http://mitcho.com/blog/observation/scoring-and-ranking-suggestions/' rel='bookmark' title='Scoring and Ranking Suggestions'>Scoring and Ranking Suggestions</a></li>
</ol>
<p>Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://mitcho.com/blog/projects/solving-a-romantic-problem-portmanteaued-prepositions/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Ubiquity in Italian</title>
		<link>http://mitcho.com/blog/link/ubiquity-in-italian/</link>
		<comments>http://mitcho.com/blog/link/ubiquity-in-italian/#comments</comments>
		<pubDate>Mon, 09 Mar 2009 20:43:12 +0000</pubDate>
		<dc:creator>mitcho</dc:creator>
				<category><![CDATA[link]]></category>
		<category><![CDATA[Italian]]></category>
		<category><![CDATA[localization]]></category>
		<category><![CDATA[Mozilla Planet]]></category>
		<category><![CDATA[overlord verbs]]></category>
		<category><![CDATA[ubiquity]]></category>

		<guid isPermaLink="false">http://mitcho.com/blog/?p=1562</guid>
		<description><![CDATA[flod put up a nice blog post on thinking about Ubiquity in Italian. flod points out that what seems natural to him as a speaker is the use of the imperative form of the verbs, but that some verbs may not translate neatly, even following the overlord verbs proposal: For example, the verb “make” is [...]
Related posts:<ol>
<li><a href='http://mitcho.com/blog/link/ubiquity-in-portuguese/' rel='bookmark' title='Ubiquity in Portuguese'>Ubiquity in Portuguese</a></li>
<li><a href='http://mitcho.com/blog/projects/localizing-ubiquity-an-open-letter-to-linguists/' rel='bookmark' title='Localizing Ubiquity: an open letter to linguists'>Localizing Ubiquity: an open letter to linguists</a></li>
<li><a href='http://mitcho.com/blog/projects/ubiquity-in-firefox-japanese/' rel='bookmark' title='Ubiquity in Firefox: Focus on Japanese'>Ubiquity in Firefox: Focus on Japanese</a></li>
</ol>

Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<p>flod put up a nice blog post on <a href="http://www.yetanothertechblog.com/2009/03/08/thinking-ubiquity-in-italian/">thinking about Ubiquity in Italian</a>. flod points out that what seems natural to him as a speaker is the use of the imperative form of the verbs, but that some verbs may not translate neatly, even following the <a href="http://jonoscript.wordpress.com/2009/01/24/overlord-verbs-a-proposal/">overlord verbs proposal</a>:</p>

<blockquote>For example, the verb “make” is quite difficult to translate (too generic): “to make” could be “fare”, but “fare grassetto” (”make bold”) doesn’t make any sense, people would use more specific verbs:<br/><br/><ul><li>make bold ->  trasforma in grassetto (sounds like “change to bold”)</li><li>make page editable -> rendi pagina modificabile</li></ul></blockquote>

<p>This is a great point. Although the overlord verbs may naturally map into many languages, it may not be perfect for some commands in some languages. <strong>Where would English overlord verbs not translate well into your language?</strong></p>

<p>I suggest on flod&#8217;s blog that a &#8220;synonym&#8221; system could be implemented to map single verbs to specific overlord&#8217;ed functionality, but these would definitely have to be done on a language-specific basis, unfortunately adding a little work to the localization process.</p>
<p>Related posts:</p><ol>
<li><a href='http://mitcho.com/blog/link/ubiquity-in-portuguese/' rel='bookmark' title='Ubiquity in Portuguese'>Ubiquity in Portuguese</a></li>
<li><a href='http://mitcho.com/blog/projects/localizing-ubiquity-an-open-letter-to-linguists/' rel='bookmark' title='Localizing Ubiquity: an open letter to linguists'>Localizing Ubiquity: an open letter to linguists</a></li>
<li><a href='http://mitcho.com/blog/projects/ubiquity-in-firefox-japanese/' rel='bookmark' title='Ubiquity in Firefox: Focus on Japanese'>Ubiquity in Firefox: Focus on Japanese</a></li>
</ol>
<p>Related posts brought to you by <a href='http://yarpp.org'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://mitcho.com/blog/link/ubiquity-in-italian/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

