I can try, but I'm not sure I will be able to reproduce this. I will be online tomorrow with the data at hand. <br><br>I must add that after that last email, I updated to the latest beta in order to install zap, and my UTF-8 went completely haywire, so reproducing the problem there may be somewhat difficult. This is with cut-and-paste UTF-8. I will try to reproduce some of these problems as well.<br><br>Lastly, I think what I need is a $ROSPattern that goes from UTF-8 to the &#nnn; form, as the latter is working fine for Lucene (i.e., the Saxon preprocessing for Lucene does not, for some reason, accept extended characters as-is as valid XML). The dev on the project tells me that XHTML should validate, i.e., that it is more correct to label XHTML as XML than as HTML. This makes sense, since a tag like <scholium>some text</scholium> (an actual example of my tags) is invalid HTML, but valid XML and valid XHTML.<br><br>Thanks,
Seth<br><br><b><i>"Patrick R. Michaud" <pmichaud@pobox.com></i></b> wrote:<blockquote class="replbq" style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px;">  On Mon, Feb 12, 2007 at 04:43:44AM -0800, Seth Cherney wrote:<br>> I am trying to include utf-8 Greek letters, letters with diacritics, <br>> and diacritics only characters.  Using the cut and paste method <br>> works fairly well. However, the result is that, upon<br>> save, some of the characters are saved in the format &#935; <br>> others are just saved as the letter. <br><br>First I should note that the PmWiki core doesn't do any sort<br>of conversions like this -- it all takes place in the browser.<br>All PmWiki ever sees is the &#935; form (and it dutifully <br>records it that way, assuming that the author intended it to<br>be that way.)<br><br>Still, we could probably come up with a $ROSPattern that would<br>automatically convert &#nnn; into their utf-8
counterpart --<br>would that work?<br><br>I've created a page on pmwiki.org where we can analyze this<br>a bit further -- could you add some text showing the problems<br>you're encountering there?<br><br> http://www.pmwiki.org/wiki/UTF-8/GreekDiacritics<br><br>Thanks,<br><br>Pm<br><br></blockquote><br><p> 
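For reference, the two conversions discussed above (UTF-8 text to &#nnn; references, and back) can be sketched outside PmWiki. This is only an illustration in Python, not a $ROSPattern and not anything PmWiki itself does; the function names are hypothetical. It assumes decimal references only and replaces each code point separately, so a combining diacritic gets its own reference.

```python
import re

def to_numeric_refs(text: str) -> str:
    """Replace every non-ASCII character with a decimal &#nnn; reference."""
    # The built-in "xmlcharrefreplace" error handler does exactly this.
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")

def from_numeric_refs(text: str) -> str:
    """Replace decimal &#nnn; references with the characters they name."""
    # Assumption: only decimal references appear; hex forms are left alone.
    return re.sub(r"&#(\d+);", lambda m: chr(int(m.group(1))), text)

if __name__ == "__main__":
    sample = "\u03a7 and a\u0301"          # Greek chi, then "a" + combining acute
    encoded = to_numeric_refs(sample)
    print(encoded)
    print(from_numeric_refs(encoded) == sample)
```

The first direction is the one that would suit the Lucene/Saxon pipeline described above, since the output is plain ASCII; the second is the direction Pm proposed.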