<HTML>
<HEAD><TITLE>Tis2Utf8 converter</TITLE></HEAD>
<BODY BGCOLOR=WHITE>
<H1>Tis2Utf8 encoding converter</H1>

<P>
The standard for Thai encoding character is TIS-620. This standard was
defined by <A HREF="http://www.tisi.go.th/">Thai Industrial Standard
Institute</A>. In my opinion, TIS-620
is convenient when inputing Thai characters from keyboard because its
encoding is 8-bit encoding. To display TIS-620 document, conflict with
ISO-8859-X (Latin) occurs. Consider a situation when you want
to put Latin characters and Thai characters in the same file using
TIS-620 encoding, it is impossible. To make this possible, you should
use UTF-8 encoding or other alternative encodings such as UTF-7, UTF-16, etc.
</P>  

<P>
This service will convert plain text with TIS-620 encoding to UTF-8
encoding. The UTF-8 encoding is a way to encode <A
HREF="http://www.unicode.org">unicode</A> characters. This
encoding is considered to be suitable when you work on multi-lingual
document. Many browsers are supporting this encoding. For Netscape
Communicator example, you
can choose UTF-8 encoding from menu "view | encoding |
Unicode(UTF-8)".
</P>

<H3>Viewing UTF-8 file from browser</H3>
I use IE5 to view UTF-8 file that has Thai language inside. You may need Thai fonts (unicode) which can download freely from <A HREF="http://www.microsoft.com/windows/software/localize/thai2.htm">Microsoft site</A>. For general file, I think MS Word can read and write this encoding. I didn't try it yet. If you did it, please tell me too. If someone knows tool to convert Japanese encoding to utf8, please tell me too.

<H3>Resources</H3>
<UL>
<LI><A HREF="http://software.thai.net/tis-620/index.html">TIS-620 campaign</A>
<LI><a href="http://www.sunsite.auc.dk/RFC/rfc/rfc2279.html">RFC 2279</a>
</UL>

<H3>Source</H3>
Program was written in Perl module style, so it can be easily reuse. I plan to make Thai module for perl and this will be one of module's method.
<UL>
<LI><A HREF="tis2utf8.pl">tis2utf8.pl</A>
</UL>
<H3>Acknowledgment</H3>
Source code was discussed in <A HREF="mailto:tis620-cp@thai.com">tis620-cp@thai.com</A>. Thank you to members of tis620-cp who gave valuable comments.

<p>
<hr noshade width=80%>
<Form method="POST" action="/ZzzThai/cgi-bin/tis2utf8.cgi"
 ENCTYPE="multipart/form-data">

<P>File to convert: <INPUT NAME="userfile" TYPE="file" value=""></P>
<P>Return as <input type="radio" name="return_type" value="text/html"> html <input type="radio" name="return_type" value="text/plain" checked> plain-text
</P>

<P><Input type="submit" value="convert"></P>
</Form>
<HR NOSHADE>
<P ALIGN=CENTER>$Id: tis2utf8.html,v 1.1 1999/05/15 13:56:56 poon-v Exp poon-v $</P>


</BODY>
</HTML>
