JS中文字符串转换unicode编码函数

当我们需要用href 传递中文参数时我们可以用js的unicode函数对汉字进行转码如我们使用百度的时候汉字条件就会被编码

http://www.baidu.com/s?wd=%BA%C3%B5%C4%BA%DC

AJAX使用GET请求时传递中文字符串时也必须把中文字符串编码成unicode,一般会用到JS的自带函数escape().不过找到了更好的函数来确决中文字符转换成unicode编码的函数如下

function uniencode(text)
{
text = escape(text.toString()).replace(/\+/g, "%2B");
var matches = text.match(/(%([0-9A-F]{2}))/gi);
if (matches)
{
for (var matchid = 0; matchid < matches.length; matchid++)
{
var code = matches[matchid].substring(1,3);
if (parseInt(code, 16) >= 128)
{
text = text.replace(matches[matchid], '%u00' + code);
}
}
}
text = text.replace('%25', '%u0025');
return text;
}

当然如果你需要在服务器端得到utf-8码可以使用如下函数

function convert_int_to_utf8($intval)
{
$intval=intval($intval);
switch ($intval)
{
// 1 byte, 7 bits
case 0:
return chr(0);
case ($intval&0x7F):
return chr($intval);
// 2 bytes, 11 bits
case ($intval&0x7FF):
return chr(0xC0|(($intval>>6)&0x1F)).
chr(0x80|($intval&0x3F));
// 3 bytes, 16 bits
case ($intval&0xFFFF):
return chr(0xE0|(($intval>>12)&0x0F)).
chr(0x80|(($intval>>6)&0x3F)).
chr (0x80|($intval&0x3F));
// 4 bytes, 21 bits
case ($intval&0x1FFFFF):
return chr(0xF0|($intval>>18)).
chr(0x80|(($intval>>12)&0x3F)).
chr(0x80|(($intval>>6)&0x3F)).
chr(0x80|($intval&0x3F));
}
}
如果你使用的是如java等后台语言的话转码将更加简单了！！！

前几天，遇到一个问题，就是在浏览器地址栏传递中文时，出现乱码，考虑了一下，解决方式有很多，我还是采用了转换编码的方式，将中文转换为Unicode编码，然后再解码成中文，以下是实现的过程，非常简单！
package cy.code;

public class CyEncoder {
private String zhStr; //中文字符串
private String unicode;//将中文字符串转换为Unicode编码存储在这个属性上。

public CyEncoder(String zhStr){
this.zhStr = zhStr;
}

public String getZhStr() {
return zhStr;
}

public void setZhStr(String zhStr) {
this.zhStr = zhStr;
}

public String toUnicode(){
StringBuffer unicode = new StringBuffer();
for(int i=0; i<zhStr.length();i++){
char c = zhStr.charAt(i);
unicode.append("\\u" + Integer.toHexString(c));
}
this.unicode = unicode.toString();
return unicode.toString();
}
public String tozhCN(){
StringBuffer gbk = new StringBuffer();
String[] hex = unicode.split("\\\\u"); // 妈的，分割让我想了半天！！不是"\\u"，而是 "\\\\u"
for(int i=1;i<hex.length;i++){ // 注意要从 1 开始，而不是从0开始。第一个是空。
int data = Integer.parseInt(hex[i],16); // 将16进制数转换为 10进制的数据。
gbk.append((char)data); // 强制转换为char类型就是我们的中文字符了。
}
System.out.println("这是从 Unicode编码转换为中文字符了: " +gbk.toString());
return gbk.toString();
}
public static void main(String args[]){

CyEncoder fc = new CyEncoder("为布局发的说法");
System.out.println(fc.toUnicode());
fc.tozhCN();
}
}

posted on 2011-04-10 11:39 jiezzy 阅读(17386) 评论(1) 收藏举报

刷新页面返回顶部

JS中文字符串转换unicode编码函数

导航

公告