DarkMatter in Cyberspace
  • Home
  • Categories
  • Tags
  • Archives

汉字编码查询的Java实现


Java代码

package encoding; import java.io.UnsupportedEncodingException; import java.util.ArrayList; import java.util.List; public class EncodingProbe { public static void main(String[] args) throws UnsupportedEncodingException { String name = "联通移动"; List charsets = new ArrayList(); charsets.add("gbk"); charsets.add("gb18030"); charsets.add("unicode"); charsets.add("utf-8"); for (String charset : charsets) { System.out.println("bytes of " + name + " in charset " + charset + " is: " + bytes2HexString(name.getBytes(charset))); } } public static String bytes2HexString(byte[] b) { String ret = ""; for (int i = 0; i < b.length; i++) { String hex = Integer.toHexString(b[i] & 0xFF); if (hex.length() == 1) { hex = '0' + hex; } ret += hex; } return ret; } }

Windows平台上的其他方法

用UltraEdit,保存文件时可以选择编码格式;

Linux平台上的其他方法

file指令配合iconv指令。



Published

Dec 18, 2012

Last Updated

Dec 18, 2012

Category

Tech

Tags

  • 编码 6
  • charset 4
  • Java 106
  • 汉字 3

Contact

  • Powered by Pelican. Theme: Elegant by Talha Mansoor