了解11种常见的字符编码格式及其使用场景，让数据传输不再出现问题

🧑‍💻作者名称：DaenCode 🎤作者简介：啥技术都喜欢捣鼓捣鼓，喜欢分享技术、经验、生活。 😎人生感悟：尝尽人生百味，方知世间冷暖。

2025-08-30 05:06:55

admin

🧑‍💻作者名称：DaenCode 🎤作者简介：啥技术都喜欢捣鼓捣鼓，喜欢分享技术、经验、生活。 😎人生感悟：尝尽人生百味，方知世间冷暖。

前言

在有一次SDK开发中，方法间数据传输出现了问题，由于返回数据采用的数据编码格式不一致会导致数据长度发生变化。因此整理出常用的字符编码格式，减少出现问题的概率。

文章目录

前言🌟UTF-8编码🌟UTF-16编码🌟GBK编码🌟UTF-32编码🌟Base64编码🌟ASCII编码🌟ISO-8859-1编码🌟URL编码🌟HTML编码🌟Unicode编码🌟GBK编码🌟使用场景🌟写在最后

🌟UTF-8编码

UTF-8 使用变长编码方式。最常用的字符编码之一，支持全球范围内的字符表示。每个字符的长度可以从 1 到 4 个字节不等。

代码示例

// 编码

String text = "Hello, world!";

byte[] encodedBytes = text.getBytes(StandardCharsets.UTF_8);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.UTF_8);

System.out.println(decodedText); // Hello, world!

🌟UTF-16编码

定长编码方式，用于表示 Unicode 字符集。在 Java 中，字符串的内部表示采用 UTF-16 编码。

代码示例

// 编码

String text = "你好，世界！";

byte[] encodedBytes = text.getBytes(StandardCharsets.UTF_16);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.UTF_16);

System.out.println(decodedText);

🌟GBK编码

双字节编码方式。中文字符集编码，支持汉字和其他中文字符。

代码示例

// 编码

String text = "你好，世界！";

byte[] encodedBytes = text.getBytes("GBK");

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, "GBK");

System.out.println(decodedText);

🌟UTF-32编码

定长编码方式，用于表示 Unicode 字符集。每个字符占用固定的 4 个字节空间，适用于对字符随机访问或精确控制字符长度的需求。

代码示例

// 编码

String text = "你好，世界！";

byte[] encodedBytes = text.getBytes(StandardCharsets.UTF_32);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.UTF_32);

System.out.println(decodedText);

🌟Base64编码

用于将二进制数据转换成可打印字符的编码方式

代码示例

// 编码

String text = "Hello, world!";

byte[] encodedBytes = Base64.getEncoder().encode(text.getBytes());

System.out.println(new String(encodedBytes)); // SGVsbG8sIHdvcmxkIQ==

// 解码

byte[] decodedBytes = Base64.getDecoder().decode(encodedBytes);

String decodedText = new String(decodedBytes);

System.out.println(decodedText); // Hello, world!

🌟ASCII编码

最早的字符编码方式，用于表示基本的英文字母、数字和特殊字符。ASCII 编码使用一个字节表示一个字符，范围为 0-127。

代码示例

// 编码

String text = "Hello, world!";

byte[] encodedBytes = text.getBytes(StandardCharsets.US_ASCII);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.US_ASCII);

System.out.println(decodedText);

🌟ISO-8859-1编码

也称为Latin-1编码，用于表示西欧语言字符集。ISO-8859-1 使用一个字节表示一个字符，范围为 0-255。

代码示例

// 编码

String text = "Hello, world!";

byte[] encodedBytes = text.getBytes(StandardCharsets.ISO_8859_1);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.ISO_8859_1);

System.out.println(decodedText);

🌟URL编码

用于在 URL 中表示特殊字符和非 ASCII 字符。通过将特殊字符转换为 %xx 的形式，其中 xx 是字符的 ASCII 值的十六进制表示。

代码示例

// 编码

String text = "Hello, world!";

String encodedText = URLEncoder.encode(text, StandardCharsets.UTF_8.toString());

System.out.println(encodedText); // Hello%2C+world%21

// 解码

String decodedText = URLDecoder.decode(encodedText, StandardCharsets.UTF_8.toString());

System.out.println(decodedText); // Hello, world!

🌟HTML编码

用于在 HTML 中表示特殊字符和保留字符。将特殊字符转换为 &entity; 的形式，其中 entity 是特定字符的名称或编号。

代码示例

import org.apache.commons.text.StringEscapeUtils;

// 编码

String text = "

Hello, world!

String encodedText = StringEscapeUtils.escapeHtml4(text);

System.out.println(encodedText); // <h1>Hello, world!</h1>

// 解码

String decodedText = StringEscapeUtils.unescapeHtml4(encodedText);

System.out.println(decodedText); //

Hello, world!

🌟Unicode编码

Unicode 是一种字符集，定义了字符与码点之间的映射关系。在 Java 中，字符串使用 UTF-16 编码表示 Unicode 字符。

代码示例

// 编码

String text = "你好，世界！";

byte[] encodedBytes = text.getBytes(StandardCharsets.UTF_16);

System.out.println(Arrays.toString(encodedBytes));

// 解码

String decodedText = new String(encodedBytes, StandardCharsets.UTF_16);

System.out.println(decodedText);

🌟GBK编码

Hex 编码是一种将字节数据转换为十六进制字符串的编码方式。它将每个字节转换为两个十六进制字符，从而以可读的方式表示二进制数据。

代码示例

import javax.xml.bind.DatatypeConverter;

// 编码

String text = "Hello, world!";

byte[] bytes = text.getBytes();

String encodedText = DatatypeConverter.printHexBinary(bytes);

System.out.println(encodedText); // 48656C6C6F2C20776F726C6421

// 解码

byte[] decodedBytes = DatatypeConverter.parseHexBinary(encodedText);

String decodedText = new String(decodedBytes);

System.out.println(decodedText); // Hello, world!

🌟使用场景

编码方式使用场景UTF-8编码互联网文本传输和存储，多语言环境和国际化应用UTF-16编码Windows操作系统，Java编程语言，多语言字符文本处理GBK编码中文汉字编码和处理，中国大陆和台湾地区常见UTF-32编码精确表示所有Unicode字符，不常用于存储空间敏感的场景Base64编码二进制数据传输和存储，例如在电子邮件中传输附件ASCII编码英文字符编码，常用于文本传输、数据存储、编程和键盘输入等场景。ISO-8859-1编码常用于西欧语言环境下的文本处理和传输。URL编码将URL中的非ASCII字符转换为%xx形式，保证传输和处理的正确性HTML实体编码将HTML中的特殊字符转换为实体引用，避免与HTML标记冲突，常用于Web开发和网页显示中。Unicode编码对全球范围字符的唯一标识，字符转换和互通的基础，用于处理多语言字符文本和实现Unicode字符的精确表示Hex编码二进制数据转换为十六进制字符串，常用于调试和数据处理

🌟写在最后

以上就是11种常见的字符编码格式。文中若有错误或遗漏，望大家在评论区及时提出指正以及补充。感谢大家的阅读与指正。

了解11种常见的字符编码格式及其使用场景，让数据传输不再出现问题

Hello, world!

Hello, world!

乒乓球世界杯直播_韩国世界杯最好成绩 - jiangnanyo.com

造价油水

cf所有角色一览表,CF角色大全一览，全面解析！

揭秘Git与SVN：效率与传统的碰撞，哪个才是你的最佳选择？

热血传奇开天哪里获得，传奇中的开天在哪可以打到

赛车：喷射

为什么我控制阴茎自己动左右左右

馄饨怎么包？图解馄饨的10种经典包法

您的位置：网游>开测表>灵山奇缘

空心菜需要切断吗，炒空心菜几分钟熟

5磅等于多少公克？

安装burpsuite

乐Pro3骁龙821怎么样？乐视乐Pro 3首发深度评测

404 Not Found