GB/T 13000-2025 Information technology—Universal coded character set(UCS)
GB/T 13000-2025 Information technology—Universal coded character set(UCS)
Basic Information
Scope
This document: — Defines the architecture of UCS; — Defines the terms used in UCS; — Describes the overall structure of the UCS code space; — Specifies the assigned planes of UCS: the Basic Multilingual Plane (BMP), the Supplementary Multilingual Plane (SMP), the Supplementary Ideographic Plane (SIP), the Third Ideographic Plane (TIP), and the Supplementary Special-Purpose Plane (SSP); — Defines the graphic character sets used for the writing forms of various languages worldwide; — Specifies the names and encoding representations of graphic and format characters in the UCS BMP, SMP, SIP, TIP, and SSP; — Specifies the encoding representations of control characters and special characters; — Specifies three encoding forms of UCS: UTF-8, UTF-16, and UTF-32; — Specifies seven encoding schemes of UCS: UTF-8, UTF-16, UTF-16BE, UTF-16LE, UTF-32, UTF-32BE, and UTF-32LE; — Specifies the management methods for future additional encoded characters. This document is applicable to technical products that support the informatization processing and exchange of graphic characters of various languages worldwide. Note: This document does not specify whether the encoded characters are suitable for use as identifiers in programming languages. Appendix A provides a reference document on characters suitable for use as identifiers.