String Data Type Syntax 之CHAR、VARCHAR、BINARY、VARBINARY

阅读 76

2022-02-18


11.3.1 String Data Type Syntax


The string data types are CHAR, VARCHAR, BINARY, VARBINARY, BLOB, TEXT, ENUM, and SET.


字符串数据类型为CHAR,VARCHAR,BINARY,VARBINARY,BLOB,TEXT,ENUM和SET。


In some cases, MySQL may change a string column to a type different from that given in a CREATE TABLE or ALTER TABLE statement. See Section 13.1.18.6, “Silent Column Specification Changes”.


在某些情况下,MySQL可能会将字符串列更改为与CREATE TABLE或ALTER TABLE语句中给定的类型不同的类型。 请参见第13.1.18.6节。


For definitions of character string columns (CHAR, VARCHAR, and the TEXT types), MySQL interprets length specifications in character units. For definitions of binary string columns (BINARY, VARBINARY, and the BLOB types), MySQL interprets length specifications in byte units.


对于字符串列(CHAR,VARCHAR和TEXT类型)的定义,MySQL以字符单位解释长度规范。 对于二进制字符串列(BINARY,VARBINARY和BLOB类型)的定义,MySQL以字节为单位解释长度规范。


Column definitions for character string data types CHAR, VARCHAR, the TEXT types, ENUM, SET, and any synonyms) can specify the column character set and collation:


字符串数据类型CHAR,VARCHAR,TEXT类型,ENUM,SET和任何同义词的列定义)可以指定列字符集和排序规则


​CHARSET​​​ is a synonym for ​​CHARACTER SET​​.



Specifying the CHARACTER SET binary attribute for a character string data type causes the column to be created as the corresponding binary string data type: CHAR becomes BINARY, VARCHAR becomes VARBINARY, and TEXT becomes BLOB. For the ENUM and SET data types, this does not occur; they are created as declared. Suppose that you specify a table using this definition:


为字符串数据类型指定CHARACTER SET二进制属性会使该列创建为相应的二进制字符串数据类型:CHAR变为BINARY,VARCHAR变为VARBINARY,TEXT变为BLOB。 对于ENUM和SET数据类型,不会发生这种情况。 它们是按声明创建的。 假设您使用此定义指定一个表:

CREATE TABLE t_b_1
(
c1 VARCHAR(10) CHARACTER SET binary,
c2 TEXT CHARACTER SET binary,
c3 ENUM('a','b','c') CHARACTER SET binary
);

The resulting table has this definition:

String Data Type Syntax 之CHAR、VARCHAR、BINARY、VARBINARY_sql

11.3.2 The CHAR and VARCHAR Types


The CHAR and VARCHAR types are similar, but differ in the way they are stored and retrieved. They also differ in maximum length and in whether trailing spaces are retained.


CHAR和VARCHAR类型相似,但存储和检索方式不同。它们的最大长度以及是否保留尾随空格也不同。


The CHAR and VARCHAR types are declared with a length that indicates the maximum number of characters you want to store. For example, CHAR(30) can hold up to 30 characters.


声明的CHAR和VARCHAR类型的长度表示您要存储的最大字符数。例如,CHAR(30)最多可容纳30个字符。


The length of a CHAR column is fixed to the length that you declare when you create the table. The length can be any value from 0 to 255. When CHAR values are stored, they are right-padded with spaces to the specified length. When CHAR values are retrieved, trailing spaces are removed unless the PAD_CHAR_TO_FULL_LENGTH SQL mode is enabled.


CHAR列的长度固定为创建表时声明的长度。长度可以是0到255之间的任何值。存储CHAR值时,将在它们的右边填充空格以达到指定的长度。检索CHAR值后,除非启用了PAD_CHAR_TO_FULL_LENGTH SQL模式,否则将删除尾部空格。


Values in VARCHAR columns are variable-length strings. The length can be specified as a value from 0 to 65,535. The effective maximum length of a VARCHAR is subject to the maximum row size (65,535 bytes, which is shared among all columns) and the character set used. See Section 8.4.7, “Limits on Table Column Count and Row Size”.


 VARCHAR列中的值是可变长度的字符串。长度可以指定为0到65535之间的值。 VARCHAR的有效最大长度取决于最大行大小(65535字节,在所有列之间共享)和所使用的字符集。请参见第8.4.7节“表列数和行大小的限制”


In contrast to CHAR, VARCHAR values are stored as a 1-byte or 2-byte length prefix plus data. The length prefix indicates the number of bytes in the value. A column uses one length byte if values require no more than 255 bytes, two length bytes if values may require more than 255 bytes.


与CHAR相比,VARCHAR值存储为1字节或2字节长的前缀以及数据。长度前缀指示值中的字节数。如果值需要不超过255个字节,则列使用一个长度字节;如果值可能需要超过255个字节,则列使用两个长度字节。


If strict SQL mode is not enabled and you assign a value to a CHAR or VARCHAR column that exceeds the column's maximum length, the value is truncated to fit and a warning is generated. For truncation of nonspace characters, you can cause an error to occur (rather than a warning) and suppress insertion of the value by using strict SQL mode. See Section 5.1.10, “Server SQL Modes”.


如果未启用严格的SQL模式,并且您为CHAR或VARCHAR列分配的值超过了该列的最大长度,则该值将被截断以适合并生成警告。对于非空格字符的截断,可以通过使用严格的SQL模式引起错误(而不是警告)并抑制插入值。请参见第5.1.10节“服务器SQL模式”。


VARCHAR values are not padded when they are stored. Trailing spaces are retained when values are stored and retrieved, in conformance with standard SQL.


存储VARCHAR值时不会对其进行填充。根据标准SQL,在存储和检索值时保留尾随空格。


The following table illustrates the differences between CHAR and VARCHAR by showing the result of storing various string values into CHAR(4) and VARCHAR(4) columns (assuming that the column uses a single-byte character set such as latin1)


下表通过显示将各种字符串值存储到CHAR(4)和VARCHAR(4)列中的结果来说明CHAR和VARCHAR之间的区别(假设该列使用单字节字符集,如latin1)。

String Data Type Syntax 之CHAR、VARCHAR、BINARY、VARBINARY_字符串_02


The values shown as stored in the last row of the table apply only when not using strict SQL mode; if strict mode is enabled, values that exceed the column length are not stored, and an error results


显示在表的最后一行中的值仅在不使用严格的SQL模式时才适用。 如果启用了严格模式,则不会存储超过列长度的值,并且会导致错误


​InnoDB​​​ encodes fixed-length fields greater than or equal to 768 bytes in length as variable-length fields, which can be stored off-page. For example, a ​​CHAR(255)​​​ column can exceed 768 bytes if the maximum byte length of the character set is greater than 3, as it is with ​​utf8mb4​​.


InnoDB将长度大于或等于768字节的固定长度字段编码为可变长度字段,可以将其存储在页面外。 例如,如果字符集的最大字节长度大于3(如utf8mb4一样),则CHAR(255)列可以超过768个字节。


If a given value is stored into the CHAR(4) and VARCHAR(4) columns, the values retrieved from the columns are not always the same because trailing spaces are removed from CHAR columns upon retrieval. The following example illustrates this difference:


如果给定值存储在CHAR(4)和VARCHAR(4)列中,则从这些列检索的值并不总是相同的,因为检索时会从CHAR列中删除尾随空格。 以下示例说明了这种差异:

CREATE TABLE vc (v VARCHAR(4), c CHAR(4));

INSERT INTO vc VALUES ('ab ', 'ab ');

SELECT CONCAT('(', v, ')'), CONCAT('(', c, ')') FROM vc;

 String Data Type Syntax 之CHAR、VARCHAR、BINARY、VARBINARY_sql_03


Values in CHAR, VARCHAR, and TEXT columns are sorted and compared according to the character set collation assigned to the column.


CHAR,VARCHAR和TEXT列中的值根据分配给该列的字符集排序规则进行排序和比较。


All MySQL collations are of type PAD SPACE. This means that all CHAR, VARCHAR, and TEXT values are compared without regard to any trailing spaces. “Comparison” in this context does not include the LIKE pattern-matching operator, for which trailing spaces are significant. For example:


所有MySQL归类均为PAD SPACE类型。 这意味着将比较所有CHAR,VARCHAR和TEXT值,而不考虑任何尾随空格。 在这种情况下,“比较”不包括LIKE模式匹配运算符,对于后者,尾随空格很重要。 例如:

CREATE TABLE names (myname CHAR(10));

INSERT INTO names VALUES ('Jones');

SELECT myname = 'Jones', myname = 'Jones ' FROM names;

SELECT myname LIKE 'Jones', myname LIKE 'Jones ' FROM names;

String Data Type Syntax 之CHAR、VARCHAR、BINARY、VARBINARY_sql_04

This is not affected by the server SQL mode.


11.3.3 The BINARY and VARBINARY Types


The BINARY and VARBINARY types are similar to CHAR and VARCHAR, except that they store binary strings rather than nonbinary strings. That is, they store byte strings rather than character strings. This means they have the binary character set and collation, and comparison and sorting are based on the numeric values of the bytes in the values.


BINARY和VARBINARY类型类似于CHAR和VARCHAR,不同之处在于它们存储二进制字符串而不是非二进制字符串。即,它们存储字节字符串而不是字符串。这意味着它们具有二进制字符集和排序规则,并且比较和排序基于值中字节的数字值。


The permissible maximum length is the same for BINARY and VARBINARY as it is for CHAR and VARCHAR, except that the length for BINARY and VARBINARY is measured in bytes rather than characters.


BINARY和VARBINARY的最大允许长度与CHAR和VARCHAR相同,但BINARY和VARBINARY的长度以字节而不是字符为单位。


The BINARY and VARBINARY data types are distinct from the CHAR BINARY and VARCHAR BINARY data types. 


BINARY和VARBINARY数据类型不同于CHAR BINARY和VARCHAR BINARY数据类型。


If strict SQL mode is not enabled and you assign a value to a BINARY or VARBINARY column that exceeds the column's maximum length, the value is truncated to fit and a warning is generated. For cases of truncation, to cause an error to occur (rather than a warning) and suppress insertion of the value, use strict SQL mode. See Section 5.1.10, “Server SQL Modes”.


如果未启用严格的SQL模式,并且您为BINARY或VARBINARY列分配的值超过了该列的最大长度,则该值将被截断以适合并生成警告。对于截断的情况,要导致发生错误(而不是警告)并抑制该值的插入,请使用严格的SQL模式。请参见第5.1.10节“服务器SQL模式”。


When BINARY values are stored, they are right-padded with the pad value to the specified length. The pad value is 0x00 (the zero byte). Values are right-padded with 0x00 for inserts, and no trailing bytes are removed for retrievals. All bytes are significant in comparisons, including ORDER BY and DISTINCT operations. 0x00 and space differ in comparisons, with 0x00 sorting before space.


存储BINARY值时,将使用填充值将其右填充到指定的长度。填充值为0x00(零字节)。值用0x00右填充以进行插入,并且不会删除尾随字节以进行检索。所有字节在比较中都是有效的,包括ORDER BY和DISTINCT操作。比较中0x00和空格有所不同,其中0x00在空格之前排序。


Example: For a BINARY(3) column, 'a ' becomes 'a \0' when inserted. 'a\0' becomes 'a\0\0' when inserted. Both inserted values remain unchanged for retrievals.


示例:对于BINARY(3)列,插入时'a'变为'a \ 0'。插入时,“ a \ 0”变为“ a \ 0 \ 0”。两个插入的值均保持不变以进行检索。


For VARBINARY, there is no padding for inserts and no bytes are stripped for retrievals. All bytes are significant in comparisons, including ORDER BY and DISTINCT operations. 0x00 and space differ in comparisons, with 0x00 sorting before space.


对于VARBINARY,没有用于插入的填充,也没有剥离任何字节以进行检索。所有字节在比较中都是有效的,包括ORDER BY和DISTINCT操作。比较中0x00和空格有所不同,其中0x00在空格之前排序。


For those cases where trailing pad bytes are stripped or comparisons ignore them, if a column has an index that requires unique values, inserting values into the column that differ only in number of trailing pad bytes results in a duplicate-key error. For example, if a table contains 'a', an attempt to store 'a\0' causes a duplicate-key error.


对于那些删除尾部填充字节或比较将其忽略的情况,如果一列具有要求唯一值的索引,则将仅尾部填充字节数不同的值插入该列会导致重复键错误。例如,如果表包含“ a”,则尝试存储“ a \ 0”会导致重复键错误


You should consider the preceding padding and stripping characteristics carefully if you plan to use the BINARY data type for storing binary data and you require that the value retrieved be exactly the same as the value stored. The following example illustrates how 0x00-padding of BINARY values affects column value comparisons


如果计划使用BINARY数据类型存储二进制数据,并且要求检索的值与存储的值完全相同,则应仔细考虑上述填充和剥离特性。下面的示例说明BINARY值的0x00填充如何影响列值比较:

mysql> CREATE TABLE t (c BINARY(3));
Query OK, 0 rows affected (0.01 sec)

mysql> INSERT INTO t SET c = 'a';
Query OK, 1 row affected (0.01 sec)

mysql> SELECT HEX(c), c = 'a', c = 'a\0\0' from t;
+--------+---------+-------------+
| HEX(c) | c = 'a' | c = 'a\0\0' |
+--------+---------+-------------+
| 610000 | 0 | 1 |
+--------+---------+-------------+
1 row in set (0.09 sec)


 If the value retrieved must be the same as the value specified for storage with no padding, it might be preferable to use VARBINARY or one of the BLOB data types instead


如果检索到的值必须与为存储指定的值相同且没有填充,则最好使用VARBINARY或BLOB数据类型之一





精彩评论(0)

0 0 举报