JAVA15 文本块
背景
摘要
文本块是一种多行字符串字面形式,可避免使用大多数转义序列,以可预测的方式自动格式化字符串,并在需要时让开发人员控制格式。
历史
- JEP 355 第一次预览(JDK 13) 提出
- JEP 368 第二次预览(JDK 14), 两个新的转义序列
- JEP 378 正式发布(JDK 15)
不足
- 不直接支持字符串插值( interpolation ), 见 String::formatted
- 不支持原始字符串
详细
结构定义
来源1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63TextBlock:
""" {TextBlockWhiteSpace} LineTerminator {TextBlockCharacter} """
TextBlockWhiteSpace:
WhiteSpace but not LineTerminator
TextBlockCharacter:
InputCharacter but not \
EscapeSequence
LineTerminator
WhiteSpace:
the ASCII SP character, also known as "space"
the ASCII HT character, also known as "horizontal tab"
the ASCII FF character, also known as "form feed"
LineTerminator:
the ASCII LF character, also known as "newline"
the ASCII CR character, also known as "return"
the ASCII CR character followed by the ASCII LF character
InputCharacter:
UnicodeInputCharacter but not CR or LF
UnicodeInputCharacter:
UnicodeEscape
RawInputCharacter
UnicodeEscape:
\UnicodeMarker HexDigit HexDigit HexDigit HexDigit
UnicodeMarker:
u {u}
RawInputCharacter:
any Unicode character
EscapeSequence:
\b (backspace BS, Unicode \u0008)
\s (space SP, Unicode \u0020)
\t (horizontal tab HT, Unicode \u0009)
\n (linefeed LF, Unicode \u000a)
\f (form feed FF, Unicode \u000c)
\r (carriage return CR, Unicode \u000d)
\LineTerminator (line continuation, no Unicode representation)
\" (double quote ", Unicode \u0022)
\' (single quote ', Unicode \u0027)
\\ (backslash \, Unicode \u005c)
OctalEscape (octal value, Unicode \u0000 to \u00ff)
OctalEscape:
\OctalDigit
\OctalDigit OctalDigit
\ZeroToThree OctalDigit OctalDigit
OctalDigit:
(one of)
0 1 2 3 4 5 6 7
ZeroToThree:
(one of)
0 1 2 3
测试用例
1 | package com.pancc.up.jdks; |
编译器处理步骤
- 行结束符被翻译为 LF (\u000A)
- 最大努力的删除左右缩进使用的空白字符
- 对内容中的 Escape 序列进行解释, 如: \s
补充
编辑器换行符使用 LF
- IntelliJ IDEA: Settings -> Editor -> Code Style ==> Line separator(Unix and MacOS(\n))
- Notepad++: 设置 -> 首选项 -> 新建 ==> 格式(行尾) (Unix(LF))
一些实用的方法
- String::stripIndent():用于从文本块内容中剥离附带的空白空间
- String::translateEscapes():用于翻译转义序列
- String::formatted(Object… args):简化文本块中的值替换