unicode-trojan
The named text file contains a Unicode codepoint that has been identified as a potential security risk.
There are two distinct attack vectors. One is homoglyphs in which text looks confusingly similar to what a reader might expects, but is actually different. The second is birectional attacks, in which the rendered text hides potentially malicious characters.
Here are the relevant codepoints:
- ARABIC LETTER MARK (
U+061C
) - LEFT-TO-RIGHT MARK (
U+200E
) - RIGHT-TO-LEFT MARK (
U+200F
) - LEFT-TO-RIGHT EMBEDDING (
U+202A
) - RIGHT-TO-LEFT EMBEDDING (
U+202B
) - POP DIRECTIONAL FORMATTING (
U+202C
) - LEFT-TO-RIGHT OVERRIDE (
U+202D
) - RIGHT-TO-LEFT OVERRIDE (
U+202E
) - LEFT-TO-RIGHT ISOLATE (
U+2066
) - RIGHT-TO-LEFT ISOLATE (
U+2067
) - FIRST STRONG ISOLATE (
U+2068
) - POP DIRECTIONAL ISOLATE (
U+2069
)
You can also run a similar check in your shell with that command:
grep -r $'[\u061C\u200E\u200F\u202A\u202B\u202C\u202D\u202E\u2066\u2067\u2068\u2069]'
The registered vulnerabilities are CVE-2021-42694 ("Homoglyph") and CVE-2021-42574 ("Bidirectional Attack").
Severity: | pedantic |
Experimental: | true |
See also
- list of all the affected packages
- the source of this tag