Terminal renders Unicode emoji skin tone modifiers as invalid character escape sequences

Originator: mxn
Number: rdar://20590060
Date Originated: 17-Apr-2015
Status: Open
Resolved:
Product: OS X
Product Version:
Classification: Other Bug
Reproducible: Always
 
Summary:
If you paste a composed Unicode character sequence such as “👊🏼” (U+1F44A U+1F3FC) into a shell, Terminal renders it as “👊\U+1F3FC” rather than “👊🏼”. Each character of the escape-like text is a separate glyph that you can select individually, for example the “\” or the “+”.

Steps to Reproduce:
1. Copy the following composed character sequence: 👊🏼.
2. Paste it into a Terminal shell.

Expected Results:
Either “👊🏼”, or “👊” followed by “🏼” (a tan-colored swatch).

Actual Results:
“👊\U+1F3FC”. The buffer contains U+1F44A U+1F3FC as expected.
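
One way to confirm what the buffer actually holds is to dump the code points and UTF-8 bytes of the pasted text. The following is a minimal Python sketch and not part of the original report; the helper names `describe` and `utf8_hex` are hypothetical, and the sample string is built from the two code points named above rather than read from Terminal.

```python
import unicodedata


def describe(text):
    """Return a (U+XXXX label, Unicode name) pair for each code point in text."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


def utf8_hex(text):
    """Return the UTF-8 encoding of text as space-separated hex bytes."""
    return " ".join(f"{b:02x}" for b in text.encode("utf-8"))


# The sequence from this report: a base emoji followed by a skin tone modifier.
# (Assumption: this stands in for whatever was pasted into the shell.)
sample = "\U0001F44A\U0001F3FC"

for label, name in describe(sample):
    print(label, name)

print(utf8_hex(sample))
```

If the output lists exactly two code points, U+1F44A and U+1F3FC, the input is intact and the “\U+1F3FC” text is purely a rendering artifact.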

Version:
OS X 10.10.3 (14D131)

Notes:
“\U+1F3FC” is not a valid character escape sequence in Bash or the Swift CLI, which is where I first discovered this behavior.

Configuration:


Attachments:
