search.twitter.com breaks with Unicode characters outside the BMP.
Unicode characters outside the Basic Multilingual Plane (BMP) display correctly on twitter.com but not on search.twitter.com. For example, the message
𐑯𐑬 𐑲 𐑣𐑩𐑝 𐑳 𐑕𐑒𐑦𐑥 𐑑 𐑑𐑲𐑐 𐑦𐑯 𐑞 ·𐑖𐑱𐑝𐑾𐑯 𐑨𐑤𐑓𐑩𐑚𐑧𐑑
("Now I have a SCIM to type in the Shavian alphabet" in the Shavian alphabet) appears correctly at http://twitter.com/marnanel/status/10... but on http://search.twitter.com/search?q=ma... it appears as a string of nonsense characters:
ѯѬ Ѳ ѣѩѝ ѳ ѕђѦѥ ё ёѲѐ Ѧѯ ў ·іѱѝѾѯ ѨѤѓѩњѧё
-
These problems also apply to this tweet, which mixes non-BMP and BMP characters:
http://twitter.com/marnanel/statuses/...
This is perhaps easier to debug.
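
For anyone trying to reproduce this, here is a minimal Python sketch that lists the non-BMP characters in a message. The second helper simulates one possible failure mode, keeping only the low 16 bits of each code point. That cause is only a guess and is not confirmed anywhere in this thread, and both function names are made up for illustration.

```python
def non_bmp_chars(text):
    """Return (character, code point) pairs for characters above U+FFFF."""
    return [(ch, ord(ch)) for ch in text if ord(ch) > 0xFFFF]


def truncate_to_16_bits(text):
    """Hypothetical failure mode (an assumption, not a confirmed cause):
    keep only the low 16 bits of every code point."""
    return "".join(chr(ord(ch) & 0xFFFF) for ch in text)


# Two Shavian letters, both outside the BMP.
sample = "\U0001046F\U0001046C"

for ch, cp in non_bmp_chars(sample):
    # Show where each code point would land if it were cut down to 16 bits.
    print(f"U+{cp:05X} -> U+{cp & 0xFFFF:04X}")
# prints:
# U+1046F -> U+046F
# U+1046C -> U+046C
```

Under that assumption, Shavian letters (U+10450 to U+1047F) would land in the Cyrillic block (U+0450 to U+047F), while text that is entirely inside the BMP would pass through unchanged.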


