Emoji Support

Emojis and their codes are defined by the Unicode Consortium. They are listed on the full emoji list page on the consorium's site.

You can use emojis in your filters and classification rules.

Emojis in query filters

PYLON query filters support all unicode characters, therefore you can reference any of the emojis in the unicode list in your filter conditions.

When you write filters containing emojis you need to use the unicode value for the emoji.

In some circumstances the emoji itself will work, but due to inconsistencies in the way that applications handle emojis we recommend that you use the unicode value.

As an example imagine you'd like to filter for Facebook topic data stories that contain a heart (❤). You would look up the unicode value for the character (U+2764) and use this in your filter condition:

fb.all.content contains "\u2764"

To filter for stories that contain any of a set of emojis (❤,✈,⛵) you would again use the code for each character:

fb.all.content contains_any "\u2764,\u2708,\u26F5"

You can use the operators such as contains_any and contains because emojis are tokenized in PYLON recordings.

Make sure that when you use emojis your filter is encoded as a unicode string. For example in Python:

from datasift import Client
client = Client("YOUR USERNAME", "IDENTITY API KEY")
client.pylon.analyze(id, analyze_parameters, filter=u'fb.all.content contains "\u2764"')

Emojis in interaction filters

Interaction filters (for both PYLON and STREAM products) currently support a limited set of emojis for filtering and classification rules. The supported emojis are listed below.

Again, when you write filters containing emojis you need to use the unicode value for the emoji.

Revisiting the first example above the CSDL differs for an interaction filter. You would look up the unicode value for the heart character (U+2764) convert this to the hexadecimal equivalent \x{2764} and use this in your filter condition:

fb.content regex_partial "\\x{2764}"

Notice that the backslash is escaped, and that you need to use a regular expression operator such as regex_partial because emojis are not tokenized as they are for query filters.

The same is true for a set of emojis:

fb.content regex_partial "\\x{2764}|\\x{2708}|\\x{26F5}"

Emojis supported in interaction filters

U+26D4 No Entry
U+26D3 Chains
U+26D1 Helmet With White Cross
© U+00A9 Copyright Sign
U+2764 Heavy Black Heart
U+2763 Heavy Heart Exclamation Mark Ornament
U+2602 Umbrella
U+2603 Snowman
U+2600 Black Sun With Rays
U+2601 Cloud
U+2604 Comet
U+2328 Keyboard
U+261D White Up Pointing Index
U+26EA Church
U+27A1 Black Rightwards Arrow
U+26C5 Sun Behind Cloud
U+26C4 Snowman Without Snow
U+26C8 Thunder Cloud And Rain
U+24C2 Circled Latin Capital Letter M
U+26CF Pick
U+26CE Ophiuchus
U+2199 South West Arrow
U+2196 North West Arrow
U+2197 North East Arrow
U+2194 Left Right Arrow
U+2195 Up Down Arrow
U+2618 Shamrock
U+2753 Black Question Mark Ornament
U+2754 White Question Mark Ornament
U+2755 White Exclamation Mark Ornament
U+2757 Heavy Exclamation Mark Symbol
U+2611 Ballot Box With Check
U+2615 Hot Beverage
U+2614 Umbrella With Rain Drops
U+260E Black Telephone
® U+00AE Registered Sign
U+26F0 Mountain
U+26F1 Umbrella On Ground
U+26F2 Fountain
U+26F3 Flag In Hole
U+2620 Skull And Crossbones
U+26F5 Sailboat
U+2622 Radioactive Sign
U+2623 Biohazard Sign
U+26F8 Ice Skate
U+26F9 Person With Ball
U+3299 Circled Ideograph Secret
U+2747 Sparkle
U+2744 Snowflake
U+3297 Circled Ideograph Congratulation
U+263A White Smiling Face
U+2692 Hammer And Pick
U+2694 Crossed Swords
U+2697 Alembic
U+2696 Scales
U+2699 Gear
U+269C Fleur
U+269B Atom Symbol
U+2639 White Frowning Face
U+2638 Wheel Of Dharma
U+21A9 Leftwards Arrow With Hook
U+2733 Eight Spoked Asterisk
U+2734 Eight Pointed Black Star
U+274E Negative Squared Cross Mark
U+274C Cross Mark
U+26FA Tent
U+262F Yin Yang
U+26FD Fuel Pump
U+262A Star And Crescent
U+2702 Black Scissors
U+2626 Orthodox Cross
U+2728 Sparkles
U+2721 Star Of David
U+26F4 Ferry
U+3030 Wavy Dash
U+2122 Trade Mark Sign
U+2648 Aries
U+2649 Taurus
U+26F7 Skier
U+271D Latin Cross
U+26AA Medium White Circle
U+26AB Medium Black Circle
U+2714 Heavy Check Mark
U+2716 Heavy Multiplication X
U+2712 Black Nib
U+26A1 High Voltage Sign
U+26A0 Warning Sign
U+25AB White Small Square
U+25AA Black Small Square
U+2651 Capricorn
U+2650 Sagittarius
U+2653 Pisces
U+2652 Aquarius
U+264F Scorpius
U+264D Virgo
U+264E Libra
U+264B Cancer
U+264C Leo
U+264A Gemini
U+203C Double Exclamation Mark
U+26BD Soccer Ball
U+26BE Baseball
U+2705 White Heavy Check Mark
U+2693 Anchor
U+27B0 Curly Loop
U+2709 Envelope
U+2708 Airplane
U+2660 Black Spade Suit
U+2663 Black Club Suit
U+2665 Black Heart Suit
U+2666 Black Diamond Suit
U+2668 Hot Springs
U+2198 South East Arrow
U+262E Peace Symbol
U+267F Wheelchair Symbol
U+25C0 Black Left
U+267B Black Universal Recycling Symbol
U+2795 Heavy Plus Sign
U+2796 Heavy Minus Sign
U+2797 Heavy Division Sign
U+26B1 Funeral Urn
U+21AA Rightwards Arrow With Hook
U+231A Watch
U+231B Hourglass
U+25B6 Black Right
U+270C Victory Hand
U+270B Raised Hand
U+270A Raised Fist
U+26B0 Coffin
U+270F Pencil
U+270D Writing Hand
U+27BF Double Curly Loop