Hashtags and Cashtags

Hashtags

Hashtags frequently appear in Tweets. They're easy to identify because the first character is always #. For example:

I found an awesome Big Data platform called #DataSift.

In this example, the hashtag is #DataSift. Because the hash symbol is classed as punctuation, hashtags are treated as separate words, so when you filter for "#apple" you are looking for the word "#" followed by the word "apple", with zero or more whitespace characters in between.

There are several different ways to filtering on hashtags. You could write:

interaction.content contains "#DataSift"

This will match any interactions containing "#DataSift", or "# DataSift", as the "#" symbol is tokenized as a separate word.

Alternatively, you could write:

interaction.content contains "DataSift"

This will match any interactions containing "#DataSift" or "DataSift". As in the above example, the "#" symbol is tokenized as a separate word, so filtering for your hashtag without the "#" symbol will still match this hashtag.

Or you could use our twitter.hashtags target, which filters on hashtags in Tweets and ignores everything else in Tweets. The following piece of CSDL would therefore only match Tweets with the #DataSift hashtag:

twitter.hashtags contains "DataSift"

Cashtags

DataSift uses 'cashtags' to allow you to filter for stock ticker symbols; for example:

twitter.text any "$AAPL, $GOOG"

This works because we have chosen to treat the $ symbol as an alphabetical character, not as punctuation.