Take a look at our Augmentations page too.
Here are the individual sources that DataSift accesses. Click on each one to see what you can filter against:
Targets are key components of the filters you write in DataSift.
We currently offer more than 300 targets, and that number is growing. The targets documentation lists each one, organized by the service which provides them. It also gives examples of how to use them.
The value for each target has a data type: integer, string, float, geo, or an array of any of these types.
Individual targets are not always present. For instance, the Twitter targets will not be present in Myspace objects. Furthermore, some of the Twitter targets might not be present even if the object comes from Twitter. For instance, the Retweet targets are not populated unless the object is a Retweet. If it is a Retweet, some of the Tweet targets will not be exist because the significant information is carried in the Retweet part of the object. Our documentation for each target indicates whether or not the target always exists.
You can browse the consolidated list of all the targets and augmentations.
Here are some samples of Twitter output objects.
The common targets are useful when you want to create streams that work for input objects from any source.
For example, if an input object comes from Twitter, interaction.content contains the same string that you would find in twitter.text, the text of a Tweet. Alternatively, if the content comes from YouTube, interaction.content might contain a comment concerning a video.
The main content of the post; for example: the text of a Tweet from Twitter.
The main content of a post. For example, if the input object is a Tweet, interaction.content contains the text of the Tweet.
It is a general-purpose target. Use it to filter across all the data sources we have rather than restricting your search to just one.
NoteThe filtering engine ignores links and Twitter @mentions. Use the interaction.link target to filter for links and interaction.mentions to filter for @mentions.
1. Filter for posts from any data source that mention the iPad in their main content:
2. Filter for Tweets and Retweets about big data that contain links to DataSift or that mention our founder:
The main content of the post. It is a general target which you can use to filter across all the data sources we have.
Filter for posts from any data source that mention the iPad in their main content.
The location from which this input object was created.
The latitude and longitude from which a post was sent.
Learn more about longitude and latitude.
CSDL offers three geographical operators:
See also: twitter.geo.
1. To see objects that include interaction.geo information:
2. To see objects posted within a 50 kilometer radius of DataSift's UK office in Reading:
The latitude and longitude from which a post was sent.
Use our set of special operators to filter for specific geo locations. The Geo box operator allows you to filter for content sent from within any rectangle that you define. The Radius operator allows you to filter for content sent from within any circle you define. The Polygon operator allows you to search for content sent from within any polygon you define, with up to 32 vertices.
Filter for messages sent within 1 mile of the Empire State Building.
A link to an individual post.
A direct link to an individual post. For example, if a post came from Twitter, interaction.link points to the individual page for the Tweet.
1. To see examples of links in the interaction.links target:
A direct link to an individual post.
If a post came from Twitter, interaction.link points to the individual page for the Tweet.
A random float in the range 0-100. Used for sampling data.
By default, a stream in DataSift looks at all of the input objects that arrive from your chosen data sources. For example, this filter looks at every input object sent to DataSift along the Twitter Firehose:
In situations where you are performing statistical analysis on data, you can use the technique of sampling.
The interaction.sample target is an internally generated floating-point random number between 0 and 100.
This filter samples 5.25 percent of the incoming input objects and ignores the rest:
Twitter limits your to 500,000 Tweets in a 24-hour period. You can use interaction.sample to reduce your data consumption.
1. To sample 1 percent of incoming Tweets:
2. Filter for all the Tweets that mention "coffee" and for a 10% sample of the Retweets that mention coffee:
3. You can even nest the samples:
By default, a stream in DataSift looks at all of the input objects that arrive from your chosen data sources. This filter samples a specified percent of incoming input objects and ignores the rest. This target is particularly useful when you want to perform statistical analysis on data.
Twitter also limits your data consumption to 500,000 Tweets in a 24-hour period. So you can use interaction.sample to reduce your data consumption.
Sample 2 percent of incoming messages.
The application that was used to generate and send a post.
The application that was used to generate and send a post.
This target can take a wide range of values. Here are some examples:
1. Filter for content generated in TweetDeck:
The application that was used to generate and send a post. For example, A.plus for Blackberry, Google2Tweet, TweetDeck, and so forth.
Filter for posts generated on an iPad.
The title associated with a post.
The title associated with a post. If the post is a blog, for example, it is meaningful to filter by title. If the post is a Tweet, there will be no title.
1. Filter for posts that mention the iPad in their title:
The title associated with the content.
Filter for all content with iPad in its title.
The data source that generated a post; for example: "twitter".
The data source that supplied a post.
| If the Data source is: | The interaction.type is: | |
| 2channel | 2ch | |
| Amazon | amazon | |
| Blogs | blog | |
| Boards | board | |
| DailyMotion | dailymotion | |
| Demographics | ||
| Flickr | flickr | |
| IMDb | imdb | |
| Newscred | newscred | |
| Topix | topix | |
| Videos | video | |
| Wikipedia | wikipedia | |
| YouTube | youtube |
1. Filter for content from Twitter and YouTube that mentions "apple":
2. Filter for content from Twitter only:
To make sure that this filter runs correctly, go to the Data Sources page and make sure that Twitter is activated. Activating Twitter automatically de-activates your Demographics feed so, be careful. Do not activate Twitter without first checking whether you have any Demographics streams running.
Note that we are sampling just 2 percent of the feed to reduce costs and make sure we do not exceed our Twitter rate limit.
3. Filter content from Demographics:
To make sure that this filter runs correctly, go to the Data Sources page and make sure that Demographics is activated. Activating Demographics automatically de-activates your Twitter feed so, be careful. Do not activate Demographics without first checking whether you have any Twitter streams running.
Note that we are sampling just 2 percent of the feed to reduce costs and make sure we do not exceed our Twitter rate limit.
The data source from which an interaction originated.
Filter for posts from Twitter, Youtube, and IMDb.
A URL linking to the author's avatar.
A URL linking to the author's avatar.
1. This target is rarely used in filtering, but the data is delivered in the output object. Your client software can process it if you chose to use it. To see examples of interaction.author.avatar:
A URL to the author's avatar.
This target is rarely used for filtering, but the data is delivered in the output object.
The author's id on the service from which they generated a post.
The author's id on the service from which they generated a post.
1. To see examples of ids for all the services from which DataSift accepts data:
The author's id on the service from which they generated a post.
Filter for content by DataSift's user id on Twitter, 155505157.
The URL the user has assigned for their homepage or profile.
A link to the author's main page on the service from which they generated a post.
1. To see examples of interaction.author.link for all the services from which DataSift accepts data:
The URL the user has assigned for their homepage or profile.
Filter for homepage links to datasift.com.
The real name of the author of a post.
The author's real name on the service from which they generated the post.
For example, if the post is a Tweet, interaction.author.name contains the string you would find in twitter.user.name.
1. Filter for content written by authors with "Linus" in their real name:
The real name supplied by the author, on the service from which they generated the post.
Filter for posts from people with "Bill Gates" as their real name.
The username of the author of a post.
The author's username on the service from which they generated a post.
For example, if the post is a Tweet, interaction.author.username contains the string you would find in twitter.user.screen_name.
1. Filter for content written by authors with "microsoft" in their username:
The username supplied by the author, on the service from which they generated a post.
Filter for authors with "apple" in their username.
Regardless of the data source, interaction.sample always contains a random floating-point number between 0 and 100.
| Interaction | 2ch |
| interaction.content | 2ch.content |
| interaction.geo | 2ch.geo |
| interaction.link | 2ch.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | 2ch.source |
| interaction.title | 2ch.title |
| interaction.type | "2ch" |
| interaction.author.avatar | 2ch.author.avatar |
| interaction.author.id | 2ch.author.id |
| interaction.author.link | 2ch.author.link |
| interaction.author.name | 2ch.author.name |
| interaction.author.username | 2ch.author.username |
| Interaction | Amazon |
| interaction.content | amazon.content |
| interaction.geo | amazon.geo |
| interaction.link | amazon.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | amazon.source |
| interaction.title | amazon.title |
| interaction.type | "amazon" |
| interaction.author.avatar | amazon.author.avatar |
| interaction.author.id | amazon.author.id |
| interaction.author.link | amazon.author.link |
| interaction.author.name | amazon.author.name |
| interaction.author.username | amazon.author.username |
| Interaction | Bitly |
| interaction.content | |
| interaction.geo | bitly.geo |
| interaction.link | bitly.url |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.sample | |
| interaction.source | bitly.cname |
| interaction.title | |
| interaction.type | |
| interaction.author.avatar | |
| interaction.author.id | |
| interaction.author.link | |
| interaction.author.name | |
| interaction.author.username |
| Interaction | Blog |
| interaction.content | blog.content |
| interaction.geo | blog.geo |
| interaction.link | blog.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | blog.source |
| interaction.title | blog.title |
| interaction.type | "blog" |
| interaction.author.avatar | blog.author.avatar |
| interaction.author.id | blog.author.id |
| interaction.author.link | blog.author.link |
| interaction.author.name | blog.author.name |
| interaction.author.username | blog.author.username |
| Interaction | Board |
| interaction.content | board.content |
| interaction.geo | board.geo |
| interaction.link | board.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | board.source |
| interaction.title | board.title |
| interaction.type | "board" |
| interaction.author.avatar | board.author.avatar |
| interaction.author.id | board.author.id |
| interaction.author.link | board.author.link |
| interaction.author.name | board.author.name |
| interaction.author.username | board.author.username |
| Interaction | DailyMotion |
| interaction.content | dailymotion.content |
| interaction.geo | dailymotion.geo |
| interaction.link | dailymotion.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | dailymotion.source |
| interaction.title | dailymotion.title |
| interaction.type | "dailymotion" |
| interaction.author.avatar | dailymotion.author.avatar |
| interaction.author.id | dailymotion.author.id |
| interaction.author.link | dailymotion.author.link |
| interaction.author.name | dailymotion.author.name |
| interaction.author.username | dailymotion.author.username |
| Interaction | |
| interaction.content | One of: facebook.message, facebook.icon, facebook.story, facebook.description, or facebook.caption |
| interaction.geo | |
| interaction.link | |
| interaction.mentions | facebook.to.names |
| interaction.mention_ids | facebook.to.ids |
| interaction.sample | |
| interaction.source | facebook.source |
| interaction.title | facebook.name |
| interaction.type | "facebook" |
| interaction.author.avatar | facebook.author.avatar |
| interaction.author.id | facebook.author.id |
| interaction.author.link | facebook.author.link |
| interaction.author.name | facebook.author.name |
| interaction.author.username |
| Interaction | Flickr |
| interaction.content | flickr.content |
| interaction.geo | flickr.geo |
| interaction.link | flickr.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | flickr.source |
| interaction.title | flickr.title |
| interaction.type | "flickr" |
| interaction.author.avatar | flickr.author.avatar |
| interaction.author.id | flickr.author.id |
| interaction.author.link | flickr.author.link |
| interaction.author.name | flickr.author.name |
| interaction.author.username | flickr.author.username |
| Interaction | IMDb |
| interaction.content | imdb.content |
| interaction.geo | imdb.geo |
| interaction.link | imdb.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | imdb.source |
| interaction.title | imdb.title |
| interaction.type | "imdb" |
| interaction.author.avatar | imdb.author.avatar |
| interaction.author.id | imdb.author.id |
| interaction.author.link | imdb.author.link |
| interaction.author.name | imdb.author.name |
| interaction.author.username | imdb.author.username |
| Interaction | NewsCred |
| interaction.content | newscred.article.content, newscred.image.caption, or newscred.video.caption |
| interaction.geo | |
| interaction.link | newscred.article.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.sample | |
| interaction.source | newscred.source.name |
| interaction.title | newscred.article.title or newscred.video.title |
| interaction.type | "newscred" |
| interaction.author.avatar | |
| interaction.author.id | |
| interaction.author.link | |
| interaction.author.name | First entry in the newscred.article.authors array |
| interaction.author.username | |
| Interaction | |
| interaction.content | reddit.content |
| interaction.geo | reddit.geo |
| interaction.link | reddit.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | reddit.source |
| interaction.title | reddit.title |
| interaction.type | "reddit" |
| interaction.author.avatar | reddit.author.avatar |
| interaction.author.id | reddit.author.id |
| interaction.author.link | reddit.author.link |
| interaction.author.name | reddit.author.name |
| interaction.author.username | reddit.author.username |
| Interaction | Topix |
| interaction.content | topix.content |
| interaction.geo | topix.geo |
| interaction.link | topix.link |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | topix.source |
| interaction.title | topix.title |
| interaction.type | "topix" |
| interaction.author.avatar | topix.author.avatar |
| interaction.author.id | topix.author.id |
| interaction.author.link | topix.author.link |
| interaction.author.name | topix.author.name |
| interaction.author.username | topix.author.username |
| Interaction | |
| interaction.content | twitter.text |
| interaction.geo | twitter.geo |
| interaction.link | |
| interaction.mentions | twitter.mentions or twitter.retweet.mentions |
| interaction.mention_ids | twitter.mention_ids or twitter.retweet.mention_ids |
| interaction.sample | |
| interaction.source | twitter.source |
| interaction.title | |
| interaction.type | "twitter" |
| interaction.author.avatar | twitter.user.profile_image_url |
| interaction.author.id | twitter.user.id |
| interaction.author.link | |
| interaction.author.name | twitter.user.name |
| interaction.author.username | twitter.user.screen_name |
| This Interaction target: | Video: |
| interaction.content | video.content |
| interaction.geo | |
| interaction.link | video.videolink |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | |
| interaction.title | video.title |
| interaction.type | |
| interaction.author.avatar | |
| interaction.author.id | |
| interaction.author.link | video.author.kink |
| interaction.author.name | video.author.name |
| interaction.author.username |
| This Interaction target: | Wikipedia: |
| interaction.content | wikipedia.comment |
| interaction.geo | |
| interaction.link | |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | |
| interaction.title | wikipedia.title |
| interaction.type | |
| interaction.author.avatar | |
| interaction.author.id | |
| interaction.author.link | |
| interaction.author.name | |
| interaction.author.username |
| This Interaction target: | YouTube: |
| interaction.content | youtube.content |
| interaction.geo | |
| interaction.link | youtube.videolink |
| interaction.mentions | |
| interaction.mention_ids | |
| interaction.source | |
| interaction.title | youtube.title |
| interaction.type | |
| interaction.author.avatar | |
| interaction.author.id | |
| interaction.author.link | youtube.author.link |
| interaction.author.name | youtube.author.name |
| interaction.author.username |
List of user IDs for users mentioned in an interaction.
List of user IDs for users mentioned in an interaction.
This target will match interactions otherwise matched by twitter.mention_ids and twitter.retweet.mention_ids as well as facebook.to.ids.
Be aware a user's ID on one site might, by chance, be the same as another user's ID on another site. For example, my numeric ID on Facebook might be the same as some else's Twitter ID.
List of users/usernames mentioned in an interaction.
List of users/usernames mentioned in an interaction.
This target will match interactions otherwise matched by twitter.mentions and twitter.retweet.mentions as well as facebook.to.names.
Make sure that the name(s) you use as the argument of this target point to the accounts owned by the same user(s).
Review these JSON examples of the Twitter output format.
A list of the domains from links mentioned in a Tweet.
A list of the domains from links mentioned in a Tweet.
Remember that Twitter has an automated link shortener which converts links mentioned in a Tweet into links belonging to the t.co domain. The twitter.domains target contains the unwrapped version of the link, not the t.co shortened version.
The author of the Tweet might have shortened a link already; they might have converted apple.com, for example, into a bit.ly shortened link. Twitter's automatic shortener might, in turn, convert that bit.ly link into a t.co link. In this example, the twitter.domains target unwraps the t.co link to a bit.ly link, but it does not resolve that bit.ly link all the way back to the original apple.com link. In other words, if a link has been shortened, the process always performs one unwrap but no more.
DataSift has a powerful link aggregator of its own, the Links Augmentation, which fully resolves links back to the page they point to. Take a look at the links.domain target.
1. Filter for Tweets that include links to Apple.com. Note that the target is an array and we are filtering for just one value so we use the == operator rather than contains.
2. Filter for Tweets that include links to google.com or microsoft.com. Note that the target is an array and we are filtering for a value from a list so we use the in operator rather than contains.
A list of the domains mentioned in links in a Tweet. This target allows you to filter for Tweets that mention particular websites.
Filter for Tweets that mention pages from flickr.com.
The location from which a Tweet was sent.
The geographical location information for a Tweet represented by a pair of longitude and latitude coordinates.
Learn more about longitude and latitude.
CSDL offers three geographical operators:
See also: interaction.geo.
1. Filter for Tweets originating within a 50 kilometer radius of DataSift's UK office in Reading:
2. Filter for Tweets originating within a region represented by a rectangle:
3. Filter for Tweets originating within a region represented by a polygon:
The geo location, in latitude and longitude, from which a Tweet was posted. You can draw the area that you’re interested in as a circle, a rectangular box, or a polygon defined by up to 32 points.
Filter for Tweets posted from within a 10-mile radius of New York City.
The Twitter username of the author a Tweet replies to (if it is a reply).
If a Tweet is a reply, twitter.in_reply_to_screen_name contains the username of the author a Tweet is replying to.
Otherwise it is not populated.
1. Filter for Tweets that are replies:
2. Filter for Tweets that reply to Tweets from Datasift:
Note that the argument is "datasift" rather than "@datasift". You do not include the @ symbol.
The Twitter username of the author who posted the original Tweet, which received a reply. This target is useful when you want to see everything that an author Tweets together with all the responses.
Filter for all the replies to Lady Gaga.
A list of links mentioned in a Tweet.
A list of the links mentioned in this Tweet.
Note that DataSift also has a powerful link aggregator of its own, available via the Links Augmentation.
1. Filter for Tweets that include a link to a particular page at yahoo.com. Note that the target is an array and we are filtering for just one value so we use the == operator rather than contains.
2. Filter for Tweets that include a link to any page from a set of two or more pages. Note that the target is an array and we are filtering for one value from a list so we use the in operator rather than contains.
A list of links mentioned in a Tweet.
Filter for all the Tweets that mention the link "http://scores.espn.go.com/nfl/".
A list of Twitter usernames mentioned in a Tweet.
A list of the Twitter usernames mentioned in a Tweet.
Take care to use the correct operator with this target. As the examples show, use == if you are filtering for just one username and use in if you are filtering for a list of two or more usernames.
1. Filter for Tweets that mention @datasift:
Note that the argument is "datasift" rather than "@datasift". Also, remember that the target is an array and we are filtering for just one value so we use the == operator (not contains).
2. Filter for Tweets that mention @LadyGaga or @Beyonce:
The target is an array and we are filtering for a list of values so we use the in operator (not contains).
You cannot filter for usernames or links in the twitter.text target.
For example, suppose the original text of a Tweet is:
"@datasift launched today! http://bitly.com/r6pmP0"
The Tweet will not match against this filter:
twitter.text contains "@datasift"
OR twitter.text subst "bitly"
It will match against this filter:
twitter.text contains "launched today"
A list of the Twitter usernames mentioned in a Tweet.
Filter for Tweets that mention "BarackObama" and "MittRomney".
The source of the Tweet. For example, "Twitter for iPad" or "TweetDeck".
The source of the Tweet. For example, "Twitter for iPad" or "TweetDeck".
1. Filter for Tweets that were not sent from the Twitter.com user interface:
2. Filter for people Tweeting from a Blackberry:
Notice that we use the substr operator here because there is more than one Twitter client for the Blackberry.
Here are some typical examples of sources that you might see in a DataSift stream:
| 4MedProNetwork | ALToolbar | TwitPal |
| DROID | TwitBird | shareaholic |
| MetroTwit | Prinz | SOICHA |
| Indeed.com | Tumblr | Tween |
| HootSuite | TweetCaster for Android | TweetDeck |
| WPTweeter | Write Longer | |
| Keitai Web | UberSocial for BlackBerry | Echofon |
| Tweet Button | twitterfeed | Twitterrific for Mac |
| Twitter for iPad | Twitter for iPhone | Twitter for Android |
| ShootingStarPro | Revenge Of The Fallen | twicca |
| Gravity | shabik360 | ShootingStar |
| FingerLaser | Shazam on iOS | Mobile Web |
| Buffer | SocialScope |
Note that DataSift returns them wrapped in anchor tags in the JSON output like this:
<a href="http://www.tumblr.com/" rel="nofollow">Tumblr</a>
To collect 'clean' copies of the sources, you could consider using the interaction.source target.
The source from where a Tweet originated.
Filter for Tweets sent from Instagram users.
The text of a Tweet.
The 140-character text of a Tweet.
DataSift's filtering engine strips URLs and @mentions from the text of the Tweet so make sure you select your target carefully:
| Use this target: | To filter for this content: |
| twitter.text | The text of a Tweet only. |
| twitter.links | Links embedded in a Tweet. |
| twitter.domains | Domains of links embedded in a Tweet. |
| twitter.mentions | Twitter user screen names mentioned in a Tweet. |
You can write a filter that looks for "bitly" in twitter.text like this:
twitter.text contains "bitly"
It matches this Tweet: "Have you used bitly today?"
However, to filter for the link in this Tweet: "Take a look at http://bitly.com/r6pmP0"
you would use the twitter.links or twitter.domains targets or the Links augmentation.
Similarly, to look for Tweets that refer to a specific user like this one: "Have you tried @DataSift yet?"
you would use the twitter.mentions target.
1. Filter for Tweets that refer to a birthday:
2. Filter for people talking about the 2012 Olympics on Twitter:
You cannot filter for usernames or links in the twitter.text target.
For example, suppose the original text of a Tweet is:
"@DataSift launched today! http://bitly.com/r6pmP0"
The Tweet will not match against this filter:
"twitter.text contains "@datasift"
OR twitter.text subst "bitly"
It will match against this filter:
twitter.text contains "launched today"
The 140-character main text of a Tweet.
Filter for Tweets that contain the text "iPhone 5 release".
The Twitter user's biographical description.
The biography information from the Twitter user's Profile page. Twitter limits this field to 160 characters.
1. Filter for Tweets from people who include selected words in their Twitter Bio:
The 160-character biography information from the Twitter user's Profile page.
Filter for Tweets from anyone who includes "IT" in their Twitter Profile.
The number of followers the user has.
The number of followers the author of a Tweet has on Twitter.
1. Filter for Tweets from popular Twitterers:
The number of followers the author of a Tweet has on Twitter.
Filter for Tweets from Twitter members who have more than 500,000 followers.
Ratio of followers to following users.
The number of people who follow this Twitter user divided by the number of people the user follows on Twitter.
1. Filter for Tweets from authors who are followed by 10 times more people than they follow:
2. Filter for Tweets from authors who have posted more than 10,000 Tweets and who are followed by 50 times more people than they follow:
The ratio of the number of people who follow a Twitter user divided by the number of people that user follows on Twitter.
Filter for Tweets from authors who are followed by 10 times more people than they follow.
The number of people an author follows.
The number of people an author follows on Twitter.
1. Filter for Tweets from authors who follow more than 100,000 people on Twitter:
The number of other Twitter users that a Twitter user follows.
Filter for Tweets from users who follow more than 100,000 people on Twitter.
Unique ID of the Twitter user.
A user's unique id on Twitter. For example, the Twitter id for @DataSift is 155505157.
The id is not exposed on the Twitter website but can be used if you write code to access the Twitter API.
Note that Twitter treats user IDs and Tweet IDs differently.
Filter for all Tweets from a list of accounts you specify by their user id.
1. Filter for Tweets from DataSift:
The unique user id of a Twitter user.
Filter for tweets by Lady Gaga, using her Twitter id "14230524".
Two-character language code that the user set in Twitter.
The language the Twitter user selected from the drop-down list on their Settings page.
It is an indication of their primary language but it might be misleading. For example, a user might select French as their mother tongue but compose their Tweets in English. DataSift offers an augmentation called Language which determines the language based on a statistical analysis of groups of characters.
The drop-down list shows the full name of the language (English, French, German, and so on) but the twitter.user.lang target contains the code for the language.
Take a look at the list of languages that Twitter supports.
1. Filter for Tweets from authors who specify French as their preferred language:
2. Filter for Tweets from authors who specify French as their preferred language but Tweet in English:
3. Filter for Tweetz written in Lolcatz:
You might need to wait for these to show up. At the time of writing there are fewer than 100 per hour but if you're patient you'll see them.
The default language selected by a Twitter user on their Settings page.
Filter for Tweets from authors who specify French as their preferred language.
Number of lists in which a user appears.
The number of Twitter lists on which the author of a Tweet appears.
1. Filter for Tweets from people who appear on more than 100 lists:
The number of Twitter lists on which the author of a Tweet appears.
Filter for Tweets from people who appear on more than 100 lists.
The string description of the Twitter user's location.
The string description of the Twitter user's location. This is supplied by the Twitter user in the Location field on their Profile page. It is a freeform field, so they can type anything they choose:
1. Filter for Tweets from people who include Japan in their location description:
The string description of a Twitter user's location.
Filter for Tweets from users who state their location as London.
The 'real' name the user supplied to Twitter.
A Twitter user's 'real' name. Since this is a free-text entry field in the Twitter settings, it could contain anything. It could be the user's genuine name, perhaps augmented with a title (such as Dr or Professor), a suffix (such as "BA" or "Jnr" or "III"), or a post nominal (such as "FRSC" or "FAIA") indicating membership of an organization or professional body.
The range of possibilities is wide and includes obvious parody accounts, and accounts that are run by companies or organizations. For instance, the @climagic account, which Tweets about Unix-based computer operating systems, has a "real name" of Command Line Magic.
Note that a Twitter user can adjust their real name over time. They can change their screenname (for example @LadyGaga) too, but their Twitter id cannot be changed.
1. Filter for Tweets from users who have included "doctor" in their real name:
The name supplied by a Twitter user. Since this is a free-text entry field in the Twitter settings, it could contain anything. The name might indicate a name, parody account, or an account run by a company or organization.
Filter for Tweets from users that include "Snr" or "Sr" in their names.
The number of days since a user joined Twitter.
The number of days since this user joined Twitter.
1. Filter for Tweets from authors who joined Twitter more than three years ago:
2. Filter for Tweets from authors who are in their second month of Twitter membership:
The number of days since a user joined Twitter.
Filter for Tweets from users who have been members of Twitter for more than five years.
A user's Twitter username.
An author's Twitter username. For example, the screenname for the @DataSift account on Twitter is "DataSift".
Note that Twitter users can adjust their screenname over time so @LadyGaga could change to @LadyG, for example, if that screenname were available. A user's Twitter id cannot be changed so, for consistency over time, we recommend this procedure:
Filter on a screenname until you have collected at least one Tweet.
Examine the output JSON data to determine the Twitter id that corresponds to the screenname.
Rewrite your CSDL filter to use the id instead of the screenname.
1. Filter for Tweets from authors who include "marketing" in their Twitter username:
2. Filter for Tweets from a set of authors:
A user's Twitter username.
Filter for Tweets from users with "Apple" in their Twitter username.
The number of messages a Twitter user has posted.
The total number of Tweets and Retweets a Twitter user has posted.
1. Filter for Tweets from prolific Twitterers:
The total number of Tweets and Retweets a Twitter user has posted.
Filter for Tweets from people who have posted more than 100,000 messages.
A Twitter user's timezone.
The time zone that a Twitter user selected in their Twitter Settings.
Note that Twitter uses city names to specify timezones. For example, using this target, your filter might match against Edinburgh and against London, but these both represent a single timezone.
1. Filter for Tweets from the same timezone as Tokyo:
2. Filter for Tweets from users who have not selected London as their time zone:
Here's a complete list of the values you can filter against with twitter.user.time_zone:
The time zone that a Twitter user has selected in their Twitter Settings.
Filter for Tweets from users in the New York time zone.
The URL a user chose to include in their Twitter profile.
The URL a user chose to include in their Twitter profile. This is optional, so it is frequently unpopulated.
1. Look for Tweets that include a twitter.user.url:
The URL a Twitter user chose to include in their Twitter profile. This is an optional field, so it might be unpopulated.
Filter for Tweets from users that include a link containing "wordpress" in their profile.
The country a Tweet was sent from.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
The country from which a Tweet was sent.
See also twitter.place.country_code.
1. Filter for Tweets for which the user-supplied location information is in Canada:
Twitter treats the US and UK as "United States" and "United Kingdom". Here are some sample countries that you can filter against:
| Algeria | Argentina | Armenia |
| Australia | Austria | Azerbaijan |
| Bahamas | Belarus | Belgien |
| Belgium | Bosnia and Herzegovina | Brasil |
| Canada | Chile | Colombia |
| Costa Rica | Curacao | Denmark |
| Dominican Republic | Ecuador | Egypt |
| El Salvador | Estonia | Finland |
| France | Georgia | Germany |
| Greece | Guadeloupe | Guatemala |
| Honduras | Hong Kong | Hungary |
| India | Indonesia | Ireland |
| Italia | Italy | Jamaica |
| Japan | Korea, Republic Of | Kosovo |
| Lao People's Democratic Republic | Latvia | Luxembourg |
| Macedonia, the Former Yugoslav Republic Of | Malaysia | Martinique |
| Mexico | Monaco | Morocco |
| Norway | Pakistan | Panama |
| Peru | Philippines | Poland |
| Portugal | Russia | Saudi Arabia |
| Serbia | South Africa | Spain |
| Sri Lanka | Suomi | Sweden |
| Switzerland | Taiwan, Republic of China | Thailand |
| The Netherlands | Trinidad and Tobago | Tunisia |
| Turkey | Ukraine | United Arab Emirates |
| United Kingdom | United States | Uruguay |
| Venezuela, Bolivarian Republic Of |
The country from which a Tweet was sent. You can filter against countries by name or you can use Country Code if you prefer to filter by their 2-character ISO code.

Filter for Twitter content from Italy.
The code representing the country a Tweet was sent from.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
The two-character country code that corresponds to the location the Twitter user supplied for an individual Tweet.
You can learn more about country codes at this International Standards Organization page.
See also twitter.place.country.
1. Filter for Tweets for which the user-supplied location information is in Canada:
The two-character country code that corresponds to the location the Twitter user supplied for an individual Tweet.
Filter for Tweets from France.
Full name of the location from which a Tweet was sent.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
Full details of the location the Twitter user supplied for an individual Tweet. For example:
See also: twitter.place.name.
1. Filter for Tweets for which the user-supplied location information is London, England:
2. Filter for Tweets from any location called London:
Full details of the location the Twitter user supplied for an individual Tweet. For example:
Filter for Tweets from Chicago.
Short name of the location from which this Tweet was sent.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
Typically, a shortened version of twitter.place.full_name. For example:
| If twitter.place.full_name is this: | twitter.place.name might be this: |
| Charlotte, NC | Charlotte |
| London, England | London |
| Division No. 11, Alberta | Division No. 11 |
Note that a twitter.place.full_name maps to a unique twitter.place.name but, in the reverse direction, one twitter.place.name might correspond to many different twitter.place.full_name values.
In other words, this filter:
twitter.place.full_name == "London, England"
always produces objects with twitter.place.name set to "London".
But this filter:
twitter.place.name == "London"
produces objects with twitter.place.full_name set to "London, England" or "London, KY" and, almost certainly, other values too.
1. Filter for Tweets for which the user-supplied location information is London:
2. Filter for Tweets from locations called London that are outside the United Kingdom:
Typically, a shortened version of twitter.place.full_name. For example:
| If twitter->place->full name is this: | twitter->place->name might be this: |
| Charlotte, NC | Charlotte |
| London, England | London |
| Division No. 11, Alberta | Division No. 11 |
Filter for Tweets from any town or city called London.
The type of location a Tweet was sent from; for example: city, neighborhood, point of interest.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
The place_type supplied by Twitter. Consult Twitter's documentation for further details. Typical values are:
The poi value indicates a point of interest such as Stonehenge or Niagara Falls.
1. Filter for Tweets from points of interest:
2. Filter for Tweets from cities and countries that include the string "England" in their name:
The place_type supplied by Twitter. Typical values are:
Filter for Tweets from likely tourist locations that mention a particular restaurant chain.
For a Tweet with location information, this string contains a link to the Twitter API to retrieve further information about the location.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.place.* targets are unpopulated for that Tweet.
For a Tweet with location information, this string contains a link to the Twitter API to retrieve further information about the location.
1. Filter for Tweets with additional place information:
Here's a sample return value from this filter:
http://api.twitter.com/1/geo/id/df0e3d16d714e714.json
For a Tweet with location information, this string contains a link to the Twitter API to retrieve further information about the location.
Filter for Tweets that have this additional location information.
The total number of Retweets for this Tweet.
The total number of Retweets for this Tweet.
See also links.retweet_count. Note that the Links augmentation handles the concept of a Retweet count as an overall count, not on a Tweet-by-Tweet basis.
1. Filter for Tweets that have been Retweeted more than 50 times:
The total number of Retweets for a particular Tweet.
![]() |
You could also take a look at Retweet count in the Links augmentation.
Note that the Links augmentation handles the concept of a Retweet count as an overall count, not on a Tweet-by-Tweet basis. |
Filter for Tweets that have been Retweeted more than 50 times.
List of domains from links in a Tweet that was Retweeted.
List of domains from links in the Tweet that was Retweeted.
1. Filter for Retweets of Tweets that link to apple.com:
Note that the target is an array and we are filtering for just one value so we use the == operator rather than contains.
2. Filter for Retweets of Tweets that link to google.com, microsoft.com, or both:
Note that the target is an array and we are filtering for a value from a list so we use the in operator rather than contains.
List of domains from links in a Tweet that was Retweeted.
Filter for Retweets of Tweets that link to google.com, microsoft.com, or both:
Time in seconds between this Retweet and the original Tweet.
The time interval measured in seconds between posting of the original Tweet and posting of this Retweet.
1. Filter for messages that were Retweeted within 15 seconds:
The time interval measured in seconds between the posting of the original Tweet and reposting of its Retweet.
Filter for Retweets that were made within one minute of the original Tweet.
List of links mentioned in a Tweet that was Retweeted.
A list of links mentioned in a Tweet that was Retweeted.
1. Filter for objects that include a particular Retweeted link:
2. Filter for objects that include a Retweeted link from a list:
A list of links mentioned in a Tweet that was Retweeted.
Filter for Twitter objects that include a particular Retweeted link.
The source of a Retweet; for example: "web" or "Tweetdeck".
The source of the Retweet; for example: "web" or "Tweetdeck".
1. Filter for Retweets that were not sent from the Twitter.com user interface:
2. Filter for people Retweeting from a Blackberry:
Notice that we use the substr operator here because there is more than one Twitter client for the Blackberry.
This tells you the source of the Retweet; for example: "web" or "Tweetdeck".
Filter for Retweets sourced from a Blackberry.
The Retweet text.
The 140-character text of a Retweet.
DataSift's filtering engine strips URLs and @mentions from the Retweet so make sure you select your target carefully:
| Use this target: | To filter for this content: |
| twitter.retweet.text | The text of a Retweet only. |
| twitter.links | Links embedded in a Retweet. |
| twitter.domains | Domains of links embedded in a Trtweet. |
| twitter.mentions | Twitter user screen names mentioned in a Retweet. |
See also: twitter.text
1. Filter for Retweets that mention Apple:
The text of a Retweet.
Filter for Retweets that mention Apple.
The Twitter biography information for the user who Retweeted a Tweet.
The biography information from the Profile page of the Twitter user who Retweeted a Tweet. Twitter limits this field to 160 characters.
1. Filter for Retweets from people who include selected words in their Twitter Bio:
The 160-character biography information from the Profile page of the Twitter user who Retweeted a Tweet.
Filter for Retweets from people who mention "Samsung" in their Twitter bio.
The number of followers the user has.
The number of followers the author of a Retweet has on Twitter.
1. Filter for Retweets from popular Twitter users:
The number of followers the author of a Retweet has on Twitter.
Filter for Retweets from popular Twitter users.
Ratio of followers to following users.
The number of people who follow a Retweeting user divided by the number of people the user follows on Twitter.
This information relates to the Retweeting user and has nothing to do with the author of the original Tweet.
1. Filter for Retweets sent by Twitter users who are followed by 10 times more people than they follow:
2. Filter for Retweets from authors who have posted more than 10,000 Tweets and who are followed by 50 times more people than they follow:
Note that we user twitter.retweet.user.statuses_count in the code rather than twitter.user.statuses_count because the twitter.user namespace might not be present for Retweets.
The ratio of the number of people who follow a Retweeting user and the number of people the Retweeting user follows on Twitter.
Filter for Retweets from authors who have posted more than 10,000 Tweets and who are followed by 50 times more people than they follow.
The number of people a Retweeting user follows.
The number of people a Retweeting user follows on Twitter.
1. Filter for Retweets from people who follow more than 100,000 Twitter members:
The number of people a Retweeting user follows on Twitter.
Filter for Retweets from people who follow more than 100,000 Twitter members.
The id of the Retweeting user.
The Retweeting user's unique id on Twitter. For example, the Twitter id for DataSift is 155505157.
The id is not exposed on the Twitter website but can be used if you write code to access the Twitter API.
Note that Twitter handles user ids and Tweet ids differently.
1. Filter for Retweets from DataSift:
The unique user id of Retweeting user on Twitter.
Filter for Retweets from DataSift by our Twitter id, 155505157.
The 'filtering' level of filtering applied to a Tweet.
Twitter's estimation of the importance of a Tweet. This target can take these values:
The medium and high entries roughly correlate to the "Top Tweets" results for searches on twitter.com.
Here's how it works. Twitter assigns a filter level to each Tweet. For example, I might write a Tweet that Twitter decides to mark as "low", which means that they don't see it as particularly significant. A moment later, you write a Tweet and they mark it as "medium" which is the highest level currently possible.
This DataSift filter will deliver both of our Tweets:
While this one will find yours but not mine:
1. Filter for Tweets that have their filter level set to "medium":
For more information, take a look at Twitter's blog, Introducing new metadata for Tweets.
A list of hashtags mentioned in a Tweet.
A list of Twitter hashtags mentioned in a Tweet.
When using this target for filtering, ensure that you use the correct operator with this target:
== operator in operatorSince this target is an array and we are filtering for a list of values, we are using the == or in operator (not contains or contains_any).
1. Filter for Tweets that mention #snow:
Note that the argument is "snow" and not "#snow"; you do not need to include the hashtag symbol in the CSDL.
2. Filter for Tweets that mention #snow or #work:
A list of Twitter hashtags mentioned in a Tweet.
When using this target for filtering, ensure that you use the correct operator with this target:
== operator in operatorSince this target is an array and we are filtering for a list of values, we are using the == or in operator (not contains or contains_any).
Filter for Tweets that mention #Apple or #Samsung.
The language (BCP 47 code) a Tweet was written in, as identified by Twitter.
The language a Tweet was written in, as identified by Twitter's machine language detection algorithms. The values are valid BCP 47 language identifiers, and may represent any of the languages listed on Twitter's advanced search page, or "und" if no language could be detected.
DataSift already has a language detection mechanism, of course, offered by our Language augmentation. But remember that there is a third way to find out which language a user prefers, by examining the language an author selected in their Settings page on Twitter. You can filter against this using twitter.user.lang, twitter.retweet.user.lang, or twitter.retweeted.user.lang. Take care, though, because users select their language from a drop-down list. They might make a mistake, or select a langauge that is not their own, or perhaps Tweet in more than one language. The bottom line here us that there might be a discrepancy between the language of the Tweet and the main language the user specified in their profile.
1. Filter for Tweets written in a language other than English:
A string displayed in place of the URL to an image included in a Tweet.
A string displayed in place of the URL to an image included in a Tweet.
When an author uploads an image to attach to their Tweet, Twitter automatically assigns it a unqiue display URL that is included in the Tweet, instead of the actual URL to an image. You can use this target to filter for content if you know the display URL of an image attached to a Tweet. This target is useful when you need to find out what people are saying about an image posted by a particular celebrity or brand.
1. Filter for Tweets that include an image with a particular display URL:
A string displayed in place of the URL to an image included in a Tweet.
When an author uploads an image to attach to their Tweet, Twitter automatically assigns it a unqiue display URL that is included in the Tweet, instead of the actual URL to an image. You can use this target to filter for content if you know the display URL of an image attached to a Tweet. This target is useful when you need to find out what people are saying about an image posted by a particular celebrity or brand.
Filter for Tweets that include an image with a particular display URL.
The URL of an image attached to a Tweet.
The URL of an image attached to a Tweet.
A Tweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Authors can use Twitter's Upload Photo feature to attach an image to their Tweet. You can use this target to filter for content if you know the URL of a media file included in a Tweet.
Remember, the URL of an image is different than the display URL visible in the Tweet itself. To view the full URL, click on the display URL of an image in a Tweet. Then right-click on the image and click View Image. The full URL is then visible in the address bar of your browser.
1. Filter for Tweets that include a particular image:
The URL of an image attached to a Tweet.
A Tweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Authors can use Twitter's Upload Photo feature to attach an image to their Tweet. You can use this target to filter for content if you know the URL of a media file included in a Tweet.
Remember, the URL of an image is different than the display URL visible in the Tweet itself. To view the full URL, click on the display URL of an image in a Tweet. Then right-click on the image and click View Image. The full URL is then visible in the address bar of your browser.
Filter for Tweets that mention "Samsung" and include a particular image.
The type of media included a Tweet.
The type of media included in a Tweet.
A Tweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Authors can use Twitter's Upload Photo feature to attach an image to their Tweet.
Ensure that when you use this target, you filter for the media type 'photo'. For instance, you can use this target to filter for Tweets that contain an image and mention a particular keyword.
1. Filter for Tweets with photo media attached and that mention "snow":
The type of media included in a Tweet.
A Tweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Authors can use Twitter's Upload Photo feature to attach an image to their Tweet.
Ensure that when you use this target, you filter for the media type 'photo'. For instance, you can use this target to filter for Tweets that contain an image and mention a particular keyword.
Filter for Tweets with photo media attached and mentioning Apple.
A list of ids of Twitter users mentioned in a Tweet.
A list of the ids of Twitter users mentioned in a Tweet.
Take care to use the correct operator with this target. As the examples show, use == if you are filtering for just one username and use in if you are filtering for a list of two or more usernames.
1. Filter for Tweets that mention @datasift by our Twitter id, 155505157:
Remember that the target is an array and we are filtering for just one value so we use the == operator.
2. Filter for Tweets that mention any Twitter id from a list of id:
A list of the ids of Twitter users mentioned in a Tweet.
Filter for Tweets that mention LadyGaga and Beyonce by their user ids "14230524, 31239408".
Additional locality detail for Twitter place information.
This target allows you to filter against the locality information that is visible in the JSON output embedded within the twitter.place.attributes element.
The twitter.place.attributes target can be difficult to filter against unless you know exactly what is in there. The twitter.place.attributes.locality attribute is a lower-level target. Since it is more specific, it is more valuable in filters.
In the US, this target typically contains the name of a city.
1. Filter for Tweets that mention Oakland as their locality:
The additional locality information that is attached to a Tweet, similar to the twitter.place.attributes target.
Filter for Tweets with additional locality information set as Soho.
Additional region detail for Twitter place information.
This target allows you to filter against the locality information that is visible in the JSON output embedded within the twitter.place.attributes element.
The twitter.place.attributes target can be difficult to filter against unless you know exactly what is in there. The twitter.place.attributes.region attribute is a lower-level target. Since it is more specific, it is more valuable in filters.
In the US, this target typically contains the name of a state.
1. Filter for Tweets from Pensylvania:
The additional region information that is attached to a Tweet, similar to the twitter.place.attributes target. For US, this target typically contains the name of a state.
Filter for Tweets with additional region information set as North Carolina.
Additional street address detail for Twitter place information.
This target allows you to filter against the street address information that is visible in the JSON output embedded within the twitter.place.attributes element.
The twitter.place.attributes target can be difficult to filter against unless you know exactly what is in there. The twitter.place.attributes.street_address attribute is a lower-level target. Since it is more specific, it is more valuable in filters.
1. Filter for Tweets from the White House:
The additional street address information that is attached to a Tweet, similar to the twitter.place.attribtutes target.
Filter for Tweets with additional street address information set as London Eye.
The 'filtering' level of filtering applied to a Retweet.
Twitter's estimation of the importance of a Retweet. The values are:
The medium and high entries roughly correlate to the "Top Tweets" results for searches on twitter.com.
Here's how it works. Twitter assigns a filter level to each Tweet. For example, I might post a Retweet that Twitter decides to mark as "low", which means that they don't see it as particularly significant. A moment later, you post a Retweet and they mark it as "medium" which is the highest level currently possible.
This DataSift filter will deliver both of our Retweets:
twitter.retweet.filter_level in "low, medium"
While this one will find yours but not mine:
twitter.retweet.filter_level in "medium"
A list of hashtags mentioned in a Retweet.
A list of Twitter hashtags mentioned in a Retweet.
When using this target for filtering, ensure that you use the correct operator with this target.
== operator in operatorSince this target is an array and we are filtering for a list of values, we are using the == or in operator (not contains or contains_any).
1. Filter for Retweets that mention #LadyGaga or #Beyonce or #Shakira":
Note that the argument is "LadyGaga" and not "#LadyGaga"; you do not need to include the hashtag symbol in the CSDL.
A list of Twitter hashtags mentioned in a Retweet.
When using this target for filtering, ensure that you use the correct operator with this target.
== operator in operatorFilter for Retweets that mention the hashtags Apple, Samsung or Android.
The language (BCP 47 code) a Retweet was written in, as identified by Twitter.
The language a Retweet was written in, as identified by Twitter's machine language detection algorithms. The values are valid BCP 47 language identifiers, and may represent any of the languages listed on Twitter's advanced search page, or "und" if no language could be detected.
DataSift already has a language detection mechanism, of course, offered by our Language augmentation.
Remember that there is a third way to find out which language a user prefers. That's by examining the language the author of a Tweet selected in their Settings page on Twitter. You can filter against this in twitter.user.lang, twitter.retweet.user.lang, or twitter.retweeted.user.lang. Take care, though. Users usually choose a language from a drop-down list, and it's usually their main language. There's no guarantee that this is actually their real one or, most importantly, that this is the language of the current Tweet: many users in fact write messages in different languages, so there might be a discrepancy between the language of the tweet and the main language of the user as specified in their profile.
A string displayed in place of the URL to an image included in a Retweet.
A string displayed in place of the URL to an image included in a Retweet.
When an author uploads an image to attach to their Tweet, Twitter automatically assigns it a unqiue display URL that is included in the Tweet, instead of the actual URL to an image. The display URL is also included in Retweets. You can use this target to filter for content if you know the display URL of an image attached to a Retweet.
1. Filter for Retweets that mention @datasift and include an image with a particular display URL:
A string displayed in place of the URL to an image included in a Retweet.
When an author uploads an image to attach to their Tweet, Twitter automatically assigns it a unqiue display URL that is included in the Tweet, instead of the actual URL to an image. The display URL is also included in Retweets. You can use this target to filter for content if you know the display URL of an image attached to a Retweet.
Filter for Retweets that mention @datasift and include an image with the display URL "http://pic.twitter.com/CQJ0CPqc".
The URL of an image attached to a Retweet.
The URL of an image attached to a Retweet.
A Retweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. You can use this target to filter for content if you know the URL of a media file included in a Retweet. For instance, suppose a celebrity or brand share an image on Twitter, you can use this target to filter for Retweets of the image and find out what people are Tweeting about it.
Remember, the URL of an image is different than the display URL visible in the Retweet itself. To view the full URL, click on the display URL of an image in a Retweet. Then right-click on the image and click View Image. The full URL is then visible in the address bar of your browser.
1. Filter for Retweets that mention Lady Gaga and include a particular image:
The URL of an image attached to a Retweet.
A Retweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. You can use this target to filter for content if you know the URL of a media file included in a Retweet. For instance, suppose a celebrity or brand share an image on Twitter, you can use this target to filter for Retweets of the image and find out what people are Tweeting about it.
Remember, the URL of an image is different than the display URL visible in the Retweet itself. To view the full URL, click on the display URL of an image in a Retweet. Then right-click on the image and click View Image. The full URL is then visible in the address bar of your browser.
Filter for Retweets that mention "@LadyGaga" and include a particular image.
The type of media included in a Retweet.
The type of media included in a Retweet.
A Retweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Ensure that when you use this target, you filter for the media type 'photo'. For instance, you can use this target to filter for Retweets that contain an image and mention a particular keyword.
1. Filter for Retweets with photo media attached and mentioning "music":
The type of media included in a Retweet.
A Retweet can include usernames, hashtags, media, and the author's location. At the time of writing, the only type of media available is images. Ensure that when you use this target, you filter for the media type 'photo'. For instance, you can use this target to filter for Retweets that contain an image and mention a particular keyword.
Filter for Retweets with photo media attached and mentioning Lady Gaga.
The financial symbols (cashtags) in a Retweet.
Financial ticker symbols (cashtags) identify stocks in Tweets. For example, Google stock is $GOOG and Apple stock is $AAPL.
The twitter.symbols target makes it easy to filter for these.
See also: twitter.symbols.
1. Filter for Retweets that mention Apple or Google stock:
2. Filter for Retweets that mention Microsoft and Intel stock:
Two-character language code that a Retweeting user selected on Twitter's Settings page.
The language a Retweeting user selected from the drop-down list on their Settings page.
Note that the value may be misleading. For example, a user might select French as their mother tongue but compose their Tweets in English. DataSift offers an augmentation called Language which determines the language based on a statistical analysis of 3-letter groups of characters.
The drop-down list shows the full name of the language (English, French, German, and so on) but the twitter.user.lang target contains the two-character code for the language.
Take a look at the list of languages that Twitter supports.
1. Filter for Retweets from authors who specify French as their preferred language:
2. Filter for Retweets from authors who specify French as their preferred language but Tweet in English:
The language a Retweeting user selects from the drop-down list on their Twitter Settings page. Note that the value may be misleading. For example, a user might select French as their mother tongue but compose their Tweets in English. DataSift offers an augmentation called Language which determines the language based on a statistical analysis of 3-letter groups of characters.
Filter for Retweets from authors who specify French as their preferred language.
If the Twitter has verified the Retweeting user, this target is set to 1.
Allows you to filter for Retweets by authors whose accounts have been verified. The verification process typically applies to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be. For example, if you visit Lady Gaga's Twitter page, you'll notice a white checkmark over a blue cloud next to her name. This is Twitter's icon for a verified account.
1. Filter for Retweets by verified users of Tweets by verified users:
2. Filter for Retweets by unverified users of Tweets by verified users:
3. Filter for Retweets by verified users of Tweets by unverified users:
1. This target is an integer. If the account has been verified, twitter.retweet.user.verified is set to 1.
2. If the user is unverified, the target is not populated.
3. DataSift offers three similar targets that help you to filter for content by verified users:
| twitter.user.verified | Filter for Tweets by verified users | |
| twitter.retweet.user.verified | Filter for Retweets by verified users. The author of the original Tweet might or might not be vertified. | |
| twitter.retweeted.user.verified |
Filter for Retweets of Tweets by verified users. The Retweeting user might or might not be verified. |
Allows you to filter for Retweets by users whose accounts have been verified. A verified Twitter account usually belongs to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be.
Filter for Retweets from the verified account of Mitt Romney.
Additional location information about a Tweet that has been Retweeted.
Additional location information about a Tweet that has been Retweeted.
1. Filter for Retweets that include twitter.retweeted.place.attributes information:
Additional location information about a Tweet that has been Retweeted.
Filter for Retweets that include location information about the original Tweet.
A URL containing a Twitter API call to retrieve additional place information.
A URL containing a Twitter API call to retrieve additional place information about the place of origin of a Tweet that has been retweeted. For example, if twitter.retweeted.place.url contains http://api.twitter.com/1/geo/id/d9ebaeaac39060be.json, the Tweet originated from the town of Ourense in Northern Spain.
If you make that API call, Twitter returns a set of points representing a polygon around the town, plus additional meta data.
1. To see examples of Retweets that include twitter.retweeted.place.url:
Here's the JSON output for that API call for Ourense:
A URL containing a Twitter API call to retrieve additional place information about the place of origin of a Tweet that has been retweeted. If you make that API call, Twitter returns a set of points representing a polygon around the town, plus additional meta data.
If the Twitter has verified the user, this target is set to 1.
Allows you to filter for Retweets of Tweets by authors whose accounts have been verified. The verification process typically applies to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be. For example, if you visit Lady Gaga's Twitter page, you'll notice a white checkmark over a blue cloud next to her name. This is Twitter's icon for a verified account.
1. Filter for Retweets by verified users who have more than 100,000 followers:
2. Filter for Retweets from verified users with "official" in their username:
1. This target is an integer. If the account has been verified, twitter.retweeted.user.verified is set to 1.
2. If the user is unverified, the target is not populated.
3. DataSift offers three similar targets that help you to filter for content by verified users:
| twitter.user.verified | Filter for Tweets by verified users | |
| twitter.retweet.user.verified | Filter for Retweets by verified users. The author of the original Tweet might or might not be vertified. | |
| twitter.retweeted.user.verified |
Filter for Retweets of Tweets by verified users. The Retweeting user might or might not be verified. |
Allows you to filter for Retweeted users whose accounts have been verified. A verified Twitter account usually belongs to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be.
Filter for Retweets from the verified account of Barack Obama.
If the object is a status message from Twitter, this target contains its type.
If the object is a status message from Twitter, this target contains its type. It can take these values:
| user_protect | The user has protected (made private) their account. |
| user_unprotect | The user has unprotected (made public) their account. |
| user_suspend | Twitter has suspended a user's account. |
| user_unsuspend | Twitter has released the suspension of a user's account. |
| user_delete | Twitter has deleted one a user's account. |
| user_undelete | Twitter has restored a deleted account. |
| user_withheld | Not currently used. |
| status_withheld | Not currently used. |
Please refer to our Twitter User Status Messages documentation for guidelines on how to process each of these messages.
1. Filter for users who have unprotected their Twitter account. That is, formerly private accounts that have recently been made public:
This target denotes the type of a status message. If the object is a status message from Twitter, this target contains its type.
Filter for Tweets from users who have their deleted Twitter account restored.
The financial symbols (cashtags) in a Tweet.
Financial ticker symbols (cashtags) identify stocks in Tweets. For example, Google stock is $GOOG and Apple stock is $AAPL.
The twitter.symbols target makes it easy to filter for these.
See also: twitter.retweet.symbols.
1. Filter for Tweets that mention Google or Apple stock:
Notice that we omit the $ symbol.
2. Filter for Tweets that mention Microsoft and Intel stock:
Again, notice that we omit the $ symbol.
At signup, Twitter users can indicate their native language. This appears in these DataSift targets:
The list Is expanding over time. At the time of writing, Twitter offers:
The number of Tweets a user has favorited.
The number of Tweets a user has favorited.
Twitter use the UK spelling, "favourite_count" in their output, and we follow their convention, both in our target name and our output.
1. Filter for Tweets about ice cream from users who have favorited more than 50 Tweets:
Take care when you use this target in isolation. For example, this filter for Tweets from users who have favorited more than 25 Tweets looks fine, but since most Twitter users have favorited that many Tweets, your filter will return almost the entire content of Twitter in real time:
twitter.user.favourites_count > 25
If you ran this filter you would quickly reach your rate limit.
If the Twitter has verified the user, this target is set to 1.
Allows you to filter for Tweets by authors whose accounts have been verified. The verification process typically applies to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be. For example, if you visit Lady Gaga's Twitter page, you'll notice a white checkmark over a blue cloud next to her name. This is Twitter's icon for a verified account.
1. Filter for verified users with more than 100,000 followers:
2. Filter for Tweets from unverified users with "official" in their username:
1. This target is an integer. If the account has been verified, twitter.user.verified is set to 1.
2. If the user is unverified, the target is not populated.
3. DataSift offers three similar targets that help you to filter for content by verified users:
| twitter.user.verified | Filter for Tweets by verified users | |
| twitter.retweet.user.verified | Filter for Retweets by verified users. The author of the original Tweet might or might not be vertified. | |
| twitter.retweeted.user.verified |
Filter for Retweets of Tweets by verified users. The Retweeting user might or might not be verified. |
Indicates whether a Twitter account is verified. A verified Twitter account usually belongs to individuals and organizations in the public eye, and serves to confirm that the person really is who they claim to be.
Filter for Tweets from the verified account of Barack Obama.
The number of lists a Retweeting user appears in.
The number of Twitter lists that the person making a Retweet appears in.
1. Filter for Retweets from people who appear on more than 300 lists:
The number of Twitter lists that the person making a Retweet appears in.
Filter for Tweets from users who appear in more than 300 Twitter lists.
The string description of the Twitter's user's location.
The string description of the Twitter user's location. This is supplied by the Twitter user on their profile page. It is a free-form field, so they can type anything they choose to:
1. Filter for Retweets from people who include Japan in their location description:
The string description of the Twitter user's location. This is supplied by the Twitter user on their profile page. It is a free-form field, so they can type anything they choose to.
Filter for Retweets from people who include Japan in their location description.
The "real name" a Retweeting user supplied in Twitter's settings page.
A Retweeting user's 'real' name. Since this is a free-text entry field in the Twitter settings, it could contain anything. It could be the user's genuine name, perhaps augmented with a title (such as Dr or Professor), a suffix (such as "BA" or "Jnr" or "III"), or a post nominal (such as "FRSC" or "FAIA") indicating membership of an organization or professional body.
The range of possibilities is wide and includes obvious parody accounts, and accounts that are run by companies or organizations. For instance, the @climagic account, which Tweets about Unix-based computer operating systems, has a "real name" of Command Line Magic.
Note that a Twitter user can adjust their real name over time. They can change their screenname (for example @LadyGaga) too, but their Twitter id cannot be changed.
1. Filter for Retweets from users who include "doctor" in their real name:
A Retweeting user's real name. Since this is a free-text entry field in the Twitter settings, it could contain anything. It could be the user's genuine name, perhaps augmented with a title (such as Dr or Professor), a suffix (such as "BA" or "Jnr" or "III"), or a post nominal (such as "FRSC" or "FAIA") indicating membership of an organization or professional body.
Filter for Retweets from users who have included "doctor" in their real name.
The number of days a Retweeting user been a member of Twitter.
The number of days since a Retweeting user joined Twitter.
1. Filter for Retweets from authors who joined Twitter more than 3 years ago:
The number of days since a Retweeting user joined Twitter.
Filter for Retweets from people who are in their second month of membership of Twitter.
The Retweeting user's Twitter username.
The Twitter username of the person who made the Retweet.
1. Filter for Tweets from authors who include "marketing" in their Twitter username:
The Twitter username of the person who Retweeted a Tweet.
Filter for Retweets from users who include "marketing" in their Twitter username.
The number of Tweets/Retweets a Retweeting user has posted.
The number of Tweets/Retweets a Retweeting user has posted.
You can use this, for example, to filter for Retweets from users who have posted more than 20,000 messages.
1. Filter for Retweets from prolific Twitterers:
The number of Tweets/Retweets a Retweeting user has posted.
Filter for Retweets from users who have Tweeted or Retweeted more than 20,000 times.
The Retweeting user's time zone.
The time zone that a Retweeting user selected in their Twitter settings.
1. Filter for Retweets from the same time zone as Tokyo:
2. Filter for Retweets from users who have not set London as their time zone:
The twitter.retweet.user.time_zone target is a string. It takes these values:
The time zone that a Retweeting user selected in their Twitter settings.
Filter for Retweets from users who have set New York City as their time zone.
The URL a Retweeting user added in their Twitter profile.
The URL a retweeting user added in their Twitter profile.
1. Filter for Retweets that include a twitter.retweet.user.url:
The URL a retweeting user added in their Twitter profile.
Filter for Retweets that include a twitter.retweet.user.url.
The unique ID of the Tweet that was Retweeted.
The unique ID of the Tweet that was Retweeted. It is stored as a string.
Note that Twitter treats user IDs and Tweet IDs differently.
1. If you know a ID of a Tweet, you can filter for all new Retweets of that original Tweet:
The unique id of the Tweet that was Retweeted
Filter for all new Retweets of an original Tweet by its unique id.
List of Twitter usernames mentioned in a Tweet that was Retweeted.
A list of Twitter usernames mentioned in a Tweet that was Retweeted.
1. Filter for Retweets of Tweets that mention @rihanna by her Twitter username:
Note that the target is an array and we are filtering for just one value so we use the == operator.
2. Filter for Retweets of Tweets that mention @rihanna or @Ladygaga by their Twitter username:
Note that the target is an array and we are filtering for one value from a list so we use the in operator.
List of Twitter usernames mentioned in a Tweet that was Retweeted.
Filter for Retweets of Tweets that mention @rihanna or @Ladygaga by their Twitter username.
List of Twitter user ids mentioned in a Tweet that was Retweeted.
A list of Twitter user ids mentioned in a Tweet that was Retweeted.
Take care to use the correct operator with this target. As the examples show, use == if you are filtering for just one username and use in if you are filtering for a list of two or more usernames.
1. Filter for Retweets that mention @ladygaga by the Twitter id 14230524:
Remember that the target is an array and we are filtering for just one value so we use the == operator.
2. Filter for Retweets that mention any of the Twitter ids from a list of ids:
Since the target is an array and we are filtering for one value from a list, so we use the in operator.
A list of Twitter user ids mentioned in a Tweet that was Retweeted.
Take care to use the correct operator with this target. As the examples show, use == if you are filtering for just one username and use in if you are filtering for a list of two or more usernames.
Filter for Retweets that mention @Beyonce by their Twitter id 31239408.
The country of origin of a Tweet that was Retweeted.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.retweeted.place.* targets are unpopulated for that Tweet.
The country that corresponds to the location supplied for the original Tweet. This target holds no information about the Twitter user who made the Retweet.
When a Tweet is Retweeted, twitter.place.country is copied to twitter.retweeted.place.country.
1. Filter for Retweets of Tweets made by users who indicatted that they're located in Canada:
The country that corresponds to the location for an original Tweet that is being Retweeted.
Filter for Tweets originally made by users located in Canada, that are being Retweeted.
The country code for a Retweeted Tweet.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.retweeted.place.* targets are unpopulated for that Tweet.
The two-character country code that corresponds to the location for an original Tweet that is being Retweeted.
When a Tweet is Retweeted, twitter.place.country_code is copied to twitter.retweeted.place.country_code.
You can learn more about country codes at this Wikipedia page.
1. Filter for Retweets of Tweets for which the user-supplied location information is in Canada:
The two-character country code that corresponds to the location for an original Tweet that is being Retweeted.
When a Tweet is Retweeted, twitter>>place>>country code is copied to twitter>>retweeted>>place>>country code.
Filter for Retweets of Tweets posted in Canada.
Full name of the origin of a Retweeted Tweet.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.retweeted.place.* targets are unpopulated for that Tweet.
The full name of the location the Twitter user supplied for an original Tweet that is being Retweeted. The user chooses from a Twitter database of locations.
For example:
When a Tweet is Retweeted, twitter.place.full_name is copied to twitter.retweeted.place.full_name.
See also: twitter.retweeted.place.name.
1. Filter for Retweets of Tweets for which the user-supplied location information is London, England:
The full name of the location the Twitter user supplied for an original Tweet that is being Retweeted. The user chooses from a Twitter database of locations.
For example:
When a Tweet is Retweeted, twitter>>place>>full_name is copied to twitter>>retweeted>>place.full_name.
Filter for Tweets from London, England that are being Retweeted.
The short name of the location from which a Tweet that is being Retweeted was posted.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.retweeted.place.* targets are unpopulated for that Tweet.
Typically, a shortened version of twitter.retweeted.place.full_name. For example:
| If twitter.retweeted.place.full_name is this: | twitter.retweeted.place.name might be this: |
| Charlotte, NC | Charlotte |
| London, England | London |
| Division No. 11, Alberta | Division No. 11 |
Note that there is a one-to-one mapping from twitter.place.full_name to twitter.place.name but a one-to-many mapping in the reverse direction.
In other words, this filter:
twitter.retweeted.place.full_name == "London, England"
always produces objects with twitter.place.name set to "London".
But this filter:
twitter.retweeted.place.name == "London"
produces objects with twitter.retweeted.place.full_name set to "London, England" or "London, Ohio" or other values.
1. Filter for Retweets of Tweets for which the user-supplied location information is London. There are several cities called London, and this filter delivers results from them all:
A shortened version of the name of the location from which a Tweet that is being Retweeted, was posted.
Typically, a shortened version of twitter>>retweeted>>place>>full name. For example:
| If the full name is this: | The placename might be this: |
| Charlotte, NC | Charlotte |
| London, England | London |
| Division No. 11, Alberta | Division No. 11 |
Note that there is a one-to-one mapping from twitter.place.full_name to twitter.place.name but a one-to-many mapping in the reverse direction.
Filter for Retweets of Tweets origiinally posted in London. There are several cities called London, and this filter delivers results from them all.
The type of location a Retweeted Tweet was made from.
NoteTwitter location data is available only if a Twitter user chooses to activate it. For each individual Tweet, they can choose whether or not to include location data and which location they want to specify, regardless of their real location. For example, a user in New York City could select Edinburgh, Scotland as the purported origin of a Tweet. If they choose not to include location data, all the twitter.retweeted.place.* targets are unpopulated for that Tweet.
The place_type supplied by Twitter. Consult Twitter's documentation for further details. Typical values are:
The poi value indicates a point of interest such as Stonehenge or Niagara Falls.
1. Filter for Retweets of Tweets from points of interest:
The place_type supplied by Twitter. Typical values are:
Filter for messages from likely tourist locations that mention a particular airline, hotel chain, restaurant chain, or beverage brand.
The source of the Retweeted Tweet. For example: "web" or "TweetDeck".
The source of an original Tweet that is being Retweeted. For example, "web" indicates that the original Tweet was posted via Twitter's website whereas "TweetDeck" indicates that it came from the TweetDeck Twitter client.
1. Filter for Tweets that were not sent from the Twitter.com user interface:
2. Filter for people Tweeting from a Blackberry:
Notice that we use the substr operator here because there is more than one Twitter client for the Blackberry.
The source of an original Tweet that is being Retweeted. For example, "web" indicates that the original Tweet was posted via Twitter's website whereas "TweetDeck" indicates that it came from the TweetDeck Twitter client.
Filter for Retweets that were originally sent from a Blackberry.
The biography information for a Twitter user who is being Retweeted.
The biography information for a Twitter user who is being Retweeted.
1. Filter for Retweets of Tweets posted by people who include the term "marketing" in their Twitter bio:
The biography information of the Twitter user whose Tweet is being Retweeted.
Filter for Retweets of Tweets written by users who include the term "technology" in their Twitter bio.
The number of followers a Retweeted author has.
The number of followers the author of a Retweeted Tweet has on Twitter.
Note that this data concerns the Retweeted author. It does not contain any information about the person who made the Retweet.
1. Filter for Retweets of Tweets from popular Twitterers:
The number of followers the author of a Retweeted Tweet has on Twitter.
Note that this data concerns the Retweeted author. It does not contain any information about the person who made the Retweet.
Filter for Retweets of Tweets written by people who have more than 1,000 followers.
Ratio of followers-to-following for a Twitter user who is being Retweeted.
The number of people who follow a Retweeted Twitter user divided by the number of people the user follows.
1. Filter for Retweets of Tweets by authors who are followed by 10 times more people than they follow:
2. Filter for Retweets of Tweets by authors who have posted more than 10,000 Tweets and who are followed by 50 times more people than they follow:
The ratio of the number of people who follow a Retweeted Twitter user and the number of people the user follows.
Filter for Retweets of Tweets by authors who are followed by 10 times more people than they follow.
The number of people a Retweeted user follows.
The number of people the Retweeted user follows.
1. Filter for Retweets of material from authors who follow more than 1,000 people:
The number of people a Retweeted user follows.
The Twitter id of a Retweeted user.
A Retweeting user's unique id on Twitter. For example, the Twitter id for the @DataSift Twitter account is 155505157.
The id is not exposed on the Twitter website but can be used if you write code to access the Twitter API.
Note that Twitter treats user IDs and Tweet IDs differently.
1. Filter for Retweets of Tweets from DataSift:
A Retweeting user's unique id on Twitter. For example, the Twitter id for the @DataSift Twitter account is 155505157.
The id is not exposed on the Twitter website but can be used if you write code to access the Twitter API.
Two-character language code for the language the Retweeted user has set Twitter to.
The language the Twitter user selected from the drop-down list on their Settings page.
Note that the value may not be reliable. For example, a user might select French as their mother tongue but compose their Tweets in English. DataSift offers an augmentation called Language which determines the language based on a statistical analysis of 3-letter groups of characters.
The drop-down list shows the full name of the language (English, French, German, and so on) but the twitter.user.lang target contains the two-character code for the language.
Take a look at the list of languages that Twitter supports.
1. Filter for Retweets of Tweets from authors who specify French as their preferred language:
2. Filter for Retweets of Tweets from authors who specify French as their preferred language but Tweet in English:
The language a Retweeted Twitter user selected on their Settings page.
It is an indication of their primary language but it might be misleading. For example, a user might select French as their mother tongue but compose their Tweets in English. DataSift offers an augmentation called Language which determines the language based on a statistical analysis of groups of characters.

Filter for Retweets of Tweets from users who specify French as their preferred language.
Number of lists this Retweeted User is listed in.
The number of Twitter lists that the author of the original Tweet appears on.
This target holds no information about the person who made the Retweet.
1. Filter for Retweets of Tweets from people who appear on more than 10,000 lists:
The number of lists on which the Twitter user, who has Retweeted a Tweet, appears.
Filter for Retweets of Tweets from people who appear on more than 100 lists.
The string description of the Retweeted Twitter User has posted.
The string description of the Twitter user's location. This is supplied by the Twitter user in the Location field on their Profile page. It is a freeform field, so they can type anything they choose to:
1. Filter for Retweets of Tweets from people who include Japan in their location description:
The string description of a Retweeted Twitter user's location. This is supplied by the Twitter user in the Location field on their Profile page. It is a freeform field, so they can type anything they choose.
Filter for Retweets of Tweets from authors who state that they live in "my home".
The "real name" the Retweeted user has assigned to themself.
The Retweeted user's real name.
1. Filter for Retweets of Tweets from users who have included "doctor" in their real name:
The real name supplied by a Retweeted user on Twitter. Since this is a free-text entry field in the Twitter settings, it could contain anything. It could be the user's genuine name, perhaps augmented with a title (such as Dr or Professor), a suffix (such as "BA" or "Jnr" or "III"), or a post nominal (such as "FRSC" or "FAIA") indicating membership of an organization or professional body.
Alternatively, the name might indicate a parody account, or an account run by a company or organization.
Filter for Retweets of Tweets from users that inclue "PhD" in their real name.
The number of days the Retweeted user has been a member of Twitter.
The number of days since the author of the original Tweet joined Twitter.
Note that this target holds no information about the person who sent the Retweet.
1. Filter for Retweets of Tweets from authors who joined Twitter more than three years ago:
The number of days since a Retweeted user joined Twitter.
Filter for Retweeted Tweets from users who have been Twitter members for more than a year.
The Retweeted user's Twitter username.
The Twitter username of the author of the original Tweet.
1. Filter for Retweets of Tweets from Lady Gaga:
2. Filter for Retweets of Tweets from official DataSift accounts or DataSift's senior management.
A Retweeted user's Twitter username.
Filter for Retweets of Tweets from users who include "Lady" in their Twitter username.
The number of Tweetsand Retweets a Retweeted Twitter user has posted.
The total number of Tweets and Retweets that a Retweeted Twitter user has posted.
1. Filter for Retweets of Tweets from prolific Twitterers:
The total number of Tweets and Retweets a Retweeted Twitter user has posted.
Filter for Tweets and Retweets from people who have posted more than 100,000 messages.
The Retweeted user's time zone.
The time zone that the Twitter user selected on their Settings page. This is the time zone for the original author, not for the person who Retweeted.
1. Filter for Retweets of Tweets from the same time zone as Tokyo:
2. Filter for Retweets of Tweets from users who have not selected London as their time zone:
twitter.retweeted.user.time_zone is a string. It takes these values:
The time zone that a Reetweeted Twitter user has selected in their Twitter Settings. For a complete list of the timezones that you can use, please consult out developer documentation.
Filter for Retweets of Tweets from people in the San Francisco timezone.
The URL the retweeted user included in their Twitter profile.
The URL the retweeted user included in the Website field in their Twitter profile.
1. Look for retweets that include a twitter.retweeted.user.url:
2. Filter for Retweeted Tweets from users who who include apple.com in the Website field in their settings page.
The URL a Retweeted user has included in their Twitter profile.
This object is an example of the format of a Retweet after DataSift has normalized and augmented the content:
Twitter has different types of IDs
| ID: | Type: |
| User ID | int |
| Tweet or Retweet ID | string |
For example:The popularity of Twitter means that Tweet IDs now exceed 32 bits so they are now presented as strings.
twitter.user.id == 155505157 and twitter.retweeted.id == "104502373526945792"
The twitter.user namespace exists for Tweets but not for Retweets so this filter ignores Lady Gaga's Retweets:
twitter.user.screen_name == "ladygaga"
To find Retweets only, use this code:
twitter.retweet.user.screen_name == "ladygaga"
To find Lady Gaga's Tweets and Retweets:
interaction.author.username == "ladygaga"
This object is an example of the format of a Tweet when it first reaches DataSift:
By default Twitter disables location information.
Twitter users can opt in by selecting "Add a location to your Tweets" on their Settings page.
The content of twitter.place.place_type is blank unless they opt in. Further, users choose on a Tweet-by-Tweet basis whether to include place information.
Once users have opted in, they must select a location known to Twitter. Based on the user's selection, Twitter includes additional information such as:
Note that Twitter performs no verification on this field so users are not forced to enter accurate information. A user in New York City is free to enter Edinburgh, Scotland as their location.
This object is an example of the format of a Tweet after DataSift has normalized and augmented the content:
The application that generated an update.
The application that generated a Facebook post. For example:
1. Filter for content from applications that include "phone" in their name:
The mobile application that generated a Facebook post.
Filter for Facebook content from the application Facebook for Blackberry® smartphones.
A link to an author's avatar.
A link to an author's avatar.
1. This target is rarely used in filtering, but the data is delivered in the output object. Your client software can process it if you chose to use it. To see examples of facebook.author.avatar:
The URL to the avatar of a Facebook user.
This target is rarely used in filtering, but the data is delivered in the output object.
A Facebook author's id.
An author's unique Facebook id. It is numeric but held in a string.
1. To see some objects that have Facebook author ids:
2. To search for post from an author using their id. Note that the argument is a string:
The unique id assigned to a Facebook user.
Filter for Facebook posts from a particular Facebook user by their id.
A link to an author's Facebook profile page.
A link to an author's Facebook profile page.
1. Filter for content from authors based on their Profile page only:
2. Filter for a particular Profile page:
The URL to a Facebook user's profile page.
Filter for Facebook content from a particular user by a link to their profile page.
An author's real name on Facebook.
An author's real name on Facebook. This is supplied on signup but it can subsequently be changed by the user.
1. Filter for content from authors who include "apple" in their name:
2. Filter for content from authors with the name "Joe Smith":
The real name supplied by a Facebook user.
Filter for Facebook content from users who include "Jobs" in their name.
The contents of the Facebook caption.
The caption for a Facebook post.
|
This type of message: |
Has a facebook.caption: |
|
status |
no |
|
link |
yes |
|
photo |
yes |
|
video |
yes |
1. Filter for Facebook videos that include "nascar" in their caption:
The caption to a Facebook post such as a link, a photo, and a video.
Filter for Facebook content that includes "football" in their caption.
The content of the Facebook description.
The description for a Facebook post.
|
This type of message: |
Has a facebook.description: |
|
status |
no |
|
link |
yes |
|
photo |
yes |
|
video |
yes |
1. Filter for photographs that include "apple" in the description:
The description text accompanying a Facebook post such as a link, a photo, and a video.
Filter for Facebook content where description mentions the word "iPhone 5".
The number of times that a Facebook post has been "liked".
The number of times that a Facebook post has been "liked".
1. Filter for Facebook posts that have been "liked" more than five times:
The number of times that a Facebook post has been "liked".
Filter for posts that have been "liked" at least 30 times.
A list of the ids of Facebook members who have "liked" a post.
Filter for Facebook posts that have been "liked" by specified Facebook user. With this targert, you specify the user by their Facebook id.
Note that the argument is an array of int values.
See also facebook.likes.names.
1. Filter for Facebook posts that have been "liked" by a specified Facebook user:
2. Filter for Facebook posts that have been "liked" by any member of a list of Facebook users:
The user ids of Facebook users who have "liked" a post.
Filter for Facebook content that has been "liked" by a specific Facebook user based on their user id.
A list of the names of Facebook members who have "liked" this post.
A list of the names of Facebook members who have "liked" this post.
Filter for Facebook posts that have been "liked" by specified Facebook users. With this targert, you specify users by their Facebook name.
See also facebook.likes.ids.
1. Filter for Facebook posts that have been "liked" by a specified Facebook user:
2. Filter for Facebook posts that have been "liked" by any member of a list of Facebook users:
The names of Facebook users who have "liked" this post.
Filter for Facebook content that has been "liked" by users with the name "Bill Gates".
The link contained in a post.
The link contained in a Facebook post.
| This type of message: | Has facebook.link: |
|
status |
no |
|
link |
yes |
|
photo |
yes |
|
video |
yes |
1. Filter for Facebook posts that link to content on Flickr:
The URL contained in a Facebook post such as a link, a photo, and a video.
Filter for Facebook content with a URL containing "nytimes.com".
The content of a Facebook message.
The main content of a post on Facebook.
1. Filter for content that mentions "apple":
2. Filter for content that mentions "Apple" and does not mention "Microsoft":
The main content of a post on Facebook.
Filter for Facebook content that mentions Burberry.
The Facebook name assigned to a post.
The name associated with a Facebook status update, link, picture, or video.
1. Filter for content that has "music" in the facebook.name:
2. Filter for an exact match against "Lady Gaga video":
The name associated with a Facebook status update, link, picture, or video.
Filter for Facebook content that contains "Lady Gaga" in the name.
The Open Graph 'by' for a post.
The Open Graph 'by' information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook messages that include Open Graph facebook.og.by information:
The Open Graph 'by' information for a Facebook post.
Filter for Facebook content using the Open Graph 'by' information of a Facebook post.
The Open Graph description for a post.
The Open Graph description information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook messages that include Open Graph description information:
The Open Graph 'description' information for a Facebook post.
Filter for Facebook content using the Open Graph 'description' information of a Facebook post.
The Open Graph length for a post.
The Open Graph length information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook messages that include Open Graph length information:
The Open Graph 'length' information for a Facebook post.
Filter for Facebook content using the Open Graph 'length' information of a Facebook post.
The Open Graph Location for a post.
The Open Graph location information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
To see Facebook messages that include an Open Graph location:
The Open Graph 'location' information for a Facebook post.
Filter for Facebook content using the Open Graph 'location' information of a Facebook post.
The Open Graph photos information for a post.
The Open Graph photos information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook messages that include Open Graph photographs:
The Open Graph 'photos' information for a Facebook post.
Filter for Facebook content using the Open Graph 'photos' information of a Facebook post.
The Open Graph title for a post.
The Open Graph title for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook posts that include an Open Graph title:
The Open Graph 'title' information for a Facebook post.
Filter for Facebook content using the Open Graph 'title' information of a Facebook post.
The Open Graph type for a post.
The Open Graph type information for a Facebook post.
To learn more, visit Facebook's Open Graph developer's page or look at the Open Graph protocol.
1. To see Facebook messages that include Open Graph type information:
The Open Graph 'type' information for a Facebook post.
Filter for Facebook content using the Open Graph 'type' information of a Facebook post.
The name and id of the source application that generated this data.
The name and id of the source application that generated this data. For example:
|
If facebook.application contains: |
facebook.source might contain: |
|
LG Phone |
LG Phone (53702860994) |
|
Facebook for iPhone |
Facebook for iPhone (6628568379) |
|
Facebook for Every Phone |
Facebook for Every Phone (139682082719810) |
|
Facebook for BlackBerry® smartphones |
Facebook for BlackBerry® smartphones |
1. Filter for content from applications that include "phone" in their source:
The name and id of the source application that generated the Facebook content.
Filter for Facebook content from iPhone users by specifying "iPhone, 6628568379".
List of Facebook user ids that a message was send to.
An array of strings containing the Facebook ids of all the Facebook users a post was written to.
Another array of strings, facebook.to.names, contains their names.
1. To see Facebook messages that include a specified facebook.to.ids:
2. To see Facebook messages that include at least one of these facebook.to.ids:
The user ids of Facebook members to whom a message was sent to.
Filter for Facebook messages that include a specified Facebook user.
List of Facebook user names a message was sent to.
An array of strings containing the Facebook names of all the Facebook users a post is written to.
DataSift offers another target, facebook.to.ids, which is an array of their ids.
1. To see Facebook messages that include one specified value for facebook.to.names:
2. To see Facebook messages that include at least one name from a list of facebook.to.names:
The list of Facebook usernames a message was sent to.
Filter for Facebook messages that were sent to a specified Facebook username.
The type of an update.
The type of message the author has posted on Facebook:
1. Filter for messages that contain videos:
2. Filter for messages that contain neither photographs nor videos:
The type of content a user has posted on Facebook. It can be either a status message, a link, a photo, or a video.
Filter for status updates and links posted on Facebook.
When you write external code for 2channel, bear in mind that some languages (including Java and PHP) cannot handle variable names that start with a number.
If you write purely in CSDL, you do not need to take any special precautions. If you write external code using DataSift's client libraries, the libraries will handle the issue for you (as they use the array notation) so, again, you do not need to take any additional action.
However, if you write external code that hits a DataSift API directly, be aware that you must take extra care. For example, in PHP you might avoid the problem by changing this code:
$obj->2ch->author->name
to this code:
$xyz = "2ch";
$obj->{$xyz}->author->name
or this code:
$obj->{'2ch'}->author->name
An alternative approach is to work in array notation rather than object notation, to ensure that you avoid problems:
$interaction['2ch']['author']['name'];
The real name supplied by the author of a post.
The real name supplied to 2Channel by the author of a post.
1. Filter for content from one named author:
2. Filter for content from anyone named "xyz":
The name associated with content from 2Channel data source.
Filter for 2Channel content from users with "John" in their real name.
The text of a 2ch post.
The text of a 2Channel post. The majority of 2Channel content is written in Japanese. There is some content in English.
1. Filter for posts that mention the "iPad":
The text contained in a 2Channel post.
Filter for 2Channel content for posts containing "iPad".
Currently this is always set to "HTML".
Currently this is always set to "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a 2Channel interaction.
Filter for 2Channel content with "HTML" as its default format.
A link to an original post.
A link to an original post.
1. To see some examples of the content of 2ch.link:
2. Filter for a particular value in 2ch.link:
A URL to 2Channel content.
This target is rarely used for filtering. You can filter using this target if you have the direct URL for 2Channel content.
A link to a complete message board thread.
A link to a complete message board thread.
1. To see examples of the content of 2ch.thread:
The URL to a complete 2Channel thread.
This target is rarely used for filtering. You can filter using this target if you have the direct URL for a 2Channel thread.
The title of a post.
The title of a post.
1. Filter for 2channel posts that include "iPad" in their title:
The title of a post on 2Channel.
Filter for 2Channel posts that mention the Apple in their title.
Indicates whether a post contains the first message in a new thread.
Indicates whether a post contains the first message in a new board thread or a response to an existing message in a thread.
It can take these values:
1. Filter for posts that start a thread and that mention Toyota:
2. Filter for posts that respond to a thread and that mention Honda:
Indicates whether the 2Channel content is a "thread" or a "post".
Filter for 2Channel "threads" that mention Toyota.
A link to an author's profile.
A link to an author's profile page.
1. To see examples of links:
Posts from Amazon can be user generated or site generated. This target has meaning for user-generated posts only.
A URL to an author's profile page on Amazon. This target filters for user-generated posts only.
Filter for content from Amazon where you have the URL to the author's profile page.
The real name of an author.
The real name of the author of a message.
1. Filter for content from one named author:
2. Filter for content from anyone named "Smith":
Posts from Amazon can be user generated or site generated. This target has meaning for user-generated posts only.
The name supplied by the author of a message on Amazon. This target has meaning for user-generated posts only.
Filter for Amazon content posted by people named "John".
The main content of a post.
The main content of a post. It can be user-generated text or, for an Amazon product page, system-generated text.
1. Filter for content that mentions "Google":
The main content of an Amazon post. The text can be user-generated or system-generated.
Filter for Amazon content that mentions Lego.
The format of the content of a post.
The format of a post; currently always "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of an Amazon post. By default it is "HTML", but this may change in the future.
Filter for Amazon content by content type "HTML".
Link to a user-generated post or to an Amazon page.
Link to a user-generated post or to an Amazon page.
1. To see some examples of the content of amazon.link:
2. Filter for a particular value in amazon.link:
The URL to user-generated Amazon post or to an Amazon page.
Filter for Amazon content by checking if the URL contains the substring "discount".
Link to the complete message thread.
Link to the complete message thread. This is for user-generated content on Amazon.
1. To see examples of the content of amazon.thread:
The URL to a complete message thread on Amazon. You can filter for user-generated content on Amazon.
Filter for threads that mention anyone from a list of Amazon authors you specify.
The title of an Amazon-generated page or a piece of user-generated content.
The title of an Amazon-generated page or a piece of user-generated content.
1. Filter for content that includes "iPad" in the title:
The title of a system-generated page or a piece of user-generated content on Amazon.
Filter for Amazon content with "iPad" in its title.
Indicates whether an message is the first message in a new thread or a response to an existing message.
Indicates whether a message contains the first message in a new board thread or a response to an existing message in a thread.
It can take these values:
1. Filter for messages that start a thread and that mention comedy.
1. Filter for messages that respond to a thread and mention "Apple":
Indicates whether the Amazon content is a "thread" or a "post".
Filter for Amazon "threads" that mention Pepsi.
A link to a member's profile page.
A link to a member's profile page on Dailymotion.
1. To see examples of the content of this field:
The URL to a Dailymotion member's profile page.
Filter for all the Dailymotion content by a member using the direct URL to their profile page.
The username of the individual who uploaded a video.
The username of the person who uploaded a video.
1. Filter for videos from authors who have a particular string as part of their username:
The username supplied to Dailymotion by a member uploading content.
Filter for Dailymotion content uploaded by users with the word "apple" in their username.
The 'category' selected for a video when it was uploaded.
The 'category' selected for a video when it was uploaded.
1. Filter for videos that belong to a "news" category.
The 'category' of a video uploaded on Dailymotion. Since it is not mandatory to select a category, some videos are not classified by category.
Filter for Dailymotion content that belong to the "technology" category.
The text associated with a video.
The text associated with a video.
1. Filter for videos that mention the iPad in their content:
The text or comment associated with Dailymotion content.
Filter for Dailymotion content by text or comments that mention iPad.
Currently this is always set to "HTML".
Currently this is always set to "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a message on Dailymotion. Currently, it is by default set to "HTML".
Since this target is currently set to "HTML" in all cases, it is not used in filtering yet.
The duration of a video.
The duration of a video, measured in seconds.
1. Filter for videos that are less than 20 seconds long:
The duration of a Dailymotion video, measured in seconds.
Filter for videos shorter than 3600 seconds.
List of the 'tags' applied to the video when it was uploaded.
When a user uploads a video, they can add tags. For example, a new video about DataSift might include this set of tags:
"DataSift, big data, new feature, Twitter"
1. Filter for videos tagged with "Lady Gaga":
When a user uploads a video on Dailymotion, they can add tags to describe the content of the video.
Filter for Dailymotion videos with the tags "apple, ipad, iphone".
A thumbnail photo associated with a video.
A link to a thumbnail image from a video.
1. To see examples of the content you can filter against in dailymotion.thumbnail:
The URL to a thumbnail image representing a Dailymotion video.
Filter for Dailymotion content where the URL to the thumbnails contains the words "iphone".
The title of a video.
A string containing the title of a video.
1. Filter for videos that have "Lady Gaga" in their title:
The title of a Dailymotion video.
Filter for Dailymotion content with "Coca Cola" in the title.
A link to a video.
A link to a page that displays a video.
1. To see example content for dailymotion.videolink:
The URL to a Dailymotion page that displays a video.
Filter for Dailymotion content by the URL which contains the word "football".
A link to an author's profile on Flickr.
A link to the user's profile.
1. To see examples of links:
The URL to a member's profile page on Flickr.
Filter for Flickr content by the link to a member's profile page.
A member's real name.
A Flickr member's real name.
1. Filter for content from one named Flickr member:
2. Filter for content from anyone named "Smith":
The name supplied by a member on Flickr.
Filter for Flickr content from members named "Steve Jobs".
A member's username.
The username a member has chosen on Flickr.
1. Filter for content from members who have a particular string as part of their username:
The username that a member has chosen on Flickr.
Filter for Flickr content from members who have "kittens" as part of their username.
The main content.
The main content of a post.
1. Filter for Flickr content that mention dogs.
The content associated with a Flickr image.
Filter for Flickr images related to flowers.
The format of the content.
The format of the post; currently always "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a post on Flickr. Currently it is set to "HTML" by default.
Since this target is currently set to "HTML" by default for all content, it is not used in filtering yet.
Link to the dedicated page for a piece of content.
Link to the dedicated page for this image or other content.
1. To see some examples of the content of flickr.link:
2. Filter for a particular value in flickr.link:
The URL to a dedicated page for Flickr content.
Filter for Flickr content if you know the direct URL to the content on Flickr.
Link to the complete message board thread.
Link to the complete message board thread.
1. To see examples of the content of board.thread:
The URL to a complete message board thread on Flicrk. This target enables you to see the first message and all the responses.
Filter for Flickr content where the message board URL contains the word "instagram".
The title of a post.
The title of a post.
1. Filter for content that includes "McDonald's" in the title:
The title of a post on Flickr.
Filter for content that includes "Apple" in the title.
Indicates whether this is the first message in a new thread.
Indicates whether this is the first message in a new thread or a response to an existing message. It can take these values:
1. Filter for posts that start a thread and that mention baseball.
This target indicates whether a Flickr message is a "thread" or a "post". If a message is a "thread", it is identified the first message in a thread. If it is a "post", it is identified as a subsequent message in the thread.
Filter Flickr content for "threads" that mention baseball.
A link to an author's profile page.
A link to an author's profile page.
1. To see examples of links:
The URL to a member's profile page on IMDb.
Filter for IMDb content where the URL contains "tt0454876".
A member's real name.
A member's real name, as supplied to IMDb on signup.
1. Filter for content from one named IMDb member:
2. Filter for content from anyone named "Smith":
A member's real name, as supplied to IMDb on signup.
Filter for IMDb content from members with "John" in their name.
The main content of a post.
The main content of a post.
1. Filter for content that mentions "Star Wars":
The text or comment associated with an IMDb post.
Filter for IMDb content with text or comments that mentions Lord of the Rings.
The format of the content.
The format of the post; currently always "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of an IMDb post. Currently it is set as "HTML" by default.
You can use this target to check whether any new formats are available for IMDb content.
A link to the post.
A link to the individual page for the post.
1. To see some examples of the content of 2ch.link:
2. Filter for a particular value in 2ch.link:
The URL to an original post on IMDb.
This target is rarely used in filtering unless you are filtering for specific IMDb content and have the link to it.
Link to the complete message thread.
Link to the complete message thread.
1. To see examples of the content of imdb.thread:
The URL to a complete message board thread on IMDb.
Filter for IMDb content where the link to a message board thread contains "tt0454876".
The title of a post.
The title of a post.
1. Filter for 2channel posts that include "Star Wars" in their title:
The title of a post on IMDb.
Filter for IMDb content that mentions the King Kong in their title.
Indicates whether the interaction contains the first message in a new thread.
Indicates whether the interaction contains the first message in a new board thread or a response to an existing message in a thread.
It can take these values:
1. Filter for posts that start a thread and that Ocean's Eleven:
2. Filter for posts that respond to a thread and that Ocean's Twelve:
Indicates whether a message posted on IMDb is a "thread" or a "post".
If the message is a "thread", it is identified as the first message in a thread. If it is a "post", it is identified as a subsequent message in a thread.
Filter for IMDb posts that are in response to a thread about "Life of Pi".
A link to an author's profile page.
A link to an author's profile page.
1. To see examples of links:
The URL to a member's profile page on Reddit.
Filter for Reddit content by a specific member's profile page link.
The 'real' name of the author of the post.
The 'real' name of the author of the post.
1. Filter for content from one named author:
The real name supplied by a member to Reddit on signup.
Filter for Reddit content by members with "John" as their name.
The text of a post.
The text of a post.
1. Filter for posts that mention the iPad:
The text of a Reddit post.
Filter for Reddit content that mentions the iPad.
Currently this is always set to "HTML".
Currently this is always set to "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a Reddit post. Currently, the format is set to "HTML" by default.
You can use this target to check for newer formats of Reddit posts.
A link directly to the post.
A link to the page for this post.
1. To see examples of the content of reddit.link:
The URL to an original Reddit post.
Rarely used in filtering. You can use this target if you have the direct link to a specific Reddit post.
Link to the complete message board thread.
Link to the complete message board thread.
1. To see examples of the content of reddit.thread:
The URL to a complete message board thread on Reddit.
Rarely used for filtering. You can use this target if the direct link to a specific message board thread on Reddit.
The title of the post.
The title of the post.
1. Filter for posts that include "iPad" in their title:
The title of a Reddit post.
Filter for Reddit content that mentions the iPad in its title.
Indicates whether a message is the first in a new board thread.
Indicates whether a message is the first in a new board thread or a response to an existing message in a thread.
It can take these values:
1. Filter for posts that start a thread and that mention baseball.
2. Filter for posts that respond to a thread and mention "Apple":
Indicates whether a Reddit message is a "thread" or a "post".
If it is a "thread", then it is identified as the first message in a thread. If it's a "post", it is identified as a subsequent message in a thread.
Filter for Reddit content, specifically "threads" that mention CNN.
A link to an author's profile page.
A link to an author's profile page on YouTube.
1. To see examples of the content of this field:
The URL to a member's profile page on YouTube.
Rarely used for filtering. You can use this target if you want filter using the link to a member's profile page.
The 'real' name supplied by an author.
The 'real' name supplied by the author of a video or video comment.
1. Filter for YouTube videos posted by a particular YouTube member:
The real name supplied by the YouTube member who uploaded a video or commented on video.
Filter for YouTube content posted by members named "John".
The category selected for a video when it was uploaded.
The YouTube category selected for a video when it was uploaded.
Note that this is not a compulsory selection in YouTube so you will probably encounter videos that do not belong to any category.
1. Filter for videos in the "Film & Animation" category:
At the time of writing, the YouTube categories are:
The YouTube category selected for a video when it was uploaded.
Filter for YouTube videos in the "Film & Animation" category.
A link to the comments page for a video.
A link to the comments page for a video.
1. To see some examples of the content of youtube.comments.link:
The URL to a comments page for a video on YouTube.
Rarely used for filtering. You can use this target if you want to filter using the direct link to the comments page of a specific video on YouTube.
The text or comment associated a the video.
The text or comment associated with a video.
1. Filter for videos with text or comments that mention the iPad:
2. Filter for content from all our data sources (including, for example, message boards, Twitter, Facebook, and so on) that mention the iPad:
The text or comment associated with a YouTube video.
Filter for videos with text or comments that mention the iPad.
The format of the content.
The format of the message; currently always "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a message on YouTube. Currently it is set to "HTML" by default.
Rarely used for filtering. You can use this target to check for new formats of messages on YouTube.
The duration of a video.
The duration of a video, measured in seconds.
1. Filter for very short YouTube videos:
2. Filter for videos that are betweet 55 and 65 seconds long:
The duration of a YouTube video, measured in seconds.
Filter for YouTube videos shorter than 20 seconds. You can use this target in combination with youtube.content, to filter for videos of a specified length about a particular topic.
List of the 'tags' applied to a video when it was uploaded.
When a user uploads a video, they can add tags. For example, a new video about DataSift might include this set of tags:
"DataSift, big data, new feature, Twitter"
1. Filter for videos tagged with "Lady Gaga":
2. Filter for videos tagged "iPad" or belonging to a category that includes the word "mobile". Not all video sites include tags so you cannot guarantee that they will be present. When you filter against video.tags, you might choose to include a logical OR command:
When a user uploads a video on YouTube, they can add tags to describe the content of the video.
Filter for YouTube videos that contain the tags "Apple, brand new, iPhone 5".
A thumbnail photo associated with a video.
A link to a thumbnail image from a video.
1. To see examples of the content you can filter against in youtube.thumbnail:
The URL to a thumbnail image from a YouTube video.
Rarely used for filtering.
The title of a video.
A string containing the title of a YouTube video.
1. Filter for videos that have "Lady Gaga" in their title:
The title of a YouTube video.
Filter for videos with Samsung in the title.
Indicates whether this is a new video or a comment about an existing video.
This target matches "video" if the interaction contains a video and "comment" if it contains a "comment" about a video.
1. Filter for YouTube videos and exclude comments about videos:
Indicates whether this is a new YouTube video or a comment about an existing youTube video.
Filter for YouTube videos on NFL games where the comments contain the keywords "superb, excellent, well done".
A link to a video.
A link to the page that displays a video.
To see example content for youtube.videolink:
The URL to a page that displays a YouTube video.
Rarely used for filtering. You can use this target if you have a direct link to YouTube content.
An author's geographical location.
An author's geographical location. Typically Topix supplies this as a US city and state. Here are some sample values that we have seen in this target:
| Altamonte Springs, FL | Bethlehem, PA | |
| Chambersburg, PA | Danville, KY | |
| Grand Island, NE | Hammond, IN | |
| Houston, TX | Indianapolis, IN | |
| Los Angeles, CA | Milford, CT | |
| Newville, PA | New York, NY | |
| Seminole, FL | South Windsor, CT | |
| Spring, TX | West Mifflin, PA |
However, you might find other values, such as:
1. Search for Topix content from authors in Texas:
A Topix author's geographical location. Typically Topix supplies this as a US city and state.
Filter for Topix content from authors in Texas.
The 'real' name of the author of a post.
The 'real' name of the author of a post.
1. Filter for content from one named author:
The real name supplied by a member to Topix on signup.
Filter for Topix content from members with names that contain "Jane".
The text of this post.
The text of this post.
1. Filter for posts that mention the iPad:
The full text of a Topix post.
Filter for Topix content that mentions Chicago.
Currently this is always set to "HTML".
Currently this is always set to "HTML".
1. At the time of writing, the format is always "HTML" but this filter checks whether any new formats are available.
The format of a post on Topix. Currently the format is set to "HTML" by default.
You can use this target to check for new formats of posts on Topix.
A link directly to the post.
A link to the page for this post.
1. To see examples of the content of topix.link:
The URL to an original post on Topix.
Rarely used in filtering. You can use this target for filtering if you have the direct link to a specific post on Topix.
Link to the complete message board thread.
Link to the complete message board thread.
1. To see examples of the content of topix.thread:
The URL to a complete message board thread on Topix.
Rarely used for filtering. You can use this target if you have the direct link to specific message board thread on Topix.
The title of a post.
The title of a post.
1. Filter for posts that include "iPad" in their title:
The title of a Topix post.
Filter for Topix content that mentions the iPad in their title.
Indicates whether the interaction contains the first message in a new board thread.
Indicates whether the interaction contains the first message in a new board thread or a response to an existing message in a thread.
It can take these values:
1. Filter for posts that start a thread and that mention baseball.
2. Filter for posts that respond to a thread and mention "Apple":
Indicates the type of a Topix message. If the message is a "thread", it is identified as the first message posted in a thread. If it is a "post", it is identified as a subsequent message on a thread.
Filter for Topix content where new threads mention the word "iPhone 5".
If you're just getting started with links resolution, here's some recommended reading.
Read this excellent blog from Lorenzo Alberton, DataSift's Chief Technical Architect, describing the Bitly data source and the Links augmentation 'under the hood'.
This blog looks at Datasift's latest link resolution services, including a new partnership with bitly, the #1 link sharing platform, which powers 75 percent of the world’s largest media companies and half of the Fortune 500 companies. With over 20,000 white-labeled domains, bitly generates 200M clicks/day.
This blog looks at two powerful protocols, Open Graph from Facebook and Twitter Cards from Twitter. The ideas behind these technologies allow social media sites two provide embedded content accompanied by metadata that describes the content being presented, distributed, and consumed.
Our Introduction page explains how the Links augmentation examines links within the content of a message and resolves them to their final endpoint. At the same time it also aggregates the content of the link so that filtering can be performed against the content of the page that the link was pointing at.
Browse the Links augmentation targets.
The Links augmentation looks at any links within the content of a message and resolves them to their final endpoint. At the same time it also aggregates the content of the link so that filtering can be performed against the content of the page that the link was pointing at.
DataSift follows all types of shortened links (for example, bit.ly and Twitter's own t.co shortener) and follows each redirect until the final web page is found. The final resolved link is also visible (as links.url) to be filtered against
The Link augmentation works in near real time; only links which have not previous been discovered are taken out of the real-time flow and are re-inserted (normally in under two seconds) back into the flow of data.

Here are the key points you need to know first:
1. You can filter against the title of a linked page:
2. You can filter against specific domains. We use the in operator here rather than contains because this target is an array of strings:
3. You can filter against the retweet count:
Note that this example has no meaning if there is more than one link in the object because each link has a unique retweet count.
An input object might contain more than one link so the Links augmentation is designed to handle multiple links. The targets for the Links augmentation are arrays of strings or arrays of integers. There is one array element for each link. For example, for a Tweet that contains three links, there will be three array elements.
DataSift keeps the array elements in step automatically. For example, if links.title contains:
| 2 | Yahoo! |
| 1 | |
| 0 | eBay |
links.url contains:
| 2 | http://www.yahoo.com |
| 1 | http://www.google.com |
| 0 | http://www.ebay.com |
You perform operations on these arrays as if they were simple strings or integers. For example, the following filter succeeds if it finds a match on at least one row in the array.
The number of seconds since we first saw a particular link in the Twitter Firehose.
Please note: we are deprecating this target.
It will continue to compile but it will not match any data.
The number of seconds since a particular link was created in our database. As soon as we see an author use a new link, we aggregate it. In other words, this target indicates the first time we saw a link in the Twitter Firehose.
This information is useful, for example, when you write filters that look for new links only.
1. Filter for posts containing links that have been added to our database within the past 100 seconds:
The number of seconds since a particular link was created in our database. As soon as we see an author use a new link, we aggregate it. In other words, this target indicates the first time we saw a link in the Twitter Firehose.
This information is useful, for example, when you write filters that look for new links only.
Filter for posts containing links that have been added to our database within the past 100 seconds.
The fully qualified domain name of a link, resolved to the final URL.
The Fully Qualified Domain Name (FQDN) of a link. This information may be used for statistical purposes to find out which domains publish most popular content.
Remember that a link might have been shortened. In all cases, links.domain contains the domain of the fully expanded link.
1. Filter for Tweets that contain a link to any eBay.com page:
2. Filter for posts mentioning URLs that include any domains from a list:
3. Filter for posts mentioning URLs that include a domain or any of its subdomains:
The domain name of a link from a post.
Remember that a link might have been shortened. In all cases, links.domain contains the domain of the fully expanded link.
Filter for Tweets that contain a link to any eBay.com page.
The total number of times this link has been Tweeted.
Please note: we are deprecating this target.
It will continue to compile but it will not match any data.
The total number of times this link has been Tweeted.
See also twitter.retweet.count.
1. Filter for Tweets that contain a link that has been Tweeted to more than 10,000 times.
Note that this might include one Tweet that has been retweeted 10,000 times or 10,000 different Tweets from 10,000 authors, all linking to the same destination page.
The total number of times this link has been Tweeted.
Filter for Tweets that contain a link that has been Tweeted to more than 10,000 times.
Note that this might include one Tweet that has been retweeted 10,000 times or 10,000 different Tweets from 10,000 authors, all linking to the same destination page.
The title of the page the link points to.
The title of a page that is linked to from a post.
For example, suppose that the text of a Tweet is:
"You have to read this! http://imdb.to/16tYO"
This shortened link in the Tweet expands to: www.imdb.com
The HTML <title> of that page is "IMDb - Movies, TV and Celebrities" and it appears as a string in links.title.
1. Filter for Tweets that contain "read this" and include a link to a page with "Google" in the title:
The title is the text that is extracted from the <title></title> tags in the page's HTML. It does not necessarily correspond to any content that appears in a browser when the page is rendered.
The title is extracted from the <title></title> tags in the page's HTML.
The title of a page that is linked to from a post. For example, suppose that the text of a Tweet is:
"You have to read this! http://imdb.to/16tYO"
This shortened link in the Tweet expands to: www.imdb.com
The HTML <title> of that page is "IMDb - Movies, TV and Celebrities" and it appears as a string in links.title.
Use this in combination with Twitter >> Tweet to filter for Tweets that contain "check it out" and include a link to a page with "NFL" in its title.
The full URL of the page (resolved to the final page).
The expanded URL of a page that this post links to.
You can use this augmentation to filter for posts that contain a specific string or substring in the link.
1. Filter for posts that contain a link that, when expanded, includes a specified substring:
2. Filter for posts that contain links that include the substring "obama-victory":
3. Filter for posts that contain links but exclude named domains:
The expanded URL of a page that this post links to. You can use this augmentation to filter for posts that contain a specific string or substring in the link.
Filter for posts that include the substring "nfl-scores" in the link.
The character set for the target web page.
The character set for the target web page. This information is derived from the <meta charset="UTF-8"> (HTML5) or <meta http-equiv="content-type" content="text/html; charset=UTF-8"> (HTML 4.01) tags found inside the <head>...</head> tags. You can use it to measure the rate of adoption of a particular encoding. This is very useful feedback for designers, developers, and publishers. Possible values are:
1. Filter for links to content in UTF-8:
The character set for the target web page. This information is derived from the <meta charset="UTF-8"> (HTML5) or <meta http-equiv="content-type" content="text/html; charset=UTF-8"> (HTML 4.01) tags found inside the <head>...</head> tags. You can use it to measure the rate of adoption of a particular encoding. This is very useful feedback for designers, developers, and publishers.
Filter for links to content in EUC-JP.
Content type of the target page.
Indicates the type of content on the target page. It can be found inside the page's header:
<head>
<meta http-equiv="content-type" content="text/html" />
</head>
This information helps you measure popularity of different types of content (text, image, video) or go deeper and analyze the popularity of different video codecs (video/3gpp, video/mp4, video/x-m4v).
1. Filter for content in plain text:
Here are some sample values that we found:
Indicates the type of content on the target page. It can be found inside the page's header:
<head>
<meta http-equiv="content-type" content="text/html" />
</head>
This information helps you measure popularity of different types of content (text, image, video) or go deeper and analyze the popularity of different video codecs (video/3gpp, video/mp4, video/x-m4v).
Filter for content in html text.
Language of the target page.
Language of the target page, as declared in the HTTP headers or in the HTML meta tags.
Language codes must be ISO 639-1 codes. It is as declared in the HTTP headers, the <html> tag or in the HTML <meta> tags, as follows:.
<html lang="en">
...
</html>
or
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
...
</html>
Here are some sample values:
Language of the target page, as declared in the HTTP headers or in the HTML meta tags. The language codes must be compliant with ISO 639-1 codes.
Filter for content where the language metadata is set to British English.
Keywords accompanying news stories.
Metadata frequently attached to news stories.
You can use this target to refine your filters. It is especially useful when you are trying to filter for pages that publish news instead of other types of content. This type of metadata is found on pages that implement the Google news_keywords <meta> tag.
<head>
<meta name="news_keywords" content="Obama, White House, Election, 2012, victory" />
</head>
1. Filter for stories that include "Google" as a keyword:
Metadata frequently attached to news stories. You can use this target to refine your filters, especially when you are trying to filter for pages that publish news instead of other types of content. This type of metadata is found on pages that implement the Google news_keywords <meta> tag.
Filter for stories that include "Obama" as a keyword.
Open Graph activity metadata.
Open Graph object type activity.
You can use this augmentation to filter for content by the name of an activity.
The links.meta.opengraph.activity target belongs to the Activities family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:activity" content="running" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph activity metadata is "running":
An Open Graph object type. You can filter for content by the name of an activity. This target belongs to the Activities family of Open Graph object types.
Filter for content where the Open Graph activity metadata is "swimming".
Open Graph actor metadata.
Open Graph object type actor.
You can use this augmentation to filter for content by the name of an actor.
The links.meta.opengraph.actor target belongs to the People family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:actor" content="Will Smith" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph actor metadata is "Will Smith":
An Open Graph object type. You can filter for content by the name of an actor. This target belongs to the People family of Open Graph object types.
Filter for content where the Open Graph actor metadata is "Tom Cruise".
Open Graph album metadata.
Open Graph object type album.
You can use this augmentation to filter for content by the name of an album.
The links.meta.opengraph.album target belongs to the Products and Entertainment family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:album" content="Abbey Road" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph album metadata is "Abbey Road":
An Open Graph object type. You can filter for content by the name of a music album. This target belongs to the Products and Entertainment family of Open Graph object types.
Filter for content where the Open Graph album metadata is "Born This Way".
Open Graph article metadata.
Open Graph object type article.
You can use this augmentation to filter for website content such as news articles, blog posts, photos, and videos.
The links.meta.opengraph.article target belongs to the Websites family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:article" content="Yankees">
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph article metadata is "Yankees":
An Open Graph object type. You can filter for website content such as news articles, blog posts, photos, and videos. This target belongs to the Websites family of Open Graph object types.
Filter for content where the Open Graph article metadata is "New York Jets".
Open Graph athlete metadata.
Open Graph object type athlete.
You can use this augmentation to filter for content by the name of an athlete.
The links.meta.opengraph.athlete target belongs to the People family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:athlete" content="Ryan Lochte" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph athlete metadata is "Ryan Lochte":
An Open Graph object type. You can filter for content by the name of an athlete. This target belongs to the People family of Open Graph object types.
Filter for content where the Open Graph athlete metadata is "LeBron James".
Open Graph author metadata.
Open Graph object type author.
You can use this augmentation to filter for content by the name of an author.
The links.meta.opengraph.author target belongs to the People family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:author" content="Charles Dickens" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph author metadata is "Charles Dickens":
An Open Graph object type. You can filter for content by the name of an author. This target belongs to the People family of Open Graph object types.
Filter for content where the Open Graph actor metadata is "Michael Crichton".
Open Graph band metadata.
Open Graph object type band.
You can use this augmentation to filter for content by the name of a band.
The links.meta.opengraph.band target belongs to the Organizations family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:band" content="The Beatles" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph band metadata is "Beatles":
An Open Graph object type. You can filter for content by the name of a band. This target belongs to the Organizations family of Open Graph object types.
Filter for content where the Open Graph band metadata is "Led Zepplin".
Open Graph bar metadata.
Open Graph object type bar.
You can use this augmentation to filter for content by the name of a bar.
The links.meta.opengraph.bar target belongs to the Businesses family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:bar" content="Rick's" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph bar metadata is "Rick's":
An Open Graph object type. You can filter for content by the name of a bar. This target belongs to the Businesses family of Open Graph object types.
Filter for content where the Open Graph bar metadata is "Mixx Bar".
Open Graph blog metadata.
Open Graph object type blog.
You can use this augmentation to filter for content from a particular blog domain.
The links.meta.opengraph.blog target belongs to the Websites family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:blog" content="blogspot" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph blog metadata contains "blogspot":
An Open Graph object type. You can filter for content from a particular blog domain. This target belongs to the Websites family of Open Graph object types.
Filter for content where the Open Graph blog metadata contains "wordpress".
Open Graph book metadata.
Open Graph object type book.
You can use this augmentation to filter for content by the name of a book.
The links.meta.opengraph.book target belongs to the Products and Entertainment family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:book" content="Hamlet" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph book metadata is "Hamlet":
An Open Graph object type. You can filter for content by the name of a book. This target belongs to the Products and Entertainment family of Open Graph object types.
Filter for content where the Open Graph book metadata is "Life of Pi".
Open Graph cafe metadata.
Open Graph object type cafe.
You can use this augmentation to filter for content by the name of a cafe.
The links.meta.opengraph.cafe target belongs to the Businesses family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:cafe" content="Starbucks" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph cafe metadata includes "Starbucks":
An Open Graph object type. You can filter for content by the name of a cafe. This target belongs to the Businesses family of Open Graph object types.
Filter for content where the Open Graph cafe metadata is "Costa".
Open Graph cause metadata.
Open Graph object type cause.
You can use this augmentation to filter for content by specifying a cause.
The links.meta.opengraph.cause target belongs to the Groups family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:cause" content="World Wildlife Fund" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph cause metadata is "World Wildlife Fund":
An Open Graph object type. You can filter for content by specifying a cause. This target belongs to the Groups family of Open Graph object types.
Filter for content where the Open Graph cause metadata is "WWF".
Open Graph city metadata.
Open Graph object type city.
You can use this augmentation to filter for content by the name of a city.
The links.meta.opengraph.city target belongs to the Places family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:city" content="Paris" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph city metadata includes "Paris":
An Open Graph object type. You can filter for content by the name of a city. This target belongs to the Places family of Open Graph object types.
Filter for content where the Open Graph city metadata includes "New York".
Open Graph company metadata.
Open Graph object type company.
You can use this augmentation to filter for content by the name of a company.
The links.meta.opengraph.company target belongs to the Businesses family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:company" content="DataSift" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph company metadata is "DataSift":
An Open Graph object type. You can filter for content by the name of a company. This target belongs to the Businesses family of Open Graph object types.
Filter for content where the Open Graph company metadata is "Apple".
Open Graph country metadata.
Open Graph object type country.
You can use this augmentation to filter for content by the name of a country.
The links.meta.opengraph.country target belongs to the Places family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:country" content="France" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph country metadata includes "France":
An Open Graph object type. You can filter for content by the name of a country. This target belongs to the Places family of Open Graph object types.
Filter for content where the Open Graph country metadata includes "England".
Open Graph director metadata.
Open Graph object type director.
You can use this augmentation to filter for content by the name of a director.
The links.meta.opengraph.director target belongs to the People family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:director" content="Alfred Hitchcock" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph director metadata is "Alfred Hitchcock":
An Open Graph object type. You can filter for content by the name of a director. This target belongs to the People family of Open Graph object types.
Filter for content where the Open Graph actor metadata is "Steven Spielberg".
Open Graph drink metadata.
Open Graph object type drink.
You can use this augmentation to filter for content by the name of a drink.
The links.meta.opengraph.drink target belongs to the Products and Entertainment family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:drink" content="Shaken not stirred" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph drink metadata includes "Shaken not stirred":
An Open Graph object type. You can filter for content by the name of a drink. This target belongs to the Products and Entertainment family of Open Graph object types.
Filter for content where the Open Graph drink metadata is "Sangria".
The HTTP status code for the original URL.
The HTTP status code for the original URL. This is the HTTP status code received by DataSift when we tried to retrieve the original URL. When DataSift needs to resolve shortened links, it will be the HTTP status code for the last resolved link. Codes other than 200 (OK) do not mean that the resource the original URL points to is unavailable, they only mean that it might had been unavailable when DataSift tried to access it.
1. Filter for Tweets that contain "love" and include a link to a resource with an HTTP status code 200:
2. Filter for Tweets that contain "politics" and include a link to a resource with a "not-failed" HTTP status code:
The HTTP status code for the original URL. This is the HTTP status code received by DataSift when we tried to retrieve the original URL. When DataSift needs to resolve shortened links, it will be the HTTP status code for the last resolved link. Codes other than 200 (OK) do not mean that the resource the original URL points to is unavailable, they only mean that it might had been unavailable when DataSift tried to access it.
Filter for Tweets that contain "life" and include a link to a resource with an HTTP status code 200.
The array of intermediate links that any link posted as a part of an interaction resolves to before DataSift obtains the original URL.
The array of intermediate links that any link posted as a part of an interaction resolves to before DataSift obtains the original URL. If the URL we're analyzing points directly to the resource it represents, the links.hops array will be empty; if the URL is a shortened link (a Bilty link, for example) that resolves directly to the original URL, the links.hops array will have one element. If, however, the URL is a shortened link to another shortened link that points to the original URL, the links.hops array will have two elements.
1. Filter for Tweets that contain links shortened using Bitly:
The array of intermediate links that any link posted as a part of an interaction resolves to before DataSift obtains the original URL. If the URL we're analyzing points directly to the resource it represents, the links.hops array will be empty; if the URL is a shortened link (a Bilty link, for example) that resolves directly to the original URL, the links.hops array will have one element. If, however, the URL is a shortened link to another shortened link that points to the original URL, the links.hops array will have two elements.
Filter for Tweets that contain links shortened using bitly.
Description of the target page.
Description of the target page found inside the page's header.
This information is added to pages by their creators, either automatically or by hand, as follows:
<head>
<meta name="description" content="DataSift offers the most powerful and sophisticated tools for extracting value from Social Data." />
</head>
1. Filter for interactions with links that include "election" in their metadata description:
Description of the target page found inside the page's header.
This information is added to pages by their creators, either automatically or by hand, as follows:
<head>
<meta name="description" content="DataSift offers the most powerful and sophisticated tools for extracting value from Social Data." />
</head>
Filter for interactions with links that include "Yankees win" in their metadata description.
Keywords for a target page.
Keyword metadata for the target page for a link.
This information is added to pages by their creators, either automatically or by hand, as follows:
<head>
<meta name="keywords" content="Romney, Obama, White House, Election, 2012, opinion" />
</head>
The result is a JSON array, for example:
1. Filter for content with keyword metadata that includes "election":
The keyword metadata for the target page for a link.
This information is added to pages by their creators, either automatically or by hand, as follows:
<head>
<meta name="keywords" content="Romney, Obama, White House, Election, 2012, opinion" />
</head>
Filter for content with keyword metadata that includes "photography".
Open Graph description metadata.
Open Graph object property description.
You can use this augmentation to filter for content by specifying a string or substring included in the description. It is an optional Open Graph object property but most Open Graph Protocol users recommend including it.
The links.meta.opengraph.description target belongs to the family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:description" content="Coverage of the 2012 Presidential election" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph description metadata includes "election":
An Open Graph object property. You can filter for content by specifying a string or substring included in the description. It is an optional Open Graph object property but most Open Graph Protocol users recommend including it. This target belongs to the family of optional Open Graph object properties.
Filter for content where the Open Graph description metadata includes "US elections".
Open Graph email address metadata.
Open Graph object property email.
You can use this augmentation to filter for content by specifying an email address.
The links.meta.opengraph.email target belongs to the Contact Information family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:email" content="support@datasift.com" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph email metadata includes "support@datasift.com":
2. Filter for content where the Open Graph email metadata includes one or more email addresses from a list:
An Open Graph object property. You can filter for content by specifying an email address. This target belongs to the Contact Information family of optional Open Graph object properties.
Filter for content where the Open Graph email metadata includes "support@datasift.com".
Open Graph fax_number metadata.
Open Graph object property fax_number.
You can use this augmentation to filter for content by specifying a fax number.
The links.meta.opengraph.fax_number target belongs to the Contact Information family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:fax_number" content="555-123-4567" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph fax_number metadata includes the specified fax number:
An Open Graph object property. You can filter for content by specifying a fax number. This target belongs to the Contact Information family of optional Open Graph object properties.
Filter for content where the Open Graph fax_number metadata includes a specified fax number.
Open Graph food metadata.
Open Graph object type food.
You can use this augmentation to filter for content by specifying the name of a food.
The links.meta.opengraph.food target belongs to the Products and Entertainment family of Open Graph object types:
In the header of an HTML page, it appears as meta property:
<meta property="og:food" content="filet mignon" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph food metadata is "filet mignon":
An Open Graph object type. You can filter for content by the name of a food. This target belongs to the Products and Entertainment family of Open Graph object types.
Filter for content where the Open Graph food metadata is "pizza".
Open Graph geo metadata.
Open Graph object property geo.
You can use this augmentation to filter for content by its geographical location. Be sure to include any one of the three geographical operators that CSDL offers:
The links.meta.opengraph.geo target belongs to the Location family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:geo" content="40.714623,-74.006605:100" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph geo metadata is a location within a 100 kilometer radius of New York:
An Open Graph object property. You can filter for content by its geographical location. You must include any one of the three geographical operators that CSDL offers. This target belongs to the Location family of optional Open Graph object properties.
Filter for content where the Open Graph geo metadata is a location within a 50 kilometer radius of London.
Open Graph locality metadata.
Open Graph object property locality.
You can use this augmentation to filter for content by its locality information.
The links.meta.opengraph.locality target belongs to the Location family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:locality" content="Tristate Area" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph locality metadata is "Tristate Area":
An Open Graph object property. You can filter for content by its locality information. This target belongs to the Location family of optional Open Graph object properties.
Filter for content where the Open Graph locality metadata is "West Village".
Open Graph phone_number metadata.
Open Graph object property phone_number.
You can use this augmentation to filter for content by specifying a phone number.
The links.meta.opengraph.phone_number target belongs to the Contact Information family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:phone_number" content="555-123-4567" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph phone_number metadata includes the specified phone number:
An Open Graph object property. You can filter for content by specifying a phone number. This target belongs to the Contact Information family of optional Open Graph object properties.
Filter for content where the Open Graph phone_number metadata includes a specified phone number.
Open Graph postal_code metadata.
Open Graph object property postal_code.
You can use this augmentation to filter for content by a postal code.
The links.meta.opengraph.postal_code target belongs to the Location family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:postal_code" content="94105" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph postal_code metadata includes "94105":
An Open Graph object property. You can filter for content by a postal code. This target belongs to the Location family of optional Open Graph object properties.
Filter for content where the Open Graph postal_code metadata includes "94105".
Open Graph region metadata.
Open Graph object property region.
You can use this augmentation to filter for content by a particular region.
The links.meta.opengraph.region target belongs to the Location family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:region" content="Los Angeles" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph region metadata is "Los Angeles":
An Open Graph object property. You can filter for content by a particular region. This target belongs to the Location family of optional Open Graph object properties.
Filter for content where the Open Graph region metadata is "Los Angeles".
Open Graph site_name metadata.
Open Graph object property site_name.
You can use this augmentation to filter for content by the name of a website. Though it is an optional Open Graph object property but most Open Graph Protocol users recommend including it.
The links.meta.opengraph.site_name target belongs to the family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:site_name" content="IMDb" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph site_name metadata includes "IMDb":
An Open Graph object property. You can filter for content by the name of a website. It is an optional Open Graph object property but most Open Graph Protocol users recommend including it. This target belongs to the family of optional Open Graph object properties.
Filter for content where the Open Graph site_name metadata includes "Reddit".
Open Graph street_address metadata.
Open Graph object property street_address.
You can use this augmentation to filter for content by its address information.
The links.meta.opengraph.street_address target belongs to the Location family of optional Open Graph object properties:
In the header of an HTML page, it appears as meta property:
<meta property="og:street_address" content="Wall Street" />
Learn more about Open Graph protocol.
1. Filter for content where the Open Graph street_address metadata includes "Wall Street":
An Open Graph object property. You can filter for content by its street address information. This target belongs to the Location family of optional Open Graph object properties.
Filter for content where the Open Graph street_address metadata includes "Wall Street".
The type of Twitter Card: summary, photo, or player.
Indicates whether an interaction is a summary, photo, or player.
There are three different types of cards that can be attached to Tweets:
The links.meta.twitter.card target is part of the family of basic Twitter Cards targets:
In the header of an HTML page, it might appear as:
<meta name="twitter:card" content="summary">
At the time of writing, Twitter advises users that the twitter:card and twitter:player values must be the same. Similarly, when filtering in DataSift for a particular value in links.meta.twitter.card, it is identical to filtering for that value in links.meta.twitter.player.
1. Filter for content where the Twitter Card type is a summary:
According to Twitter's Developer Documentation Suite:
|
Twitter Cards make it possible for you to attach media experiences to Tweets that link to your content. Simply add a few lines of HTML to your webpages, and users who Tweet links to your content will have a "card" attached to the Tweet that is visible to all of their followers.
As a developer, Twitter Cards:
Learn more about Twitter Cards. |
Indicates whether an interaction is a summary, photo, or player.
There are three different types of cards that can be attached to Tweets:
Filter for content where the Twitter Card type is a player.
The Twitter @username of the content creator.
The Twitter @username of the Twitter account which created the content shown in a Twitter Card.
The links.meta.twitter.creator target is part of the Creator Attribution and Website Attribution families of targets:
In the header of an HTML page, it might appear as:
<meta name="twitter:creator" content="@datasift">
Learn more about Twitter Cards.
1. Filter for content where the Twitter Cards creator metadata is "@DataSift":
The Twitter @username of the Twitter account which created the content shown in a Twitter Card.
Filter for content where the Twitter Cards creator metadata is "@Beyonce".
Twitter user ID of the content creator.
The Twitter user ID of the Twitter account which created the content shown in a Twitter Card.
The links.meta.twitter.creator target is part of the Creator Attribution and Website Attribution families of targets:
In the header of an HTML page, it might appear as:
<meta name="twitter:creator_id" content="155505157">
Learn more about Twitter Cards.
1. Filter for content where the Twitter Cards creator_id metadata is "155505157":
The Twitter user id of the Twitter account which created the content shown in a Twitter Card.
Filter for content where the Twitter Cards creator_id metadata is "31239408".
A description of the Twitter Card.
A description of the Twitter Card in a maximum of 200 characters.
You can use this augmentation to filter for content by specifying a string or substring contained in the Twitter Cards description metadata.
The links.meta.twitter.description target is part of the family of basic Twitter Cards targets:
In the header of an HTML page, it might appear as:
<meta name="twitter:description" content="Microsoft launches Windows 9" />
Learn more about Twitter Cards.
1. Filter for content where the Twitter Cards description metadata contains the substring "DataSift":
A description of the Twitter Card in a maximum of 200 characters. You can filter for content by specifying a string or substring contained in the Twitter Cards description metadata.
Filter for content where the Twitter Cards description metadata contains the substring "US Elections".
URL of the image attached to a Twitter Card.
URL of the image attached to a Twitter Card.
This target is populated only if links.meta.twitter.card contains "image". You can use this augmentation to filter for Twitter Cards that contain an image.
The links.meta.twitter.image target is part of the family of basic Twitter Cards targets:
In the header of an HTML page, it might appear as:
<meta name="twitter:image" content="www.example.com/images/datasift.png" />
Learn more about Twitter Cards.
1. Filter for content where the Twitter Cards image metadata is "www.example.com/images/datasift.png":
The URL of the image attached to a Twitter Card. This target is populated only if a Twitter Card contains an image. You can filter for Twitter Cards that contain an certain image.
Filter for content where the Twitter Cards image metadata is "www.example.com/images/datasift.png".
Height (in px) of the image.
Height in pixels of the image attached to a Twitter Card.
This target is populated only if links.meta.twitter.card contains "image". You can filter for images of a particular size by using links.meta.twitter.image_width and links.meta.twitter.image_height, to specify the width and height of an image.
The links.meta.twitter.image_height target is part of the Image family of basic Twitter Cards targets:
In the header of an HTML page, it appears as follows:
<meta content="250" name="twitter:image:height" />
1. Filter for content where the Twitter Cards image_height metadata is "250":
The height in pixels of the image attached to a Twitter Card. This target is populated only if a Twitter Card contains an image. You can filter for images of a particular size by specifying the width and height of an image.
Filter for content where the Twitter Cards image_height metadata is "250".
Width (in px) of the image.
Width in pixels of the image attached to a Twitter Card.
This target is populated only if links.meta.twitter.card contains "image". You can filter for images of a particular size by using links.meta.twitter.image_width and links.meta.twitter.image_height to specify the width and height of an image.
The links.meta.twitter.image_height target is part of the Image family of basic Twitter Cards targets:
In the header of an HTML page, it appears as follows:
<meta content="250" name="twitter:image:width" />