Blog

CSDL Optimization Techniques

There are plenty of ways to optimize your CSDL. Remember, the more optimal your CSDL is, the less it costs you. Overusing Operators: Many people heavily overuse the or operator in the following way: interaction.content contains "word1" or interaction.content contains "word2" or …

Read CSDL Optimization Techniques >

Salience 5

DataSift uses an engine called Salience to perform sentiment analysis. We've just upgraded to a new version, Salience 5, and in this blog post I'll review the existing features and introduce the new ones. Up to now, DataSift's sentiment analysis has offered two numeric values. The first is score that…

Read Salience 5 >

Migrating from the Twitter Streaming API

When comparing the DataSift streaming API to the Twitter streaming API, some people have noticed differences in the number of Tweets being returned. The main differences occur when using the Twitter Streaming API "track" method to search for keywords. If we were to search the Twitter…

Read Migrating from the Twitter Streaming API >

Streams within streams

The Curated Stream Definition Language (CSDL) in DataSift allows one stream to call another stream. The technique adds some interesting new possibilities. This blog post is a tutorial for beginners. First, let's set up three example streams, keeping them very simple because we're just illustrating…

Read Streams within streams >

Regular Expressions

Introduction: You've probably written streams that use CSDL's native operators such as contains and any. You might not have tried our embedded regular expression (regex) engine yet. If you already know how to write a regex, just read our regular expression page, take a look at the escaping…

Read Regular Expressions >