Gerrit Schultz - Internship at DataSift

Gerrit Schultz describes the time he recently spent, from August to November, as an intern in the Development group at DataSift. I'm very happy that as part of my university studies I'm now having the chance to work as an intern with DataSift. It's certainly been a brilliant experience. From the…

Read Gerrit Schultz - Internship at DataSift >

Regular Expressions

Introduction: You've probably written streams that use CSDL's native operators, such as contains and any. You might not have tried our embedded regular expression (regex) engine yet. If you already know how to write a regex, just read our regular expression page, take a look at the escaping…

Read Regular Expressions >

High Scalability

DataSift is the subject of the latest post on the High Scalability blog, which includes a detailed overview of the platform architecture and the problems involved in meaningfully filtering unstructured data from the Twitter API in real time. ‘You have to be able to reliably consume it, normalize i…

Read High Scalability >