Blog

Introducing Keyword Relationship Models

Identifying and expanding on keywords and terms is a key challenge when filtering, classifying and analyzing text data. We're always looking at how we can make this challenge easier. One area we've been researching is finding relationships between words using word2vec. Today we've released a tool which allows you to explore relationships between words. We've also created our first Keyword Relationship Model for you to explore. This model represents over three million unique...


Building Better Machine Learned Classifiers Faster with Active Learning

You might have seen our recent announcement covering many things, including VEDO Intent. You're probably aware that DataSift VEDO allows you to run machine-learned classifiers. Unfortunately, creating a high-quality classifier relies on a sufficient quantity of high-quality, manually classified training data (which can be painstaking to produce) and on exploring machine learning algorithms to get the best possible result. VEDO Intent is a tool that helps...


Exploring Keyword Relationships in Social Posts with word2vec

You might have seen our recent announcement covering many things including the launch of our new FORUM data science initiative. We're looking to share more of our experience and research to help innovation in social data analysis. One of the first things we wanted to share is our work exploring the relationships between keywords and terms in social posts. Our data science team has been researching this area using word2vec - a data science library that models the relationship / similarity...


Open Data Processing for Twitter - Now Available

Following Twitter's announcement that it is terminating its partnership with DataSift, we've been working hard to build a solution to help our customers fill the gaps left by the transition to Gnip. We've now released an open-source connector for you to install on your own infrastructure (or a platform such as AWS), which takes data from the Gnip API and feeds it into the DataSift platform. Hopefully this component will make your transition as pain-free as possible. DataSift's open-...


Transitioning to GNIP: Feature Gaps vs DataSift

Twitter has terminated its partnership agreement with DataSift, meaning that as of August 13th 2015, companies will need to license Twitter data directly from GNIP. From our analysis, 80% of our customers leverage capabilities which are absent in GNIP. The goal of this post is to highlight the main functional areas that will be impacted by transitioning to GNIP, to enable our customers to evaluate the features that they may need to deprecate from their products, or identify the...

