If you’re coming from the post Is Sentiment Analysis an 80% Technology? you may want to continue directly to my post written partially in response to Seth:

Why Sentiment Analysis Sucks for Social Media Monitoring?

Or continue to read the original post that initiated the debate.

———

To save you the trouble of reading further (and few minutes of your time), the answer is “not much”.

Leading providers such as Sysomos and Radian6 estimate their automated sentiment analysis and scoring system to be 80% accurate. That sounds quite good, right?

It doesn’t at all.

20% difference statistically is huge and comes with an array of problems. Think about a situation in which you’re comparing something with 59% positive sentiment to something that has 65% positive sentiment. That’s less than 10% difference. In other words, whenever you have a situation where you’re looking at current data without long historical trend as a reference (for example the Oscars), these types of numbers are completely useless. Almost exclusively, the numbers sentiment vendors provide have differences within the 20% range.

For example, if [A] has positive sentiment of 50% and [B] has positive sentiment of 60%, with 80% accuracy this could mean that [A] is anything between 40%-60% and [B] is anything between 48% and 72%. The only thing this tells us is that statistically speaking [B] might be seen as more positive, but then again, it says the same thing about [A]!

From the below graphic you can see how the benchmark value with 59% positive sentiment can change the whole graph with just 20% variation. So it could be anything from completely positive to quite negative.

Possible scenarios with 20% accuracy

Statistics are wonderful for BS.

When you come from the analytics industry and start developing tools for the analytics industry, some things are clear from the get-go. The fact that customers have been teached to ask for sentiment scoring (thank you very much early snake oil peddlers) doesn’t mean the vendors should invest R&D in it. When the ticker feature became available to websites in 96-97, everyone wanted it. Hmmm, come to think of it now, maybe that’s why we struggled with revenues back in the day, after all we refused tickers.

To be a pioneer in an industry is a tremendous responsibility. It’s like teaching a child. If you teach that red is green and green is blue, then that is what the child will learn. If other adults reinforce this message, it becomes the common reality. If the vendor’s teach the market that sentiment is the way to go, then that is what the customers will expect thus forcing future vendors in to a situation where being competitive means doing sentiment better. In other words, wasting R&D resources in trying to fix something that is broken (sentiment analysis) instead of looking at what the customer is really looking for. Customer is always looking for the same thing, make more money. So this model is quite simple.

Brands exist in order to make money, that is the harsh reality we live in. But it’s also a very workable reality from the vendor perspective. Sentiment is nice to know, but up until today I haven’t heard a single commercial application for it. Commercial application in this case means an application that serves the purpose of earning more money for someone  else than the vendor of the application.

The question remains:

“How does knowing the net sentiment score help me to drive more commerce?”.

Chinese have a saying about how you should stop digging when you find yourself in a hole. Learning new things about your business and yourself works the same.  If you want to learn something new, you have to be ready to trash some of your existing convictions. Otherwise you’ll be just convincing yourself of how “right” you are and have been.

“In the beginner’s mind there are many possibilities, in the expert’s there are few”

- Shunruy Suzuki

According to some of the players in the social media monitoring field, sentiment analysis works and customers are happy with it. One question I haven’t heard though, is how does sentiment drive commerce. Does negative sentiment mean weaker commerce, positive stronger and so on. It’s a new game, so I understand that there is no numbers (yet) but what I don’t understand is how the industry has completely ignored the whole topic. After all, social media monitoring is marketing, and marketing has truly only one purpose at the end of the day.

To Drive Commerce (or not)

Corporations at this level are very focused with share value. So what better way to look at the commercial value of sentiment than comparing it to stock quote over same time.

According to one of the leaders in the space, Sysomos, the three brands with most negative sentiment around them in 2009 were:

  1. McDonalds
  2. Marlboro (Philip Morris)
  3. Toyota

While the three brands with the most positive buzz around them were:

  1. Samsung
  2. Nokia
  3. Intel

Last year was a tough year for everyone, the sample size is ridiculously small and to make things even worse, the companies are from different categories. Samsung is not traded in NYSE so I replaced it with IBM, who by the way is the biggest winner on the list. Another big winner, Intel has been very open about how they play. Or at least Michael Brito has, thanks! On the other hand, it looks like Nokia’s stock could have done better.

STOCK QUOTES FOR YEAR-TO-DATE FROM GOOGLE FINANCE

Screen shot 2009-12-28 at 9.23.48 PM

The official results for this little test:

  1. IBM +55.14%
  2. Intel +38.67%
  3. Toyota +29.75%
  4. Philip Morris +12.39
  5. McDonalds +1.98%
  6. Nokia -18.07%

It’s well worth noting that both IBM and Intel were coming from their 10 year lows at the switch of 2009, so a hefty rise for such juggernauts would not come as surprise.

This whole ‘research’ I have just conducted, is obviously complete BS and should not be taken seriously by anyone. But one thing it does well (I hope), is that it speaks loudly the question we all should be asking:

“What’s sentiment really worth?”

Also I want to put out the challenge to Sysomos, who obviously has this data from 2009:

It would be great to see this same test run category vs. category with a much larger dataset. It took about 5 minutes to get the data from Google Finance, and another 10 minutes to write this ‘report’ on the ‘findings’.

If it comes to that, we’d love to do this together with you and extend our helping hand. After all, since we’re in the insights business, we should do what we reasonably can. Things start to get really interesting, when we can run this test year to year. I bet a lot of this stuff doesn’t show instantly.

Analytics and Design

October 14th, 2009

Big part of design is minimizing and simplifying. Taking something complex (like data) and taking everything extra away from it. Just like if you’re looking at data. Design is everywhere, not just in the product development and advertising. Essentially everything has a design to it. Revenue model is a type of design. Mission statement is a design. Awesome design almost exclusively is something complex that wonderful folks were able to simplify.

Read the rest of this entry »

In the next 10 years internet is going to change dramatically. After yesterday it’s even clearer than before that Adobe has a stronghold on it, regardless of how the web pans out. We’ll see the internet becoming smarter, more social and of course more reliable. Now Adobe has the toolset to drive that change on its own terms. And it’s the only company in the world who really can say that.

Putting together a platform that combines production and management, analytics and optimization is a huge step forward. Imagine the designer who can now see a/b test results in matter of minutes without having to work for it. Or business application developer getting the insights from all previous applications before starting to build a new one. All within the platform they already learned to love. The proposition of having instant feedback for your actions before and after taking them is a powerful one. An unified online business platform will have the chance of delivering that.
Read the rest of this entry »