AJAX & REA Authors: Piram Manickam, Subrahmanya SV, S Sangeetha, Bob Gourley, RealWire News Distribution

Using Taxonomy to Drive Online Contextual Advertising with Sophializer

Classifying Web Content to the IAB Taxonomy

It’s a Big Market …
…  online advertising.  There are 10,000 stories and data points about it.  Here are two to give some context to the journey below.  First, global online ad spending is projected by ZenithOptimedia to exceed print ad spend by 2015 (note 1).  This 2015 projected spend figure for online advertising is $132.4 billion.  Second, global online ad revenue is projected by another research agency, Digital TV Research, to hit $143 billion by 2017 (note 2).

These are prodigious amounts of money for companies to spend to connect with customers.  But … surely it’s easy to connect online customers to web content featuring, or suggesting, products? And surely, online is “better”?  Where can, and do, taxonomy-based approaches add value to this dance of moving (emotional and semantic) parts between the intentful consumer poised to shop and the intentful marketer with honed content?

Online Ad Targeting is Easy … so 'They' Say …
Really?  So what might be “easy”?  And, indeed, “better”?  Let’s unbundle these simulacra that look like very fuzzy concepts, and as ontologists and knowledge engineers let’s think our way forward with the concept of “precision”.

So … online is more precise than billboards by freeways?  Lightly stated, online has advantages.  What about magazine print ads vs. online?  Online has potential advantages.  But … and this is a very big but … in both these cases (and all others) online depends on connecting potential customers to products, their features, their benefits, their attributes and so on precisely, and with precision that is repeatable and extensible rather than random (random is the most expensive way to advertise and has fallen out of favor).  And, since online copy and online ads are words (including in videos) and are semantically classifiable, and since classifications can be organized into models (taxonomies and ontologies) … then there are advantages to be created through the combination of semantic analysis, categorization and taxonomy.

Now, let’s connect taxonomy, classification, semantics and optimizing online ad targeting.  There are a host of holy grails currently being sought in the web/mobile/social uber-ecosystem.  Some are well found, though not perfect, and are unlikely to see a paradigm-shifting improvement.  Think ‘search’.  Others are most definitely not found (yet).  Given the size of the market outlined in the first paragraph, the rewards are huge for those with the tools and skillsets to work with semantics, taxonomy/ontology, classification of content to taxonomy, and design of taxonomies to drive online targeting.

New Approaches to Classifying to the IAB Taxonomy with Sophializer
Sophia Search is a recent entrant into this space.  (I have written about them before here.)  Sophia Search’s tool – currently called the ‘Sophializer’ – categorizes any URL to nodes in the Interactive Advertising Bureau (IAB) taxonomy.  Sophializer can also classify the content of ads (and so create a semantic/conceptual ‘signature’ for each).  The IAB Contextual Taxonomy comprises three levels:

  • Tier 1 – 23 nodes
  • Tier 2 – 371 nodes
  • Tier 3 – unspecified and vendor specific

Given that Sophializer categorizes both sides of this content dance – web page and ad – web properties can serve ads to any page automatically using the IAB taxonomy as the cross-mapping conceptual foundation.
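
To make the cross-mapping idea concrete, here is a minimal sketch in Python.  It assumes that pages and ads have already been categorized to taxonomy nodes by some categorizer; the function name `match_ads`, the ad identifiers and the node labels are all hypothetical and purely illustrative (they are not taken from Sophializer or checked against the published IAB node list).  An ad qualifies for a page when the two share at least one node, and ads with more shared nodes rank first.

```python
from typing import Dict, List, Set


def match_ads(page_nodes: Set[str], ads: Dict[str, Set[str]]) -> List[str]:
    """Return ids of ads sharing at least one taxonomy node with the page.

    Ads are ordered by number of shared nodes (most first), then by id.
    """
    scored = [(len(page_nodes & nodes), ad_id) for ad_id, nodes in ads.items()]
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))
    return [ad_id for overlap, ad_id in ranked if overlap > 0]


# Illustrative (hypothetical) node labels for one page and three ads.
page = {"Automotive", "Automotive/Hybrid"}
ads = {
    "ad-hybrid-suv": {"Automotive", "Automotive/Hybrid"},
    "ad-car-insurance": {"Automotive", "Personal Finance/Insurance"},
    "ad-cookbook": {"Food & Drink"},
}

matched = match_ads(page, ads)
print(matched)
```

Set overlap is the simplest possible matching rule; a production system would presumably also weigh confidence scores and tier depth, which this sketch leaves out.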

Sophializer not only classifies to Tier 1 and Tier 2, it also discovers/generates robust classifications that can be used to customize Tier 3 for individual customers.

Benefits of Using Taxonomy for Ad Targeting
Taxonomy gives a framework to this kind of semantic work.  Essentially, we are cross-mapping both partners of this content dance – content and ad – using the IAB taxonomy as a “choreographer” of sorts.  Other taxonomies could be used.  In fact, multiple taxonomies could be used – and this would be particularly powerful if these taxonomies were cross-mapped to each other.  For example, if you have content (a web page, say, or an ad) categorized to Taxonomy A, and Taxonomy A is cross-mapped to the IAB taxonomy … then … you can propagate those ads to content that is already categorized to the IAB taxonomy.
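
The Taxonomy A example can be sketched as a simple lookup.  This is an illustration under stated assumptions, not any vendor’s implementation: the mapping table `A_TO_IAB`, the function `to_iab` and every node label are hypothetical.  Content tagged with Taxonomy A nodes is translated into IAB-style nodes through the cross-map, after which it can be matched against anything already categorized to the IAB taxonomy.

```python
from typing import Dict, Set

# Hypothetical cross-map from an in-house "Taxonomy A" to IAB-style labels.
A_TO_IAB: Dict[str, Set[str]] = {
    "Cars/Electric": {"Automotive"},
    "Money/Car Loans": {"Automotive", "Personal Finance"},
}


def to_iab(a_nodes: Set[str], mapping: Dict[str, Set[str]]) -> Set[str]:
    """Translate Taxonomy A nodes to IAB-style nodes; unmapped nodes are skipped."""
    iab_nodes: Set[str] = set()
    for node in a_nodes:
        iab_nodes |= mapping.get(node, set())
    return iab_nodes


# An ad categorized only in Taxonomy A becomes matchable in IAB terms.
ad_nodes_a = {"Cars/Electric", "Money/Car Loans"}
ad_nodes_iab = to_iab(ad_nodes_a, A_TO_IAB)
print(sorted(ad_nodes_iab))
```

Note that one Taxonomy A node may map to several IAB nodes, which is why the mapping values are sets rather than single labels.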

Benefits of Using Categorization Tools to Assign Marketing Content to Taxonomy Nodes
There are a number of different methods of assigning content to nodes in any taxonomy:

  • Manually
  • Training sets of documents (training documents are most often manually selected as exemplars)
  • Categorization algorithms that work with semantic tokens

There is more than enough to say on each of these around methods, workflows, best practices and pitfalls for a blog post on each.  But not here.

Sophializer utilizes patented and proprietary algorithms in the core of its categorization engine.  Two fundamental points are worth focusing on briefly.  Firstly, different categorization engines use different patented technologies.  “Quality” from different categorizers is (very) variable.  Which is why it is important to carry out “Proofs of Concept” when evaluating this technology.

Secondly, the more semantically rich the taxonomy – e.g. fully enriched with synonyms and other types of evidence terms – the better “quality” one gets with any method of associating content to taxonomy nodes.   Both of these parameters are make-or-break (literally) in using semantics to target online ads.
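
The second point can be illustrated with a deliberately naive sketch.  It is not Sophializer’s (proprietary) algorithm; it is plain term matching, and the function `categorize`, the node label and the evidence terms are all hypothetical.  The same sentence is categorized twice: once against a bare node that carries only its own label, and once against the same node enriched with synonyms and other evidence terms.

```python
import re
from typing import Dict, Set


def categorize(text: str, evidence: Dict[str, Set[str]]) -> Dict[str, int]:
    """Count distinct evidence terms per node that occur in the text.

    Nodes with no matching terms are omitted from the result.
    """
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    hits = {node: len(tokens & terms) for node, terms in evidence.items()}
    return {node: count for node, count in hits.items() if count > 0}


# The same (hypothetical) node, first bare and then enriched with evidence terms.
bare = {"Automotive": {"automotive"}}
enriched = {"Automotive": {"automotive", "car", "sedan", "suv", "dealership"}}

text = "The new sedan arrives at every dealership this spring."

bare_result = categorize(text, bare)
enriched_result = categorize(text, enriched)
print(bare_result, enriched_result)
```

The bare node misses the sentence entirely because the word “automotive” never appears, while the enriched node picks it up through “sedan” and “dealership”.  Real categorizers are far more sophisticated, but the dependence on the richness of the taxonomy carries over.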

Learn More 2.0
The Google Display Network is IAB Certified and complies with the top 2 tiers of the IAB Contextual Taxonomy.  You can read details of what Google do here and this also navigates you to the Google mapping to the IAB taxonomy Tier 1 and Tier 2.

Sophia Search currently has a number of engagements on the web that are live.  For example, targeting ads for non-fiction books (from a major publishing house) to news stories (on a pre-eminent news site).  You can contact them for details.

This is not an empty space.  Other companies are also searching for the holy grail of taxonomy-based content targeting mediated by content categorization that works.  See, for example, ADmantX (http://blog.admantx.com/post/15726823528/a-new-iab-based-taxonomy-and-an...).

This whole space is an excellent example of where the application of the nexus of taxonomy, categorization and semantics will provide stratospheric business benefit.  Grails are waiting to be found here.

Notes
Note 1.  See ZenithOptimedia

The detailed ZenithOptimedia figures can be found here

Note 2.  See Hollywood Reporter

You can download the Digital TV Research press release about these figures here
