Welcome!

AJAX & REA Authors: Gary Kaiser, Michael Bushong, Shelly Palmer, Elizabeth White, Roger Strukhoff

Related Topics: Web 2.0, XML, Open Source, Search, AJAX & REA, Security

Web 2.0: Blog Post

Using Taxonomy to Drive Online Contextual Advertising with Sophializer

Classifying Web Content to the IAB Taxonomy

It’s a Big Market …
…  online advertising.  There are 10,000 stories and data points about it.  Here are two to give some context to the journey below.  First, global online ad spending is projected by ZenithOptimedia to exceed print ad spend by 2015 (note 1).  This 2015 projected spend figure for online advertising is $132.4 billion.  Second, global online ad revenue is projected by another research agency, Digital TV Research, to hit $143 billion by 2017 (note 2).

These are prodigious amounts of money for companies to spend to connect with customers.  But … surely it’s easy to connect online customers to web content featuring, or suggesting, products? And surely, online is “better”?  Where can, and do, taxonomy-based approaches add value to this dance of moving (emotional and semantic) parts between the intentful consumer poised to shop and the intentful marketer with honed content?

Online Ad Targeting is Easy … so 'They' Say …
Really?  So what might be “easy”?  And, indeed, “better”?  Let’s unbundle these simulacra that look like very fuzzy concepts, and as ontologists and knowledge engineers let’s think our way forward with the concept of “precision”.

So … online is more precise than billboards by freeways?  Lightly stated, online has advantages.  What about magazine print ads vs. online?  Online has potential advantages. But … and this is a very big but … in both these cases (and all others) online depends on connecting potential customers to products, their features, their benefits, their attributes and so on precisely, and with precision that is repeatable and extensible.  Rather than random (random is the most expensive way to advertise and has fallen out of favor).  And, since online copy and online ads are words (including in videos) and are semantically classifiable, and since classifications can be organized into models (taxonomies and ontologies) … then there are advantages to be created through the combination of semantic analysis, categorization and taxonomy.

Now, let’s connect taxonomy, classification, semantics and optimizing online ad targeting.  There are a host of holy grails currently being sought in the web/mobile/social uber-ecosystem.  Some are well found, though not perfect, and are unlikely to traverse through a paradigmatic improvement.  Think ‘search’.  Others are most definitely not found (yet).  Given the size of the market outlined in the first paragraph, the rewards are huge to those with the tools and skillsets that know how to work with semantics, taxonomy/ontology, classification of content to taxonomy, and design of taxonomies to drive online targeting.

New Approaches to Classifying to the IAB Taxonomy with Sophializer
Sophia Search
is a recent entrant into this space.  (I have written about them before here.   Sophia Search’s tool – currently called the ‘Sophializer’ – categorizes any URL to nodes in the Internet Advertising Bureau (IAB) taxonomy.  Sophializer can also classify content of ads (and so create a semantic/conceptual ‘signature’ for each). The IAB Contextual Taxonomy comprises three levels:

  • Tier 1 – 23 nodes
  • Tier 2 – 371 nodes
  • Tier 3 – unspecified and vendor specific

Given that Sophializer categorizes both sides of this content dance – web page and ad – web properties can serve ads to any page automatically using the IAB taxonomy as the cross-mapping conceptual foundation.

Sophializer not only classifies to Tier 1 and Tier 2 it also discovers/generates robust classifications that can be used to customize Tier 3 for individual customers.

Benefits of Using Taxonomy for Ad Targeting
Taxonomy gives a framework to this kind of semantic work.  Essentially, we are cross-mapping both partners of this content dance – content and ad - using the IAB taxonomy  as a “choreographer” of sorts.  Other taxonomies could be used.  In fact, multiple taxonomies could be used – and this would be particularly powerful if these taxonomies were cross-mapped to each other.  For example, if you have content (web page, say, or ad) categorized and mapped to Taxonomy A and Taxonomy A is cross-mapped to the IAB taxonomy … then … you can propagate these ads to content that is already categorized.

Benefits of Using Categorization Tools to Assign Marketing Content to Taxonomy Nodes
There are a number of different methods of assigning content to nodes in any taxonomy –

  • Manually
  • Training sets of documents (training documents are most often manually selected as exemplars)
  • Categorization algorithms that work with semantic tokens

There is more than enough to say on each of these around methods, workflows, best practices and pitfalls for a blog post on each.  But not here.

Sophializer utilizes patented and proprietary algorithms in the core of their categorization engine.  Two fundamental points are worth, briefly, focusing on.  Firstly, different categorization engines use different patented technologies.  “Quality” from different categorizers is (very) variable.  Which is why it is important to carry out “Proofs of Concept” when evaluating this technology.

Secondly, the more semantically rich the taxonomy – e.g. fully enriched with synonyms and other types of evidence terms – the better “quality” one gets with any method of associating content to taxonomy nodes.   Both of these parameters are make-or-break (literally) in using semantics to target online ads.

Learn More 2.0
The Google Display Network is IAB Certified and complies with the top 2 tiers of the IAB Contextual Taxonomy.  You can read details of what Google do here and this also navigates you to the Google mapping to the IAB taxonomy Tier 1 and Tier 2.

Sophia Search currently has a number of engagements on the web that are live.  For example, targeting ads for non-fiction books (from a major publishing house) to news stories (on a pre-eminent news site).  You can contact them for details.

This is not an empty space.  Other companies are also searching for the holy grail of taxonomy-based content targeting mediated by content categorization that works.  See, for example, see ADmantX (http://blog.admantx.com/post/15726823528/a-new-iab-based-taxonomy-and-an...).

This whole space is an excellent example of where the application of the nexus of taxonomy, categorization and semantics will provide stratospheric business benefit.  Grails are waiting to be found here.

Notes
Note 1.  See ZenithOprimedia

The detailed ZenithOptimedia figures can be found here

Note 2.  See Hollywood Reporter

You can download the Digital TV Research press release about these figures here

Cloud Expo Breaking News
SYS-CON Events announced today that Cloudian, Inc., the leading provider of hybrid cloud storage solutions, has been named “Bronze Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Cloudian is a Foster City, Calif.-based software company specializing in cloud storage. Cloudian HyperStore® is an S3-compatible cloud object storage platform that enables service providers and enterprises to build reliable, affordable and scalable hybrid cloud storage solutions. Cloudian actively partners with leading cloud computing environments including Amazon Web Services, Citrix Cloud Platform, Apache CloudStack, OpenStack and the vast ecosystem of S3 compatible tools and applications. Cloudian's customers include Vodafone, Nextel, NTT, Nifty, and LunaCloud. The company has additional offices in China and Japan.
After a couple of false starts, cloud-based desktop solutions are picking up steam, driven by trends such as BYOD and pervasive high-speed connectivity. In his session at 15th Cloud Expo, Seth Bostock, CEO of IndependenceIT, cuts through the hype and the acronyms, and discusses the emergence of full-featured cloud workspaces that do for the desktop what cloud infrastructure did for the server. He’ll discuss VDI vs DaaS, implementation strategies and evaluation criteria.
SYS-CON Events announced today that Esri has been named “Bronze Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Esri inspires and enables people to positively impact the future through a deeper, geographic understanding of the changing world around them. For more information, visit http://www.esri.com.
Cloud computing started a technology revolution; now DevOps is driving that revolution forward. By enabling new approaches to service delivery, cloud and DevOps together are delivering even greater speed, agility, and efficiency. No wonder leading innovators are adopting DevOps and cloud together! In his session at DevOps Summit, Andi Mann, Vice President of Strategic Solutions at CA Technologies, will explore the synergies in these two approaches, with practical tips, techniques, research data, war stories, case studies, and recommendations.
Cloud Computing is evolving into a Big Three of Amazon Web Services, Google Cloud, and Microsoft Azure. Cloud 360: Multi-Cloud Bootcamp, being held Nov 4–5, 2014, in conjunction with 15th Cloud Expo in Santa Clara, CA, delivers a real-world demonstration of how to deploy and configure a scalable and available web application on all three platforms. The Cloud 360 Bootcamp, led by Janakiram MSV, an analyst with Gigaom Research, is the first bootcamp that introduces the core concepts of Infrastructure as a Service (IaaS) based on the workings of the Big Three platforms – Amazon EC2, Google Compute Engine, and Azure VMs. Bootcamp attendees will get to see the big picture and also receive the knowledge needed to make the best cloud decisions for their business applications and entire enterprise IT organization.
“Distrix fits into the overall cloud and IoT model around software-defined networking. There’s a broad category around software-defined networking that’s focused on data center, and we focus on the WAN,” explained Jay Friedman, President of Distrix, in this SYS-CON.tv interview at the Internet of @ThingsExpo, held June 10-12, 2014, at the Javits Center in New York City. Internet of @ThingsExpo 2014 Silicon Valley, November 4–6, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading IoT industry players in the world.
The Internet of Things promises to transform businesses (and lives), but navigating the business and technical path to success can be difficult to understand. In his session at 15th Internet of @ThingsExpo, Chad Jones, Vice President, Product Strategy of LogMeIn's Xively IoT Platform, will show you how to approach creating broadly successful connected customer solutions using real world business transformation studies including New England BioLabs and more.
Scott Jenson leads a project called The Physical Web within the Chrome team at Google. Project members are working to take the scalability and openness of the web and use it to talk to the exponentially exploding range of smart devices. Nearly every company today working on the IoT comes up with the same basic solution: use my server and you'll be fine. But if we really believe there will be trillions of these devices, that just can't scale. We need a system that is open a scalable and by using the URL as a basic building block, we open this up and get the same resilience that the web enjoys.
“The Internet of Things is a wave that has arrived and it’s growing really fast. The concern at Aria Systems is making sure that people understand the ramifications of their attempts to monetize whatever it is they build on the Internet of Things," explained C Brendan O’Brien, Co-founder and Chief Architect at Aria Systems, in this SYS-CON.tv interview at the Internet of @ThingsExpo, held June 10-12, 2014, at the Javits Center in New York City. Internet of @ThingsExpo 2014 Silicon Valley, November 4–6, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading IoT industry players in the world.
The Internet of Things is a natural complement to the cloud and related technologies such as Big Data, analytics, and mobility. In his session at Internet of @ThingsExpo, Joe Weinman will lay out four generic strategies – digital disciplines – to exploit emerging digital technologies for strategic advantage. Joe Weinman has held executive leadership positions at Bell Labs, AT&T, Hewlett-Packard, and Telx, in areas such as corporate strategy, business development, product management, operations, and R&D.
SYS-CON Events announced today that DevOps.com has been named “Media Sponsor” of SYS-CON's “DevOps Summit at Cloud Expo,” which will take place on June 10–12, 2014, at the Javits Center in New York City, New York. DevOps.com is where the world meets DevOps. It is the largest collection of original content relating to DevOps on the web today Featuring up-to-the-minute news, feature stories, blogs, bylined articles and more, DevOps.com is where the thought leaders of the DevOps movement make their ideas known.
There are 182 billion emails sent every day, generating a lot of data about how recipients and ISPs respond. Many marketers take a more-is-better approach to stats, preferring to have the ability to slice and dice their email lists based numerous arbitrary stats. However, fundamentally what really matters is whether or not sending an email to a particular recipient will generate value. Data Scientists can design high-level insights such as engagement prediction models and content clusters that allow marketers to cut through the noise and design their campaigns around strong, predictive signals, rather than arbitrary statistics. SendGrid sends up to half a billion emails a day for customers such as Pinterest and GitHub. All this email adds up to more text than produced in the entire twitterverse. We track events like clicks, opens and deliveries to help improve deliverability for our customers – adding up to over 50 billion useful events every month. While SendGrid data covers only abo...
SYS-CON Events announced today that the Web Host Industry Review has been named “Media Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Since 2000, The Web Host Industry Review has made a name for itself as the foremost authority of the Web hosting industry providing reliable, insightful and comprehensive news, reviews and resources to the hosting community. TheWHIR Blogs provides a community of expert industry perspectives. The Web Host Industry Review Magazine also offers a business-minded, issue-driven perspective of interest to executives and decision-makers. WHIR TV offers on demand web hosting video interviews and web hosting video features of the key persons and events of the web hosting industry. WHIR Events brings together like-minded hosting industry professionals and decision-makers in local communities. TheWHIR is an iNET Interactive property.
SYS-CON Events announced today that O'Reilly Media has been named “Media Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. O'Reilly Media spreads the knowledge of innovators through its books, online services, magazines, and conferences. Since 1978, O'Reilly Media has been a chronicler and catalyst of cutting-edge development, homing in on the technology trends that really matter and spurring their adoption by amplifying "faint signals" from the alpha geeks who are creating the future. An active participant in the technology community, the company has a long history of advocacy, meme-making, and evangelism.
SYS-CON Events announced today that Verizon has been named “Gold Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Verizon Enterprise Solutions creates global connections that generate growth, drive business innovation and move society forward. With industry-specific solutions and a full range of global wholesale offerings provided over the company's secure mobility, cloud, strategic networking and advanced communications platforms, Verizon Enterprise Solutions helps open new opportunities around the world for innovation, investment and business transformation. Visit verizonenterprise.com to learn more.