Welcome!

Machine Learning Authors: Progress Blog, Pat Romanski, Liz McMillan, Elizabeth White, Kevin Jackson

Related Topics: @BigDataExpo, Cognitive Computing , Machine Learning

@BigDataExpo: Article

Patent Data Quality | @CloudExpo #BigData #Analytics #AI #MachineLearning

Is clean data a pipe dream?

The United States Patent and Trademark Office (USPTO) recently announced an expansion of PatentsView, its visualization tool for US patents. First launched a few years ago, the intent behind the tool was to make 40 years of patent filing data available for free to those interested in examining "the dynamics of inventor patenting activity over time." In spite of being limited to patents (not applications) and with a focus only on the US, it offers some interesting visualizations around locations and citations.

In a blog post last month, USPTO director Michelle Lee said the PatentView tool is based on "the highest-quality patent data available," connecting 40 years' worth of information about inventors, their organizations, and their locations in unprecedented ways. The newly revamped interface presents three user-friendly starting points - relationship, locations, and comparison visualizations - which allow for deeper exploration and detailed views. However, through no fault of their own, the USPTO dataset is rife with spelling errors, doesn't reflect patent reassignments, and doesn't resolve company subsidiaries or acquisitions.

This issue is not unique to the USPTO. Other PTO offices around the world face similar barriers to presenting "clean" data. The first issue, spelling errors, merely reflects the fact that assignee information (among other fields like inventor names) is manually entered and hence prone to error and inconsistency. For example, "International Business Machines" has been spelled 1,200 different ways as a patent assignee over the last two decades in the USPTO data set.

In addition, PTO data doesn't get corrected or updated based on later corrections or patent reassignments. For example, patent US8176440 was originally - and incorrectly - assigned to Silicon Labs. My company, Innography, filed a certificate of correction to update the assignment, yet the USPTO data and PatentsView still don't reflect this. In fact, Innography research shows that nearly 20 percent of US patents are reassigned in their lifetimes, translating into a significant number of company portfolio errors based on this factor alone.

Finally, PTO data also doesn't reflect when companies purchase each other, when there's a spinoff, or when a subsidiary files patents. Microsoft, for example, now owns all LinkedIn's patents, even if the reassignments haven't been processed.

As a result, PTO data falls far short of reflecting reality, where patents and companies are bought and sold every day, and where data-entry errors exist and are corrected. The accuracy of the data is very low when it comes to representing company patent portfolios in the real world.

The Cost of Free Data
The USPTO aims to increase the transparency of patenting and invention processes. But if the quality of data and search results is questionable, what good is it to IP practitioners?

There is rich information available through the patenting process, including economic research, prior-art searching, and discovery of broader trends around filing patterns. However, it was never intended to be used as-is to inform strategic business decisions such as in and out licensing, merger and acquisition activities, or portfolio pruning and maintenance decisions.

It makes sense for PTOs to offer their data for free as a way to engage the community's interest in patenting processes. However, too many lightweight patent analytics tools use this flawed data verbatim to tout their "data quality" to IP professionals.

Many patent analyses start with a company's patent portfolio, such as competitive benchmarking, acquisition analysis, and negotiation preparation. In addition, just about every board-level question about patents requires accurate patent ownership information: "Are we ahead of or behind this competitor?" "What companies should we be worried about in this technology area?"

Poor data quality makes it difficult, if not impossible, to answer those questions accurately. To create the most accurate data set possible, companies must use other sources of information to crosscheck and improve patent data accuracy.

Innography data scientists process more than 2,000 company acquisitions annually, and our user base suggests another 5,000 updates each year. As a result, Innography has created more than 10 million data-correction rules over the last decade, which are continuously updated via machine learning and crowdsourcing.

Company leaders must be able to use patent reports to assess market opportunities and make strategic business decisions. This requires an IP analytics solution that reflects real-world changes, and doesn't rely on poor data quality from outdated PTO assignee information.

More Stories By Tyron Stading

Tyron Stading is president and founder of Innography, and chief data officer for CPA Global. He has been named one of the “World’s Leading IP Strategists" by IAM, and one of National Law Journal's "50 Intellectual Property Trailblazers & Pioneers". Before Innography, Tyron was an IBM worldwide industry solutions manager in the telecommunications and utilities sector, and worked at several start-ups focused on mobile communications and networks security. He has published multiple research papers and filed more than three dozen patents. Tyron has a BS in Computer Science from Stanford University and an MS in Technology Commercialization from The University of Texas.

@CloudExpo Stories
The “Digital Era” is forcing us to engage with new methods to build, operate and maintain applications. This transformation also implies an evolution to more and more intelligent applications to better engage with the customers, while creating significant market differentiators. In both cases, the cloud has become a key enabler to embrace this digital revolution. So, moving to the cloud is no longer the question; the new questions are HOW and WHEN. To make this equation even more complex, most ...
Why Federal cloud? What is in Federal Clouds and integrations? This session will identify the process and the FedRAMP initiative. But is it sufficient? What is the remedy for keeping abreast of cutting-edge technology? In his session at 21st Cloud Expo, Rasananda Behera will examine the proposed solutions: Private or public or hybrid cloud Responsible governing bodies How can we accomplish?
SYS-CON Events announced today that Fusic will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Fusic Co. provides mocks as virtual IoT devices. You can customize mocks, and get any amount of data at any time in your test. For more information, visit https://fusic.co.jp/english/.
In his session at @ThingsExpo, Greg Gorman is the Director, IoT Developer Ecosystem, Watson IoT, will provide a short tutorial on Node-RED, a Node.js-based programming tool for wiring together hardware devices, APIs and online services in new and interesting ways. It provides a browser-based editor that makes it easy to wire together flows using a wide range of nodes in the palette that can be deployed to its runtime in a single-click. There is a large library of contributed nodes that help so...
SYS-CON Events announced today that Enroute Lab will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Enroute Lab is an industrial design, research and development company of unmanned robotic vehicle system. For more information, please visit http://elab.co.jp/.
SYS-CON Events announced today that B2Cloud will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. B2Cloud specializes in IoT devices for preventive and predictive maintenance in any kind of equipment retrieving data like Energy consumption, working time, temperature, humidity, pressure, etc.
SYS-CON Events announced today that N3N will exhibit at SYS-CON's @ThingsExpo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. N3N’s solutions increase the effectiveness of operations and control centers, increase the value of IoT investments, and facilitate real-time operational decision making. N3N enables operations teams with a four dimensional digital “big board” that consolidates real-time live video feeds alongside IoT sensor data a...
SYS-CON Events announced today that NetApp has been named “Bronze Sponsor” of SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. NetApp is the data authority for hybrid cloud. NetApp provides a full range of hybrid cloud data services that simplify management of applications and data across cloud and on-premises environments to accelerate digital transformation. Together with their partners, NetApp em...
Agile has finally jumped the technology shark, expanding outside the software world. Enterprises are now increasingly adopting Agile practices across their organizations in order to successfully navigate the disruptive waters that threaten to drown them. In our quest for establishing change as a core competency in our organizations, this business-centric notion of Agile is an essential component of Agile Digital Transformation. In the years since the publication of the Agile Manifesto, the conn...
SYS-CON Events announced today that SIGMA Corporation will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. uLaser flow inspection device from the Japanese top share to Global Standard! Then, make the best use of data to flip to next page. For more information, visit http://www.sigma-k.co.jp/en/.
SYS-CON Events announced today that Keisoku Research Consultant Co. will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Keisoku Research Consultant, Co. offers research and consulting in a wide range of civil engineering-related fields from information construction to preservation of cultural properties. For more information, vi...
WebRTC is great technology to build your own communication tools. It will be even more exciting experience it with advanced devices, such as a 360 Camera, 360 microphone, and a depth sensor camera. In his session at @ThingsExpo, Masashi Ganeko, a manager at INFOCOM Corporation, will introduce two experimental projects from his team and what they learned from them. "Shotoku Tamago" uses the robot audition software HARK to track speakers in 360 video of a remote party. "Virtual Teleport" uses a mu...
Internet of @ThingsExpo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devic...
Mobile device usage has increased exponentially during the past several years, as consumers rely on handhelds for everything from news and weather to banking and purchases. What can we expect in the next few years? The way in which we interact with our devices will fundamentally change, as businesses leverage Artificial Intelligence. We already see this taking shape as businesses leverage AI for cost savings and customer responsiveness. This trend will continue, as AI is used for more sophistica...
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
SYS-CON Events announced today that SourceForge has been named “Media Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. SourceForge is the largest, most trusted destination for Open Source Software development, collaboration, discovery and download on the web serving over 32 million viewers, 150 million downloads and over 460,000 active development projects each and every month.
What You Need to Know You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technolog...
SYS-CON Events announced today that DXWorldExpo has been named “Global Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Digital Transformation is the key issue driving the global enterprise IT business. Digital Transformation is most prominent among Global 2000 enterprises and government institutions.
One of the biggest challenges with adopting a DevOps mentality is: new applications are easily adapted to cloud-native, microservice-based, or containerized architectures - they can be built for them - but old applications need complex refactoring. On the other hand, these new technologies can require relearning or adapting new, oftentimes more complex, methodologies and tools to be ready for production. In his general session at @DevOpsSummit at 20th Cloud Expo, Chris Brown, Solutions Marketi...
Most of the time there is a lot of work involved to move to the cloud, and most of that isn't really related to AWS or Azure or Google Cloud. Before we talk about public cloud vendors and DevOps tools, there are usually several technical and non-technical challenges that are connected to it and that every company needs to solve to move to the cloud. In his session at 21st Cloud Expo, Stefano Bellasio, CEO and founder of Cloud Academy Inc., will discuss what the tools, disciplines, and cultural...