Presentation Overview:
Nearest neighbor models are conceptually just about the simplest kind of model possible. The problem is that they generally arenât feasible to apply. Or at least, they werenât feasible until the advent of Big Data techniques. This talk will describe some of the techniques used in the knn project to reduce thousand-year computations to a few hours. ...
[read more]
Presentation Overview:
Nearest neighbor models are conceptually just about the simplest kind of model possible. The problem is that they generally arenât feasible to apply. Or at least, they werenât feasible until the advent of Big Data techniques. This talk will describe some of the techniques used in the knn project to reduce thousand-year computations to a few hours. The knn project uses the Mahout math library and Hadoop to speed up these enormous computations to the point that they can be usefully applied to real problems. These same techniques can also be used to do real-time model scoring.
Presenters:
Ted Dunning | MapR | Chief Application Architect
Ted has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.
Chao Yuan | American Express | SVP, Decision Science and Targeting for US Consumer Services Risk
Chao has been a significant contributor to the Risk and Information Management team since he joined American Express in 1996. From his roles leading the global fraud modeling team to his more recent role as the head of modeling and decision science for the US Consumer business, Chao has been a major driver of large modeling transformations, which include fraud, credit and marketing models. His expertise also serves well in his role as the chairman of the Global Modeling Strategy Committee â