Sessions

Hadoop Summit 2013 will feature over 90 sessions and 7 tracks dedicated to enabling the next generation data platform. Industry experts, business leaders, architects, data scientists and Hadoop developers will share use cases and success stories, best practices, cautionary tales and technology insights.

A big thank you goes out to the Hadoop Summit Track Chairs and Selection Committees who worked hard to select sessions they felt will bring the most value to attendees. We are finalizing the schedule now and in the process of notifying the selected speakers. We expect to have a full schedule posted in the coming days. Meanwhile, below are a sample of some of the sessions you can expect to see at Hadoop Summit.

Track: Hadoop Driven Business / Business Intelligence

Phil Shelley &
Wuheng Luo
Sears Holdings The 3 T’s – Using Hadoop to modernize with faster access to data and value
Tim Hsu &
Neal Lee
Yahoo! Big data, Easy BI
Ming Ma eBay Hadoop and HBase @eBay
Paul Haefele Deep Value Inc Using Hadoop to do Simulation for High Frequency Trading
Rahul Bhartia &
Alexei Vassiliev
PayPal, Inc EAP – Accelerating behavorial analytics at PayPal using hadoop
Paul Codding &
Ravi Mutyala
 Hortonworks Enabling R on Hadoop
Jay Tang PayPal Hadoop Graph Processing with Apache Giraph
Matthew Rathbone Foursquare Labs Building and Improving Products with Hadoop
Stephen Scaffidi TripAdvisor Simplifying Use of Hive with the Hive Query Tool
Egor Gryaznov NICE Systems Business Rules on Hadoop
Oleg Zhurakousky &
Tom McCuch
Hortonworks High Speed Continuous & Reliable Data Ingest into Hadoop

Track: Hadoop (Disruptive) Economics

Chris Cantrell MapR How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million.
Tony Shan Global Big Data/Cloud
Consulting Firm
Big Data Transformation Method and Practice
Sewook Wee Accenture Technology
Labs
Where to deploy Hadoop: Bare metal or Cloud?
Matt Johnson &
Carmen Hall
Clearwire Wrangling Customer Usage Data with Hadoop
Kevin Coogan AmalgaMood Hadoop — Enabling Expanded Financial Market Analysis Techniques while Improving Investment Performance
Phil Shelley &
Sunil Kakade
Sears Holdings Corp.
(SHC) and MetaScale
Move to Hadoop, Go Faster and Save Millions – Mainframe Legacy Modernization
Oleg Zhurakousky
& Tom McCuch
Hortonworks Go Beyond ‘Debug’: Wire Tap your App for Knowledge with Hadoop
Dhaval Shah Bloomberg L. P. Recommender System at scale using HBase and Hadoop

Track: Future of Apache Hadoop

Phil Shelley &
Wuheng Luo
Sears Holdings
Corp. and MetaScale
Big Data 2.0: Hadoop as part of a Near-Real-Time Integrated Data Era
Shivnath Babu Duke University Demystifying Systems for Interactive and Real-time Analytics
Sanjay Radia Hortonworks Inc HDFS – What is New and Future
James Taylor Salesforce.com Phoenix: How (and why) we put the SQL back into the NoSQL
Steve Loughran Hortonworks Inc Hello OpenStack, meet Hadoop
Julien Le Dem &
Nong Li
Twitter, Inc &
Cloudera
Parquet: Columnar storage for the People
Shohei Hido Preferred Infrastructure Jubatus: real-time and highly-scalable machine learning platform
Andrew Feng Yahoo! Storm-on-YARN: Convergence of Low-Latency and Big-Data

Track: Enterprise Data Architecture

Ben Werther Platfora Realizing the Enterprise Data Reservoir
George Vetticaden
& George Trujillo
Hortonworks A Reference Architecture for ETL 2.0
Carl Steinbach Citus Data SQL on Hadoop: Defining the New Generation of Analytic Databases
Giang Nguyen &
Murtaza Doctor
RichRelevance How we solved Real-time User Segmentation using HBase
Craig Soules &
Garth Goodson
Natero Enabling data management in a Big Data world
Mohamed Elmallah Children’s Hospital
of Los Angeles
Using Hadoop for Vital signs and EMR data in Healthcare Research and Patient Care
Srikanth Sundarrajan
& Venkatesh Seetharam
InMobi Technology
Services Pte, Ltd &
Hortonworks
Apache Falcon – Data Management Platform on Hadoop (Beyond ETL)
Hien Luu LinkedIn LinkedIn Member Segmentation Platform: A Big Data Application
Zubin Dowlaty Mu Sigma Next Generation Analytics: A Reference Architecture
Aaron T. Myers &
Alejandro Abdelnur
Cloudera Securing the Hadoop Ecosystem

Track: Deployment and Operations

Dan Romike Twitter A cluster is only as strong as its weakest link
Ari Flink Cisco Large scale near real-time log indexing with Flume and SolrCloud
Wisely Chen &
Neal Lee
Yahoo! Continuous Integration for the Applications on top of Hadoop.
Joep Rottinghuis
& Jay Shenoy
Twitter Hadoop Hardware @Twitter: Size does matter!
Sanjay Radia &
Suresh Srinivas
Hortonworks Top Ten things to get the most out of your Hadoop cluster
Yusaku Sako &
Jeff Sposetti
Hortonworks Managing your Hadoop clusters with Apache Ambari
Benoy Antony &
Jos Backus
eBay Secure Hadoop @eBay
Joe Crobak Foursquare Lessons learned with Hadoop in the cloud and migrating to the datacenter
Jonathan Hsieh &
Kevin O’Dell
Cloudera Trends in Supporting Production Apache HBase Clusters
Govind Kamat &
Sumeet Singh
Yahoo!, Inc. Compression Options in Hadoop – A Tale of Tradeoffs
Robert Evans Yahoo!, Inc. Running YARN at scale

Track: Applications and Data Science

Chris Poulin &
Alex Kozlov
Patterns and Predictions
& Cloudera
Durkheim Project: Social Media Risk & Bayesian Counters
Jeff Magnusson &
Charles Smith
Netflix Watching Pigs Fly with the Netflix Hadoop Toolkit
Gary Helmling &
Joep Rottinghuis
Twitter A Birds-Eye View of Pig and Scalding Jobs with hRaven
Vaclav Petricek eHarmony Inc Hadoop in Love
Paco Nathan Concurrent, Inc. Pattern – an open source project for migrating predictive models from SAS, etc., onto Hadoop
Ofer Mendelevitch Hortonworks Data Science with Hadoop – A Primer
Eli Reisman Hortonworks Fast, Scalable Graph Processing: Apache Giraph on YARN
Russell Jurney Hortonworks Agile Data: Building Hadoop Analytics Applications
Casey Stella Hortonworks Mahout and Scalable Natural Language Processing
Christopher Severs eBay Should I use Scalding or Scoobi or Scrunch?