AGENDA

GRID | LIST VIEW | PDF

Sunday, June 26, 2016
8:30AM - 5:00PM
Pre-Event Training
Monday, June 27, 2016
8:30AM - 5:00PM
Pre-Event Training
3:00PM - 7:00PM
Registration Open
6:00PM - 8:00PM
Meetups
Tuesday, June 28, 2016
Business Tracks:
M
Modern Data Applications
B
Business Adoption
Technical Tracks:
A
Apache Committer Insights
D
Application Development
C
Cloud and Operations
S
Data Science, Analytics and Spark
F
Future of Apache Hadoop
G
Governance and Security
I
IoT and Streaming
P
Sponsor
RoomsBALLROOM ABALLROOM BBALLROOM C210A210C230A230C211212LL21A
7:30AM - 7:30PM
Registration Open
9:00AM - 11:00AM
Keynote
11:00AM - 11:30AM
Coffee Break in the Community Showcase
Apache Hadoop Crash Course
(11:00AM - 1:30PM)
11:30AM - 12:10PM
M
How Macy's Creates Operational Insight on Hadoop.
Seetha Chakrapany, Macy's
G
What the #$* is a Business Catalog and Why You Need It!
Andrew Ahn, Hortonworks
B
From Zero to Data Flow in Hours with Apache NiFi
Chris Herrera, Schlumberger
P
Open Source Ingredients for Interactive Data Analysis in Spark
Maxim Lukiyanov, Microsoft
C
Cloudbreak - Internals Deep Dive
Krisztian Horvath, Hortonworks
Janos Matyas, Hortonworks
M
Big Data for Managers: From Hadoop to Streaming and Beyond
Vladimir Bacvanski, SciSpike
C
Operationalizing YARN Based Hadoop Clusters in the Cloud - Lessons and Opportunities
Abhishek Modi, Qubole Inc.
S
Model Management Demo in the Lambda Architecture
Chris Kang, Accenture
I
7 Enterprise Case Studies of IoT, Streaming Analytics, and Real Business Value
Mark Palmer, TIBCO Software Inc.
12:20PM - 1:00PM
D
Faster, Faster, Faster!: The True Story of a Mobile Analytics Data Mart on Hive
Mithun Radhakrishnan, Yahoo!
Josh Walters, Yahoo!
F
Evolving HDFS to a Generalized Distributed Storage Subsystem
Sanjay Radia, Hortonworks
Jitendra Pandey, Hortonworks
I
Streaming in the Wild with Apache Flink
Kostas Tzoumas, data Artisans
G
Near Real-time Outlier Detection and Interpretation
Robert Thorman, AT&T
Adam Fuchs, Sqrrl Data, Inc.
I
The "Connected Vehicle" (IoT and Streaming) – Supporting the Mission to Become Both an Automotive and Mobility Company
Daniel Totten, Ford Motor Company
Tom Bryans, Ford Motor Company
C
Apache Ambari – Simplified Cluster Operation and Troubleshooting (including demo)
Jayush Luniya, Hortonworks
Alejandro Fernandez, Hortonworks
S
Building a Graph Database in Neo4j with Spark & Spark SQL to Gain New Insights from Log Data
Robert Hryniewicz, Hortonworks
Rachel Poulsen, TiVo
M
Knowledge from Noise: Geospatial Analytics at Progressive Insurance
Brian Durkin, Progressive
G
There is a New Ranger in Town! End-to-End Security and Auditing in a Big-Data-as-a-Service Deployment
Nanda Vijaydev, BlueData
Abhiraj Butala, BlueData
1:00PM - 2:10PM
Lunch in the Community Showcase
Women in Big Data Lunch in Room LL21E
2:10PM - 2:50PM
F
A Multi-Colored YARN: Apps and First-Class Support for Services
Vinod Kumar Vavilapalli, Hortonworks
I
In-Flux Limiting for a Multi-Tenant Logging Service
Ambud Sharma, Symantec Corporation
Suma Cherukuri, Symantec Corporation
B
What is Data? And What Are You Doing?
Russell Foltz-Smith, RFS Productions
P
Increasing Hadoop Resiliency & Performance with EMC Isilon
Boni Bruno, EMC
I
Building a Smarter Home with Nifi and Spark
Joseph Niemiec, Hortonworks
Christopher Gambino, Hortonworks
G
Governed Self Service Analytics at eBay
HU LIANG, eBay
C
Big Data in the Cloud, the Time has Come
Thomas Phelan, BlueData Inc.
Kris Applegate, Dell Inc.
D
Hdfs Analysis for Small Files
Rohit Jangid, Expedia
Raman Goyal, Expedia
I
Preventative Maintenance of Robots in Automotive Industry
Amit Kumar, Cisco
Ari Flink, Cisco
3:00PM - 3:40PM
I
The Industrial Internet: Big Data, Intelligent Machines, and Smarter Workforce
Uday Tennety, GE Digital
A
HDFS: Optimization, Stabilization and Supportability
Arpit Agarwal, Hortonworks
Chris Nauroth, Hortonworks
M
Successes, Challenges and Pitfalls Migrating a SAAS Business to Hadoop
Shaun Klopfenstein, Marketo
Eric Kienle, Marketo
C
On-Demand HDP Clusters Using Cloudbreak and Ambari
Vivek Madani, Symantec Corporation
Karthik Karuppaiya, Symantec Corporation
P
Investigating the Effects of Over Committing YARN Resources
Jason Lowe, Yahoo! Inc.
F
Analysis of Major Trends in Big Data Analytics
Slim Baltagi, Capital One Financial Corp.
P
Solving Performance Problems on Hadoop to Move Data Analytics Workloads into Production
Tyler Mitchell, Actian Corporation
I
End-to-End Processing of 3.7 Million Telemetry Events per Second Using Lambda Architecture
Saurabh Mishra, Hortonworks
Raghavendra Nandagopal, Symantec Corporation
S
Deep Learning Using DL4J and Spark on HDP for Fun and Profit
Adam Gibson, Skymind
Dhruv Kumar, Hortonworks

Apache Spark Crash Course
(3:00PM - 6:00PM)
3:40PM - 4:10PM
Coffee Break in the Community Showcase
4:10PM - 4:50PM
S
Google Cloud Platform Empowers TensorFlow and Machine Learning
Kaz Sato, Google Inc.
A
HDFS Tiered Storage
Virajith Jalaparti, Microsoft
Chris Douglas, Microsoft
I
The Life of an Internet of Things (IoT) Electron; its Journey to Become a Positive Influence for Something Greater
Peter Crossley, Webtrends
S
Application of Active Learning for Fraud Labeling @PayPal
Venkatesh Ramanathan, PayPal Inc
S
Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Jianfeng Zhang, Hortonworks
B
How Hadoop and a Modern Data Platform Can Enable Transformation in Healthcare
Beata Puncevic, Blue Cross Blue Shield of Michigan
P
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: Today's ETL Does it All!
Scott Gnau, Hortonworks
Tendu Yogurtcu, Syncsort
S
H2O: A Platform for Big Math
Arno Candel, h2o.ai
G
Top Three - Big Data Governance Issues and How Apache ATLAS resolves it for the Enterprise
Andrew Ahn, Hortonworks
5:00PM - 5:40PM
A
File Format Benchmark - Avro, JSON, ORC, and Parquet
Owen O`Malley, Hortonworks
P
IoT, Streaming Analytics and Machine Learning: Delivering Real-Time Intelligence With Apache NiFi
Paul Kent, SAS Institute Inc.
Dan Zaratsian, SAS
D
SQL and Solr Search with Spark for Big Data Analytics in Your Browser
Romain Rigaux, Cloudera
I
Introducing Kafka Streams, the New Stream Processing Library of Apache Kafka
Guozhang Wang, Confluent
B
LEGO: Data Driven Growth Hacking Powered by Big Data
Kamal Duggireddy, Salesforce.com
Prashant Gokhale, Salesforce.com
S
Data Preparation and Munging for Data Science: A Field Guide
Casey Stella, Hortonworks
P
How to Build a Successful Data Lake
Alex Gorelik, Waterline Data
A
Bridging the Gap of Relational to Hadoop Using Sqoop @ Expedia
Shashank Tandon, Expedia
Kopal Niranjan, Expedia
M
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Learning DSLs
Murali Kaundinya, Merck
Gopi Janakiraman, Merck
5:50PM - 6:30PM
S
Netflix - Productionizing Spark on YARN for ETL at Petabyte Scale
Ashwin Shankar, Netflix
Nezih Yigitbasi, Netflix
C
Keep your Hadoop Cluster at its Best!
Sheetal Dolas, Hortonworks
Chris Nauroth, Hortonworks
D
Spark SQL versus Apache Drill: Different Tools with Different Rules
Ted Dunning, MapR Technologies
P
Workload Automation + Hadoop? Oh Yeah! …a Match Made in Heaven
Darren Chinen, Malwarebytes
A
Scheduling Policies and Resource Types in YARN
Varun Vasudev, Hortonworks
Wangda Tan, Hortonworks
B
HDP @ MD Anderson - Starting the Hadoop Journey at a Global Leader in Cancer Research
Vamshi Punugoti, MD Anderson Cancer Center
Bryan Lari, MD Anderson Cancer Center
C
Instrument Your Instruments: Data-Driven Ops
Premal Shah, 6sense
I
Processing and Retrieval of Geotagged Unmanned Aerial System Telemetry
Kristopher Kane, Hortonworks
G
Hive Metastore Security of Apache Ranger
Yan Zhou, IBM
Tanping Wang, IBM
6:30PM - 7:30PM
Exhibitor Reception
Wednesday, June 29, 2016
Business Tracks:
M
Modern Data Applications
B
Business Adoption
Technical Tracks:
A
Apache Committer Insights
D
Application Development
C
Cloud and Operations
S
Data Science, Analytics and Spark
F
Future of Apache Hadoop
G
Governance and Security
I
IoT and Streaming
P
Sponsor
RoomsBALLROOM ABALLROOM BBALLROOM C210A210C230A230C211212LL21A
7:30AM - 6:30PM
Registration Open
9:00AM - 11:00AM
Keynote
11:00AM - 11:30AM
Coffee Break in the Community Showcase
Apache Nifi Crash Course
(11:00AM - 1:30PM)
11:30AM - 12:10PM
D
Spark Uber Development Kit
Kelvin Chu, Uber
Gang Wu, Uber
F
To Infinity and Beyond – Datacenter Scale YARN Clusters through Federation
Subru Krishnan, Microsoft
Kishore Chaliparambil, Microsoft
P
Extending Hortonworks with Oracle’s Big Data Platform
Paul Miller, Oracle
A
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with Apache Kafka
Jun Rao, Confluent
P
A New "Sparkitecture" for Modernizing Your Data Warehouse
Ranga Nathan, HPE
I
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
Julian Hyde, Hortonworks
C
Accelerating Data Warehouse Migration to Hadoop
Ajay Anand, Kyvos Insights, Inc.
Vineet Tyagi, Impetus
G
Navigating the World of User Data Management and Data Discovery
Smiti Sharma, EMC
M
Hadoop Application Architectures - Fraud Detection
Nishant Thacker, Microsoft
12:20PM - 1:00PM
M
Machine Learning for Any Size of Data, Any Type of Data
Apoorv Saxena, Google
P
Real-Time Hadoop: Keys for Success from Streams to Queries
Ted Dunning, MapR Technologies
F
Debunking the Myths of HDFS Erasure Coding Performance
Zhe Zhang, LinkedIn
Uma Maheswara Rao Gangumalla, Intel
P
Using Hadoop for Cognitive Analytics
Dr. Pedro Desouza, IBM
I
Performance Comparison of Streaming Big Data Platforms
Reza Farivar, Capital One
Kyle Nusbaum, Yahoo!
A
ACID Transactions in Hive
Eugene Koifman, Hortonworks
D
Quark: Simplify and Optimize SQL Queries Across Hadoop and RDBMS
Rajat Venkatesh, Qubole
G
Tracing Your Security Telemetry With Apache Metron
Justin Leet, Hortonworks
B
Beyond TCO: Architecting Hadoop for Adoption and Data Applications
Reid Levesque, RBC
1:00PM - 2:10PM
Lunch in the Community Showcase
2:10PM - 2:50PM
P
The Next Generation of Data Processing & OSS
James Malone, Google
D
Enterprise-Grade Streaming Under 2ms on Hadoop
Vijay Bhat, Capital One
A
Apache HBase - State of the Union
Enis Soztutar, Hortonworks
G
Apache Kafka Security
Parth Brahmbhatt, Netflix
Sriharsha Chintalapani, Hortonworks
D
Blink−Improved Runtime for Flink and its Application in Alibaba Search
Xiaowei Jiang, Alibaba Inc
Feng Wang, Alibaba Inc
M
Yahoo!'s Next-Generation User Profile Platform
Kai Liu, Yahoo! Inc.
Lu Niu, Yahoo! Inc.
I
Simultaneous Localization and Mapping (SLAM) with Kafka and Spark Streaming
Jay White Bear, IBM
G
Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)
Ancil McBarnett, Hortonworks
Pardeep Kumar, Hortonworks
M
Lambda Architecture: How we Merged Batch and Real-Time
Sewook Wee, Trulia
Sotos Matzanas, Trulia
3:00PM - 3:40PM
F
Apache Hive 2.0 SQL Speed Scale
Alan Gates, Hortonworks
S
Scalable Realtime Analytics using Druid
Slim Bouguerra, HortonWorks
Nishant Bangarwa, Hortonworks
F
Toward Better Multi-Tenancy Support from HDFS
Xiaoyu Yao, Hortonworks
D
Omid: A Transactional Framework for HBase
Francisco Perez-Sorrosal, Yahoo! Inc.
Ohad Shacham, Yahoo Research
P
Building and Managing Large Scale Data Pipelines with Complex Dependencies Using Apache Oozie
Purshotam Shah, Yahoo!
M
Automated Systems for Loan Decisions Using AKKA and Spark
Fredrick Crable, Capital One
S
Future of Apache Hadoop – An Enterprise Architecture View
Oliver Halter, PwC
Ritesh Ramesh, PwC
I
"I'm Being Followed by Drones!" The Impact of IoT on the Future of Unmanned Aerial Systems
Kenneth Kranz, Cognizant
B
Prescient Keeps Travelers Safe with Natural Language Processing and Geospatial Analytics
Mike Bishop, Prescient

Internet of Things Crash Course
(3:00PM - 6:00PM)
3:40PM - 4:10PM
Coffee Break in the Community Showcase
4:10PM - 4:50PM
B
The Architectural Journey to our Modern Data Applications – DSC (Data Supply Chain)
Daniel Totten, Ford Motor Company
Tom Bryans, Ford Motor Company
M
The Ecosystem is Too Damn Big
Andrew Brust, Datameer
C
The Intuit Analytics Cloud 101
Tilmann Bruckhaus, Intuit
G
Fine-Grained Security for Spark and Hive
Carter Shanklin, Hortonworks
Don Bosco Durai, Hortonworks
D
Hive Hbase Metastore - Improving Hive with a Big Data Metadata Storage
Daniel Dai, Hortonworks
Vaibhav Gumashta, Hortonworks
B
Statistical Analysis of Genomic Data with Hadoop
Jay Etchings, Arizona State University
I
Designing and Implementing Your IoT Solution with Open Source
Sunil Patil, mapr
Sridhar Reddy, MapR Technologies
S
Analyzing Telecom Fraud at Hadoop Scale
Sanjay Vyas, Diyotta
I
Building a Data Analytics PaaS for Smart Cities
Smiti Sharma, EMC
Keith Manthey, EMC
5:00PM - 5:40PM
M
Reliable and Scalable Data Ingestion at Airbnb
Krishna Puttaswamy, Airbnb
Jason Zhang, Airbnb
I
Lambda-less Stream Processing @ Scale in LinkedIn
Yi Pan, LinkedIn
Kartik Paramasivam, LinkedIn
B
Self-Service Analytics on Hadoop: Lessons Learned
Drew Leamon, Comcast
I
Make Streaming Analytics Work For You: The Devil is in the Details
Kanishk Mahajan, Hortonworks
Ryan Medlin, Neustar
C
Operating and Supporting Apache HBase - Best Practices and Improvements
Tanvir Kherada, Hortonworks
Enis Soztutar, Hortonworks
G
Apache Eagle - Secure Hadoop in Real Time
Hao Chen, eBay Inc.
Ralph Su, eBay Inc.
S
Real Time Visualizing Machine Learning with Spark
Chester Chen, GoPro
P
Filling the Data Lake
Chuck Yarbrough, Pentaho
Mark Burnette, Pentaho
I
The Stream is the Database - Revolutionizing Healthcare Data Architecture
Will Ochandarena, MapR Technologies
Brad Anderson, Liaison Technologies
5:50PM - 6:30PM
G
Managing a Large Multi-tenant Data Lake
Ray Harrison, Comcast
Michael Fagan, Comcast
C
Hybrid Data Platform – Cloud Environment Connected with On-premise Data Environment
Shankar Radhakrishnan, Impetus
M
High-Scale Entity Resolution in Hadoop
Thomas Schweiger, eBay, Inc
Gurpreet Singh, eBay, Inc
D
Presto, What's New in SQL-on-Hadoop and Beyond
Kamil Bajda-Pawlikowski, Teradata
Martin Traverso, Facebook
I
Scalable Optical Character Recognition with Apache NiFi and Tesseract
Casey Stella, Hortonworks
Michael Miklavcic, Hortonworks
I
From Device to Data Center to Insights: Architectural Considerations for the Internet of Anything
P. Taylor Goetz, Hortonworks
I
Resource Aware Scheduling in Storm
Boyang (Jerry) Peng, Yahoo! Inc.
M
"The Path to Wellness Through Big Data"
Roy Wilds, PHEMI Systems
Mary Caire MD, MARYCAIREMD
B
Disrupting Insurance with Advanced Analytics – The Next Generation Carrier How Motorist Leapfrogged into the Future of Analytics and Data
Alan Byers, Motorists Insurance Group
Sanjeev Kumar, Saama Technologies
6:30PM - 9:30PM
10 Years of Hadoop Cocktail Party
Thursday, June 30, 2016
Business Tracks:
M
Modern Data Applications
B
Business Adoption
Technical Tracks:
A
Apache Committer Insights
D
Application Development
C
Cloud and Operations
S
Data Science, Analytics and Spark
F
Future of Apache Hadoop
G
Governance and Security
I
IoT and Streaming
P
Sponsor
RoomsBALLROOM ABALLROOM BBALLROOM C210A210C230A230C211212LL21A
7:30AM - 4:00PM
Registration Open
9:00AM - 11:00AM
Keynote
11:00AM - 11:30AM
Coffee Break in the Community Showcase
Data Science Crash Course
(11:00AM - 1:30PM)
11:30AM - 12:10PM
M
War on Stealth Cyberattacks that Target Unknown Vulnerabilities
George Vetticaden, Hortonworks
James Sirota, Hortonworks
A
Cross-DC Fault-Tolerant ViewFileSystem at Twitter
Gera Shegalov, Twitter
Ming Ma, Twitter
P
IoT, Big Data, Cloud – the Convergence of Marketing Terms?
Joanna Schloss, Dell Software
G
State of Security in Spark
Vinay Shukla, Hortonworks
F
The Future of Apache Storm
P. Taylor Goetz, Hortonworks
D
Yahoo’s Experience Running Pig on Tez at Scale
Rohini Palaniswamy, Yahoo! Inc.
Jon Eagles, Yahoo! Inc.
F
Meeting Performance Goals in Multi-tenant Hadoop Clusters
Brian Majeska, YP
Shivnath Babu, Duke University and Unravel Data Systems
B
A Data Lake and a Data Lab to Optimize Operations and Safety Within a Nuclear Fleet
Marie-Luce Picard, EDF
A
Hadoop & Cloud Storage: Object Store Integration in Production
Rajesh Balamohan, Hortonworks
Chris Nauroth, Hortonworks
12:20PM - 1:00PM
F
(Big data) Squared: How YARN Timeline Service v.2 Unlocks 360-Degree Platform Insights at Scale
Sangjin Lee, Twitter Inc.
Li Lu, Hortonworks
A
How We Re-Engineered Phoenix with a Cost-Based Optimizer Based on Calcite
Julian Hyde, Hortonworks
Maryann Xue, Intel
S
Distributed Deep Learning on Hadoop Clusters
Andy Feng, Yahoo!
Jun Shi, Yahoo!
A
LLAP: Sub-Second Analytical Queries in Hive
Gopal Vijayaraghavan, Hortonworks
G
HIPAA Compliance in the Public Cloud
Christopher Crosbie, AWS
Jonathan Fritz, Amazon Web Services
S
Big Data Heterogeneous Mixture Learning on Spark
Masato Asahara, NEC
Ryohei Fujimaki, NEC
I
Embeddable Data Transformation for Real-Time Streams
Joey Echeverria, Rocana
B
It’s Time: Launching Your Advanced Analytics Program for Success in a Mature Industry Like Oil and Gas
Kelly Cook, ConocoPhillips
Kelly Kohlleffel, Hortonworks
I
Turning the Stream Processor into a Database: Building Online Applications on Streams
Stephan Ewen, data Artisans
1:00PM - 2:10PM
Lunch in the Community Showcase
2:10PM - 2:50PM
S
Combining Machine Learning Frameworks with Apache Spark
Timothy Hunter, Databricks
C
Hadoop in the Cloud – The What, Why and How from the Experts
Nishant Thacker, Microsoft
M
The Evolution of Big Data Pipelines at Intuit
Rekha Joshi, Intuit
Lokesh Rajaram, Intuit
A
The Columnar Era: Leveraging Parquet and Kudu for High-Performance Analytics
Julien Le Dem, Dremio
Amit Hadke, Dremio
P
Managing Hadoop, HBase, and Storm Clusters at Yahoo Scale
Savitha Ravikrishnan, Yahoo!
Dheeraj Kapur, Yahoo!
P
Integrating Apache Spark and NiFi for Data Lakes
Ron Bodkin, Think Big a Teradata Company
Scott Reisdorf, Think Big a Teradata Company
I
Lego-Like Building Blocks of Storm and Spark-Streaming Pipelines for Rapid IOT and Streaming Analytics App Development
Anand Venugopal, Impetus Technologies
Punit Shah, Impetus Technologies
A
Debugging YARN Cluster in Production
Jian He, Hortonworks
Ram Venkatesh, Hortonworks
I
Internet Of Things: What about Data Storage?
Vladimir Rodionov, Hortonworks
3:00PM - 3:40PM
G
Instilling Confidence and Trust - Big Data Security & Governance
Nick Curcuru, MasterCard
P
Cost and Resource Tracking for Hadoop
Kendall Thrapp, Yahoo!
I
Zero Downtime App Deployment Using Hadoop
Hemananthan Duraiswamy, Hortonworks
Wei Wang, Hortonworks
S
Real-time, Streaming Advanced Analytics, Approximations, and Recommendations using Apache Spark ML/GraphX, Kafka Stanford CoreNLP, and Twitter Algebird
Christopher Fregly, PipelineIO
D
Phoenix + HBase: An Enterprise Grade Data-Warehouse Appliance for Interactive Analytics?
Ankit Singhal, Hortonworks
RajeshBabu Chintaguntla, Hortonworks
B
Customer Journey - Sentiment Analysis for Fashion Retail
Steve Howard, EXPRESS
Eric Thorsen, Hortonworks
A
Ingest and Stream Processing - What Will You Choose?
Anand Iyer, Cloudera
Pat Patterson, StreamSets
M
Big Data Ready Enterprise Framework
Rahul Sarda, Wipro Technologies
Arijit Banerjee, Wipro Technologies
F
Next Gen Big Data Analytics with Apache Apex
Thomas Weise, DataTorrent
Pramod Immaneni, DataTorrent

Apache Spark Crash Course
(3:00PM - 6:00PM)
3:40PM - 4:10PM
Coffee Break
4:10PM - 4:50PM
D
Apache Beam: A Unified Model for Batch and Streaming Data Processing
Davor Bonaci, Google Inc.
S
Building A Scalable Data Science Platform with R
Mario Inchiosa, Microsoft
M
Modernizing Your Company’s Data Ecosystem
Evan Levy, SAS
G
Extend Governance in Hadoop with Atlas Ecosystem
Andrew Ahn, Hortonworks
Mohan Sadashiva, Waterline Data
F
Effective Spark on Multi-Tenant Clusters
Kostas Sakellis, Cloudera
A
GoodFit - An Efficient MRP (Multi-Resource Packing) Allocator for YARN
Arun Suresh, Microsoft
Srikanth Kandula, Microsoft
P
Swimming Across the Data Lake - Lessons Learned and Keys to Success!
Vineet Tyagi, Impetus
D
The DAP: Where Yarn, HBase, Kafka and Spark go to Production
Jonathan Gray, Cask
I
Fighting Fraud in Real Time by Processing 1M+ TPS Using Storm on Slider (YARN)
Nitin Aggarwal, Rocket Fuel inc.
Ishan Chhabra, Rocketfuel Inc.
5:00PM - 7:00PM
BOF:
Apache Spark, Apache Zeppelin & Data Science
BOF:
Cloud & Operations
BOF:
Streaming & Data Flow
BOF:
Security & Governance
BOF:
Apache Hive & Apache Pig
BOF:
Apache Hadoop - YARN BoF
BOF:
Apache Hadoop - HDFS BoF
BOF:
Apache HBase BoF

FILTER:
Full Schedule
Apache Committer Insights
Application Development
Cloud and Operations
Data Science, Analytics and Spark
Future of Apache Hadoop
Governance and Security
IoT and Streaming
Modern Data Applications
Sponsor
Business Adoption


Sunday, June 26, 2016
8:30AM - 5:00PM
Monday, June 27, 2016
8:30AM - 5:00PM
3:00PM - 7:00PM
Registration Open
6:00PM - 8:00PM
Meetups
Tuesday, June 28, 2016
7:30AM - 7:30PM
Registration Open
9:00AM - 11:00AM
11:00AM - 1:30PM
Apache Hadoop Crash Course
11:00AM - 11:30AM
Coffee Break in the Community Showcase
11:30AM - 12:10PM
How Macy's Creates Operational Insight on Hadoop.
Seetha Chakrapany, Macy's
Room: BALLROOM A
What the #$* is a Business Catalog and Why You Need It!
Andrew Ahn, Hortonworks
Room: BALLROOM B
From Zero to Data Flow in Hours with Apache NiFi
Chris Herrera, Schlumberger
Room: BALLROOM C
Open Source Ingredients for Interactive Data Analysis in Spark
Maxim Lukiyanov, Microsoft
Room: 210A
Cloudbreak - Internals Deep Dive
Krisztian Horvath, Hortonworks
Janos Matyas, Hortonworks
Room: 210C
Big Data for Managers: From Hadoop to Streaming and Beyond
Vladimir Bacvanski, SciSpike
Room: 230A
Operationalizing YARN Based Hadoop Clusters in the Cloud - Lessons and Opportunities
Abhishek Modi, Qubole Inc.
Room: 230C
Model Management Demo in the Lambda Architecture
Chris Kang, Accenture
Room: 211
7 Enterprise Case Studies of IoT, Streaming Analytics, and Real Business Value
Mark Palmer, TIBCO Software Inc.
Room: 212
12:20PM - 1:00PM
Faster, Faster, Faster!: The True Story of a Mobile Analytics Data Mart on Hive
Mithun Radhakrishnan, Yahoo!
Josh Walters, Yahoo!
Room: BALLROOM A
Evolving HDFS to a Generalized Distributed Storage Subsystem
Sanjay Radia, Hortonworks
Jitendra Pandey, Hortonworks
Room: BALLROOM B
Streaming in the Wild with Apache Flink
Kostas Tzoumas, data Artisans
Room: BALLROOM C
Near Real-time Outlier Detection and Interpretation
Robert Thorman, AT&T
Adam Fuchs, Sqrrl Data, Inc.
Room: 210A
The "Connected Vehicle" (IoT and Streaming) – Supporting the Mission to Become Both an Automotive and Mobility Company
Daniel Totten, Ford Motor Company
Tom Bryans, Ford Motor Company
Room: 210C
Apache Ambari – Simplified Cluster Operation and Troubleshooting (including demo)
Jayush Luniya, Hortonworks
Alejandro Fernandez, Hortonworks
Room: 230A
Building a Graph Database in Neo4j with Spark & Spark SQL to Gain New Insights from Log Data
Robert Hryniewicz, Hortonworks
Rachel Poulsen, TiVo
Room: 230C
Knowledge from Noise: Geospatial Analytics at Progressive Insurance
Brian Durkin, Progressive
Room: 211
There is a New Ranger in Town! End-to-End Security and Auditing in a Big-Data-as-a-Service Deployment
Nanda Vijaydev, BlueData
Abhiraj Butala, BlueData
Room: 212
1:00PM - 2:10PM
Lunch in the Community Showcase
Women in Big Data Lunch in Room LL21E
2:10PM - 2:50PM
A Multi-Colored YARN: Apps and First-Class Support for Services
Vinod Kumar Vavilapalli, Hortonworks
Room: BALLROOM A
In-Flux Limiting for a Multi-Tenant Logging Service
Ambud Sharma, Symantec Corporation
Suma Cherukuri, Symantec Corporation
Room: BALLROOM B
What is Data? And What Are You Doing?
Russell Foltz-Smith, RFS Productions
Room: BALLROOM C
Increasing Hadoop Resiliency & Performance with EMC Isilon
Boni Bruno, EMC
Room: 210A
Building a Smarter Home with Nifi and Spark
Joseph Niemiec, Hortonworks
Christopher Gambino, Hortonworks
Room: 210C
Governed Self Service Analytics at eBay
HU LIANG, eBay
Room: 230A
Big Data in the Cloud, the Time has Come
Thomas Phelan, BlueData Inc.
Kris Applegate, Dell Inc.
Room: 230C
Hdfs Analysis for Small Files
Rohit Jangid, Expedia
Raman Goyal, Expedia
Room: 211
Preventative Maintenance of Robots in Automotive Industry
Amit Kumar, Cisco
Ari Flink, Cisco
Room: 212
3:00PM - 6:00PM
Apache Spark Crash Course
3:00PM - 3:40PM
The Industrial Internet: Big Data, Intelligent Machines, and Smarter Workforce
Uday Tennety, GE Digital
Room: BALLROOM A
HDFS: Optimization, Stabilization and Supportability
Arpit Agarwal, Hortonworks
Chris Nauroth, Hortonworks
Room: BALLROOM B
Successes, Challenges and Pitfalls Migrating a SAAS Business to Hadoop
Shaun Klopfenstein, Marketo
Eric Kienle, Marketo
Room: BALLROOM C
On-Demand HDP Clusters Using Cloudbreak and Ambari
Vivek Madani, Symantec Corporation
Karthik Karuppaiya, Symantec Corporation
Room: 210A
Investigating the Effects of Over Committing YARN Resources
Jason Lowe, Yahoo! Inc.
Room: 210C
Analysis of Major Trends in Big Data Analytics
Slim Baltagi, Capital One Financial Corp.
Room: 230A
Solving Performance Problems on Hadoop to Move Data Analytics Workloads into Production
Tyler Mitchell, Actian Corporation
Room: 230C
End-to-End Processing of 3.7 Million Telemetry Events per Second Using Lambda Architecture
Saurabh Mishra, Hortonworks
Raghavendra Nandagopal, Symantec Corporation
Room: 211
Deep Learning Using DL4J and Spark on HDP for Fun and Profit
Adam Gibson, Skymind
Dhruv Kumar, Hortonworks
Room: 212
3:40PM - 4:10PM
Coffee Break in the Community Showcase
4:10PM - 4:50PM
Google Cloud Platform Empowers TensorFlow and Machine Learning
Kaz Sato, Google Inc.
Room: BALLROOM A
HDFS Tiered Storage
Virajith Jalaparti, Microsoft
Chris Douglas, Microsoft
Room: BALLROOM B
The Life of an Internet of Things (IoT) Electron; its Journey to Become a Positive Influence for Something Greater
Peter Crossley, Webtrends
Room: BALLROOM C
Application of Active Learning for Fraud Labeling @PayPal
Venkatesh Ramanathan, PayPal Inc
Room: 210A
Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Jianfeng Zhang, Hortonworks
Room: 210C
How Hadoop and a Modern Data Platform Can Enable Transformation in Healthcare
Beata Puncevic, Blue Cross Blue Shield of Michigan
Room: 230A
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: Today's ETL Does it All!
Scott Gnau, Hortonworks
Tendu Yogurtcu, Syncsort
Room: 230C
H2O: A Platform for Big Math
Arno Candel, h2o.ai
Room: 211
Top Three - Big Data Governance Issues and How Apache ATLAS resolves it for the Enterprise
Andrew Ahn, Hortonworks
Room: 212
5:00PM - 5:40PM
File Format Benchmark - Avro, JSON, ORC, and Parquet
Owen O`Malley, Hortonworks
Room: BALLROOM A
IoT, Streaming Analytics and Machine Learning: Delivering Real-Time Intelligence With Apache NiFi
Paul Kent, SAS Institute Inc.
Dan Zaratsian, SAS
Room: BALLROOM B
SQL and Solr Search with Spark for Big Data Analytics in Your Browser
Romain Rigaux, Cloudera
Room: BALLROOM C
Introducing Kafka Streams, the New Stream Processing Library of Apache Kafka
Guozhang Wang, Confluent
Room: 210A
LEGO: Data Driven Growth Hacking Powered by Big Data
Kamal Duggireddy, Salesforce.com
Prashant Gokhale, Salesforce.com
Room: 210C
Data Preparation and Munging for Data Science: A Field Guide
Casey Stella, Hortonworks
Room: 230A
How to Build a Successful Data Lake
Alex Gorelik, Waterline Data
Room: 230C
Bridging the Gap of Relational to Hadoop Using Sqoop @ Expedia
Shashank Tandon, Expedia
Kopal Niranjan, Expedia
Room: 211
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Learning DSLs
Murali Kaundinya, Merck
Gopi Janakiraman, Merck
Room: 212
5:50PM - 6:30PM
Netflix - Productionizing Spark on YARN for ETL at Petabyte Scale
Ashwin Shankar, Netflix
Nezih Yigitbasi, Netflix
Room: BALLROOM A
Keep your Hadoop Cluster at its Best!
Sheetal Dolas, Hortonworks
Chris Nauroth, Hortonworks
Room: BALLROOM B
Spark SQL versus Apache Drill: Different Tools with Different Rules
Ted Dunning, MapR Technologies
Room: BALLROOM C
Workload Automation + Hadoop? Oh Yeah! …a Match Made in Heaven
Darren Chinen, Malwarebytes
Room: 210A
Scheduling Policies and Resource Types in YARN
Varun Vasudev, Hortonworks
Wangda Tan, Hortonworks
Room: 210C
HDP @ MD Anderson - Starting the Hadoop Journey at a Global Leader in Cancer Research
Vamshi Punugoti, MD Anderson Cancer Center
Bryan Lari, MD Anderson Cancer Center
Room: 230A
Instrument Your Instruments: Data-Driven Ops
Premal Shah, 6sense
Room: 230C
Processing and Retrieval of Geotagged Unmanned Aerial System Telemetry
Kristopher Kane, Hortonworks
Room: 211
Hive Metastore Security of Apache Ranger
Yan Zhou, IBM
Tanping Wang, IBM
Room: 212
6:30PM - 7:30PM
Exhibitor Reception
Wednesday, June 29, 2016
7:30AM - 6:30PM
Registration Open
9:00AM - 11:00AM
1:00AM - 1:30PM
Apache Nifi Crash Course
11:00AM - 11:30AM
Coffee Break in the Community Showcase
11:30AM - 12:10PM
Spark Uber Development Kit
Kelvin Chu, Uber
Gang Wu, Uber
Room: BALLROOM A
To Infinity and Beyond – Datacenter Scale YARN Clusters through Federation
Subru Krishnan, Microsoft
Kishore Chaliparambil, Microsoft
Room: BALLROOM B
Extending Hortonworks with Oracle’s Big Data Platform
Paul Miller, Oracle
Room: BALLROOM C
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with Apache Kafka
Jun Rao, Confluent
Room: 210A
A New "Sparkitecture" for Modernizing Your Data Warehouse
Ranga Nathan, HPE
Room: 210C
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
Julian Hyde, Hortonworks
Room: 230A
Accelerating Data Warehouse Migration to Hadoop
Ajay Anand, Kyvos Insights, Inc.
Vineet Tyagi, Impetus
Room: 230C
Navigating the World of User Data Management and Data Discovery
Smiti Sharma, EMC
Room: 211
Hadoop Application Architectures - Fraud Detection
Nishant Thacker, Microsoft
Room: 212
12:20PM - 1:00PM
Machine Learning for Any Size of Data, Any Type of Data
Apoorv Saxena, Google
Room: BALLROOM A
Real-Time Hadoop: Keys for Success from Streams to Queries
Ted Dunning, MapR Technologies
Room: BALLROOM B
Debunking the Myths of HDFS Erasure Coding Performance
Zhe Zhang, LinkedIn
Uma Maheswara Rao Gangumalla, Intel
Room: BALLROOM C
Using Hadoop for Cognitive Analytics
Dr. Pedro Desouza, IBM
Room: 210A
Performance Comparison of Streaming Big Data Platforms
Reza Farivar, Capital One
Kyle Nusbaum, Yahoo!
Room: 210C
ACID Transactions in Hive
Eugene Koifman, Hortonworks
Room: 230A
Quark: Simplify and Optimize SQL Queries Across Hadoop and RDBMS
Rajat Venkatesh, Qubole
Room: 230C
Tracing Your Security Telemetry With Apache Metron
Justin Leet, Hortonworks
Room: 211
Beyond TCO: Architecting Hadoop for Adoption and Data Applications
Reid Levesque, RBC
Room: 212
1:00PM - 2:10PM
Lunch in the Community Showcase
2:10PM - 2:50PM
The Next Generation of Data Processing & OSS
James Malone, Google
Room: BALLROOM A
Enterprise-Grade Streaming Under 2ms on Hadoop
Vijay Bhat, Capital One
Room: BALLROOM B
Apache HBase - State of the Union
Enis Soztutar, Hortonworks
Room: BALLROOM C
Apache Kafka Security
Parth Brahmbhatt, Netflix
Sriharsha Chintalapani, Hortonworks
Room: 210A
Blink−Improved Runtime for Flink and its Application in Alibaba Search
Xiaowei Jiang, Alibaba Inc
Feng Wang, Alibaba Inc
Room: 210C
Yahoo!'s Next-Generation User Profile Platform
Kai Liu, Yahoo! Inc.
Lu Niu, Yahoo! Inc.
Room: 230A
Simultaneous Localization and Mapping (SLAM) with Kafka and Spark Streaming
Jay White Bear, IBM
Room: 230C
Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)
Ancil McBarnett, Hortonworks
Pardeep Kumar, Hortonworks
Room: 211
Lambda Architecture: How we Merged Batch and Real-Time
Sewook Wee, Trulia
Sotos Matzanas, Trulia
Room: 212
3:00PM - 6:00PM
Internet of Things Crash Course
3:00PM - 3:40PM
Apache Hive 2.0 SQL Speed Scale
Alan Gates, Hortonworks
Room: BALLROOM A
Scalable Realtime Analytics using Druid
Slim Bouguerra, HortonWorks
Nishant Bangarwa, Hortonworks
Room: BALLROOM B
Toward Better Multi-Tenancy Support from HDFS
Xiaoyu Yao, Hortonworks
Room: BALLROOM C
Omid: A Transactional Framework for HBase
Francisco Perez-Sorrosal, Yahoo! Inc.
Ohad Shacham, Yahoo Research
Room: 210A
Building and Managing Large Scale Data Pipelines with Complex Dependencies Using Apache Oozie
Purshotam Shah, Yahoo!
Room: 210C
Automated Systems for Loan Decisions Using AKKA and Spark
Fredrick Crable, Capital One
Room: 230A
Future of Apache Hadoop – An Enterprise Architecture View
Oliver Halter, PwC
Ritesh Ramesh, PwC
Room: 230C
"I'm Being Followed by Drones!" The Impact of IoT on the Future of Unmanned Aerial Systems
Kenneth Kranz, Cognizant
Room: 211
Prescient Keeps Travelers Safe with Natural Language Processing and Geospatial Analytics
Mike Bishop, Prescient
Room: 212
3:40PM - 4:10PM
Coffee Break in the Community Showcase
4:10PM - 4:50PM
The Architectural Journey to our Modern Data Applications – DSC (Data Supply Chain)
Daniel Totten, Ford Motor Company
Tom Bryans, Ford Motor Company
Room: BALLROOM A
The Ecosystem is Too Damn Big
Andrew Brust, Datameer
Room: BALLROOM B
The Intuit Analytics Cloud 101
Tilmann Bruckhaus, Intuit
Room: BALLROOM C
Fine-Grained Security for Spark and Hive
Carter Shanklin, Hortonworks
Don Bosco Durai, Hortonworks
Room: 210A
Hive Hbase Metastore - Improving Hive with a Big Data Metadata Storage
Daniel Dai, Hortonworks
Vaibhav Gumashta, Hortonworks
Room: 210C
Statistical Analysis of Genomic Data with Hadoop
Jay Etchings, Arizona State University
Room: 230A
Designing and Implementing Your IoT Solution with Open Source
Sunil Patil, mapr
Sridhar Reddy, MapR Technologies
Room: 230C
Analyzing Telecom Fraud at Hadoop Scale
Sanjay Vyas, Diyotta
Room: 211
Building a Data Analytics PaaS for Smart Cities
Smiti Sharma, EMC
Keith Manthey, EMC
Room: 212
5:00PM - 5:40PM
Reliable and Scalable Data Ingestion at Airbnb
Krishna Puttaswamy, Airbnb
Jason Zhang, Airbnb
Room: BALLROOM A
Lambda-less Stream Processing @ Scale in LinkedIn
Yi Pan, LinkedIn
Kartik Paramasivam, LinkedIn
Room: BALLROOM B
Self-Service Analytics on Hadoop: Lessons Learned
Drew Leamon, Comcast
Room: BALLROOM C
Make Streaming Analytics Work For You: The Devil is in the Details
Kanishk Mahajan, Hortonworks
Ryan Medlin, Neustar
Room: 210A
Operating and Supporting Apache HBase - Best Practices and Improvements
Tanvir Kherada, Hortonworks
Enis Soztutar, Hortonworks
Room: 210C
Apache Eagle - Secure Hadoop in Real Time
Hao Chen, eBay Inc.
Ralph Su, eBay Inc.
Room: 230A
Real Time Visualizing Machine Learning with Spark
Chester Chen, GoPro
Room: 230C
Filling the Data Lake
Chuck Yarbrough, Pentaho
Mark Burnette, Pentaho
Room: 211
The Stream is the Database - Revolutionizing Healthcare Data Architecture
Will Ochandarena, MapR Technologies
Brad Anderson, Liaison Technologies
Room: 212
5:50PM - 6:30PM
Managing a Large Multi-tenant Data Lake
Ray Harrison, Comcast
Michael Fagan, Comcast
Room: BALLROOM A
Hybrid Data Platform – Cloud Environment Connected with On-premise Data Environment
Shankar Radhakrishnan, Impetus
Room: BALLROOM B
High-Scale Entity Resolution in Hadoop
Thomas Schweiger, eBay, Inc
Gurpreet Singh, eBay, Inc
Room: BALLROOM C
Presto, What's New in SQL-on-Hadoop and Beyond
Kamil Bajda-Pawlikowski, Teradata
Martin Traverso, Facebook
Room: 210A
Scalable Optical Character Recognition with Apache NiFi and Tesseract
Casey Stella, Hortonworks
Michael Miklavcic, Hortonworks
Room: 210C
From Device to Data Center to Insights: Architectural Considerations for the Internet of Anything
P. Taylor Goetz, Hortonworks
Room: 230A
Resource Aware Scheduling in Storm
Boyang (Jerry) Peng, Yahoo! Inc.
Room: 230C
"The Path to Wellness Through Big Data"
Roy Wilds, PHEMI Systems
Mary Caire MD, MARYCAIREMD
Room: 211
Disrupting Insurance with Advanced Analytics – The Next Generation Carrier How Motorist Leapfrogged into the Future of Analytics and Data
Alan Byers, Motorists Insurance Group
Sanjeev Kumar, Saama Technologies
Room: 212
6:30PM - 9:30PM
10 Years of Hadoop Cocktail Party
Thursday, June 30, 2016
7:30AM - 4:00PM
Registration Open
9:00AM - 11:00AM
1:00AM - 1:30PM
Data Science Crash Course
11:00AM - 11:30AM
Coffee Break in the Community Showcase
11:30AM - 12:10PM
War on Stealth Cyberattacks that Target Unknown Vulnerabilities
George Vetticaden, Hortonworks
James Sirota, Hortonworks
Room: BALLROOM A
Cross-DC Fault-Tolerant ViewFileSystem at Twitter
Gera Shegalov, Twitter
Ming Ma, Twitter
Room: BALLROOM B
IoT, Big Data, Cloud – the Convergence of Marketing Terms?
Joanna Schloss, Dell Software
Room: BALLROOM C
State of Security in Spark
Vinay Shukla, Hortonworks
Room: 210A
The Future of Apache Storm
P. Taylor Goetz, Hortonworks
Room: 210C
Yahoo’s Experience Running Pig on Tez at Scale
Rohini Palaniswamy, Yahoo! Inc.
Jon Eagles, Yahoo! Inc.
Room: 230A
Meeting Performance Goals in Multi-tenant Hadoop Clusters
Brian Majeska, YP
Shivnath Babu, Duke University and Unravel Data Systems
Room: 230C
A Data Lake and a Data Lab to Optimize Operations and Safety Within a Nuclear Fleet
Marie-Luce Picard, EDF
Room: 211
Hadoop & Cloud Storage: Object Store Integration in Production
Rajesh Balamohan, Hortonworks
Chris Nauroth, Hortonworks
Room: 212
12:20PM - 1:00PM
(Big data) Squared: How YARN Timeline Service v.2 Unlocks 360-Degree Platform Insights at Scale
Sangjin Lee, Twitter Inc.
Li Lu, Hortonworks
Room: BALLROOM A
How We Re-Engineered Phoenix with a Cost-Based Optimizer Based on Calcite
Julian Hyde, Hortonworks
Maryann Xue, Intel
Room: BALLROOM B
Distributed Deep Learning on Hadoop Clusters
Andy Feng, Yahoo!
Jun Shi, Yahoo!
Room: BALLROOM C
LLAP: Sub-Second Analytical Queries in Hive
Gopal Vijayaraghavan, Hortonworks
Room: 210A
HIPAA Compliance in the Public Cloud
Christopher Crosbie, AWS
Jonathan Fritz, Amazon Web Services
Room: 210C
Big Data Heterogeneous Mixture Learning on Spark
Masato Asahara, NEC
Ryohei Fujimaki, NEC
Room: 230A
Embeddable Data Transformation for Real-Time Streams
Joey Echeverria, Rocana
Room: 230C
It’s Time: Launching Your Advanced Analytics Program for Success in a Mature Industry Like Oil and Gas
Kelly Cook, ConocoPhillips
Kelly Kohlleffel, Hortonworks
Room: 211
Turning the Stream Processor into a Database: Building Online Applications on Streams
Stephan Ewen, data Artisans
Room: 212
1:00PM - 2:10PM
Lunch in the Community Showcase
2:10PM - 2:50PM
Combining Machine Learning Frameworks with Apache Spark
Timothy Hunter, Databricks
Room: BALLROOM A
Hadoop in the Cloud – The What, Why and How from the Experts
Nishant Thacker, Microsoft
Room: BALLROOM B
The Evolution of Big Data Pipelines at Intuit
Rekha Joshi, Intuit
Lokesh Rajaram, Intuit
Room: BALLROOM C
The Columnar Era: Leveraging Parquet and Kudu for High-Performance Analytics
Julien Le Dem, Dremio
Amit Hadke, Dremio
Room: 210A
Managing Hadoop, HBase, and Storm Clusters at Yahoo Scale
Savitha Ravikrishnan, Yahoo!
Dheeraj Kapur, Yahoo!
Room: 210C
Integrating Apache Spark and NiFi for Data Lakes
Ron Bodkin, Think Big a Teradata Company
Scott Reisdorf, Think Big a Teradata Company
Room: 230A
Lego-Like Building Blocks of Storm and Spark-Streaming Pipelines for Rapid IOT and Streaming Analytics App Development
Anand Venugopal, Impetus Technologies
Punit Shah, Impetus Technologies
Room: 230C
Debugging YARN Cluster in Production
Jian He, Hortonworks
Ram Venkatesh, Hortonworks
Room: 211
Internet Of Things: What about Data Storage?
Vladimir Rodionov, Hortonworks
Room: 212
3:00PM - 6:00PM
Apache Spark Crash Course
3:00PM - 3:40PM
Instilling Confidence and Trust - Big Data Security & Governance
Nick Curcuru, MasterCard
Room: BALLROOM A
Cost and Resource Tracking for Hadoop
Kendall Thrapp, Yahoo!
Room: BALLROOM B
Zero Downtime App Deployment Using Hadoop
Hemananthan Duraiswamy, Hortonworks
Wei Wang, Hortonworks
Room: BALLROOM C
Real-time, Streaming Advanced Analytics, Approximations, and Recommendations using Apache Spark ML/GraphX, Kafka Stanford CoreNLP, and Twitter Algebird
Christopher Fregly, PipelineIO
Room: 210A
Phoenix + HBase: An Enterprise Grade Data-Warehouse Appliance for Interactive Analytics?
Ankit Singhal, Hortonworks
RajeshBabu Chintaguntla, Hortonworks
Room: 210C
Customer Journey - Sentiment Analysis for Fashion Retail
Steve Howard, EXPRESS
Eric Thorsen, Hortonworks
Room: 230A
Ingest and Stream Processing - What Will You Choose?
Anand Iyer, Cloudera
Pat Patterson, StreamSets
Room: 230C
Big Data Ready Enterprise Framework
Rahul Sarda, Wipro Technologies
Arijit Banerjee, Wipro Technologies
Room: 211
Next Gen Big Data Analytics with Apache Apex
Thomas Weise, DataTorrent
Pramod Immaneni, DataTorrent
Room: 212
3:40PM - 4:10PM
Coffee Break
4:10PM - 4:50PM
Apache Beam: A Unified Model for Batch and Streaming Data Processing
Davor Bonaci, Google Inc.
Room: BALLROOM A
Building A Scalable Data Science Platform with R
Mario Inchiosa, Microsoft
Room: BALLROOM B
Modernizing Your Company’s Data Ecosystem
Evan Levy, SAS
Room: BALLROOM C
Extend Governance in Hadoop with Atlas Ecosystem
Andrew Ahn, Hortonworks
Mohan Sadashiva, Waterline Data
Room: 210A
Effective Spark on Multi-Tenant Clusters
Kostas Sakellis, Cloudera
Room: 210C
GoodFit - An Efficient MRP (Multi-Resource Packing) Allocator for YARN
Arun Suresh, Microsoft
Srikanth Kandula, Microsoft
Room: 230A
Swimming Across the Data Lake - Lessons Learned and Keys to Success!
Vineet Tyagi, Impetus
Room: 230C
The DAP: Where Yarn, HBase, Kafka and Spark go to Production
Jonathan Gray, Cask
Room: 211
Fighting Fraud in Real Time by Processing 1M+ TPS Using Storm on Slider (YARN)
Nitin Aggarwal, Rocket Fuel inc.
Ishan Chhabra, Rocketfuel Inc.
Room: 212

Agenda is subject to change.