SPEAKERS

Andrew Ahn

Hortonworks
Andrew Ahn is an Apache Atlas committer and currently works for Hortonworks as the product manager for Apache Falcon and Apache Atlas. Prior work includes Product and Governance duties for the NYSE Euronext, spanning 12 countries and 23 market centers.

Session(s):
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditional EDW and Hadoop
April 13, 2016, 14:10 - 14:50
Wicklow Hall 2B

Why Big Data Management Requires Hierarchical Taxonomies
April 14, 2016, 12:20 - 13:00
Wicklow Hall 2B
Andrew Ahn

Hortonworks
Andrew Ahn is an Apache Atlas committer and currently works for Hortonworks as the Governance product manager supporting Apache Atlas. Prior work includes Product and Governance duties for the NYSE Euronext, spanning 12 countries and 23 electronic market centers.

Session(s):
Apache Atlas: Tracking Dataset Lineage Across Hadoop Components
April 13, 2016, 16:10 - 16:50
Wicklow Hall 2A
Matthew Aslett

451 Research
As Research Director 451 Research Matt has overall responsibility for the data platforms and analytics research coverage, which includes operational and analytic databases, Hadoop, grid/cache, stream processing, search-based data platforms, data integration, data quality, data management, analytics, and advanced analytics. Matt's own primary area of focus includes data management, reporting and analytics, and exploring how the various data platforms and analytics technology sectors are converging in the form of next-generation data platforms.

Session(s):
Hadoop and Other Animals
April 14, 2016, 15:00 - 15:40
Liffey A
Marton Balassi

Hungarian Academy of Sciences
Márton Balassi is a PMC member at Apache Flink and researcher at the Hungarian Academy of Sciences. He has worked for data Artisans in Berlin. His main expertise and interest is real-time distributed data processing frameworks. His current work includes research and development on mapping the models and guarantees of different streaming systems. Márton has been a speaker at ApacheCon, Hadoop Summit and numerous Big Data related meetups recently.

Session(s):
Large-Scale Stream Processing in the Hadoop Ecosystem
April 13, 2016, 16:10 - 16:50
Liffey Hall 1

Advanced Execution Visualization of Spark Jobs
April 13, 2016, 14:10 - 14:50
Liffey Hall 1
Slim Baltagi

Capital One Financial Corporation
Slim Baltagi (@SlimBaltagi) is currently director of Big Data engineering at Capital One. He has more than 18 years of IT and business experience and has spent the last five years of his life hadooping and more recently sparking and flinking! He has worked on more than 12 Big Data projects as a solution architect. He enjoys evangelizing about Big Data technologies and maintaining a Big Data Knowledge Base: Hadoop, Spark, Flink, ...With some fellow squirrels, Slim also runs the Chicago Apache Flink Meetup, the Washington DC Area Apache Flink Meetup, the Boston Flink Meetup and the Paris Flink Meetup.

Session(s):
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
April 13, 2016, 17:50 - 18:30
Liffey Hall 2
Reuben Banga

Centrica
Reuben is a Big Data Infrastructure & Operations Manager within Centrica Plc. He joined the company in 2014 having spent 3 years providing Consultancy to the KCC ICS Programme and previously 10 years at Ocado Group advising from its start up phase to their IPO. With over 20 years industry experience Reuben has a 1st Class Honours Mathematics Degree from Nottingham Trent University.

Session(s):
Rocking the World of Big Data at Centrica
April 13, 2016, 15:00 - 15:40
Liffey B
Hellmar Becker

ING
Hellmar has worked in a number of positions in big data analytics and digital analytics. Currently working at ING Bank, implementing Datalake Foundation project (based on Hadoop) within Client Information management. Long standing experience in advanced analytics and data management. Speaker engagements at Hadoop Summit Brussels 2015 (https://www.youtube.com/watch?v=AhT-nxoEkbg), at Apache Big Data Europe 2015 (http://events.linuxfoundation.org/sites/events/files/slides/Securing%20Hadoop%20in%20an%20enterprise%20context.pdf) and at various industry events in Germany, including Online Value Forum 2009 (https://www.youtube.com/watch?v=JLGxLjfsXlQ).

Session(s):
Securing Hadoop in an Enterprise Context
April 14, 2016, 11:30 - 12:10
Wicklow Hall 2B
Dr. Thomas Beer

Continental Automotive
Thomas is the Big Data lead architect at Continental's eHorizon project. Before joining Continental he was the lead architect for Big Data solutions at NTT Data. He holds a PhD in Computer Science. Juergen is one of the specialists for Digital Transformation within Continental Automotive IT department. His main focus is on Big Data solutions and architectures to support new business models. He holds a university degree in Computer Science.

Session(s):
Hadoop and Friends as Key Enabler of the IoE – Continental’s Dynamic eHorizon
April 13, 2016, 16:10 - 16:50
Wicklow Hall 2B
Kamélia Benchekroun

Renault
Big Data Experience @ Bouygues Telecom and Renault

Session(s):
The Hadoop Deployment Strategy at Renault Group
April 13, 2016, 17:50 - 18:30
Liffey A
Bryan Bende

Hortonworks
Bryan Bende is a Member of the Technical Staff at Hortonworks where he develops dataflow capabilities around the core framework of Apache NiFi, and has over ten years of experience developing enterprise software solutions. Bryan received a B.S. in Computer Science from the University of Maryland at College Park, and a M.S. in Computer Science from John Hopkins University.

Session(s):
Apache NiFi in the Hadoop Ecosystem
April 13, 2016, 12:20 - 13:00
Wicklow Hall 2A
Narendra Bidari

Symantec
Biography : Narendra Bidari is a Sr. Software Engineer working for the Cloud Platform Engineering team at Symantec. As part of the Cloud Platform Engineering group at Symantec, he is helping product teams in re-architecting/re-engineering the legacy systems to move to cloud. Narendra has worked on Storm/ Kafka Data Pipeline’s to Collect the data from different end points into a single cluster to enable Product teams to analyze data. The current platform ingests at the rate of 70 Billion events a day through Kafka and Storm and eventually write to Hive.

Session(s):
On-Demand HDP Clusters using Cloudbreak and Ambari
April 13, 2016, 15:00 - 15:40
Liffey A
Michael Bironneau

Open Energi
Michael is a data scientist at Open Energi. He is interested in distributed systems, web development and analysing datasets at scale using any means necessary.

Session(s):
Powering a Virtual Power Station with Big Data
April 13, 2016, 14:10 - 14:50
Wicklow Hall 1
Ron Bodkin

ThinkBig, a Teradata company
Ron Bodkin – Founder and President, Think Big, a Teradata company. Ron founded Think Big to help companies realize measurable value from Big Data. Think Big is the first and leading provider of independent consulting and integration services specifically focused on Big Data solutions. Our expertise spans all facets of data science and data engineering and helps our customers to drive maximum value from their Big Data initiative. Previously, Ron was VP Engineering at Quantcast where he led the data science and engineer teams that pioneered the use of Hadoop and NoSQL for batch and real-time decision making. Prior to that, Ron was Founder of New Aspects, which provided enterprise consulting for Aspect-oriented programming. Ron was also Co-Founder and CTO of B2B applications provider C-Bridge, which he led to team of 900 people and a successful IPO. Ron graduated with honors from McGill University with a B.S. in Math and Computer Science. Ron also earned his Master's Degree in Computer Science from MIT, leaving the PhD program after presenting the idea for C-bridge and placing in the finals of the 50k Entrepreneurship Contest.

Session(s):
The Key to Unlocking the Value in the Internet of Things? Managing the Data!
April 13, 2016, 17:50 - 18:30
Wicklow Hall 1
Alex Buijsman

ING Bank
Alex is an information specialist with a passion for data management and data warehousing. After finishing his master's degree in Information Management, he has worked in the financial industry for over 15 years; within banking, insurance and local government. His roles varied from system designer, DWH architect to enterprise architect. Alex is also experienced in the area of change management. Momentarily he works as information architect within the challengers & growth countries of ING, supporting the countries with the challanges they encounter working with data. Besides that he is co-responsible for the development and implementation of the Data Lake architecture

Session(s):
Smart Data for a Predictive Bank
April 14, 2016, 14:10 - 14:50
Liffey Hall 1
Bart Buter

ING
Bart is a Hadoop specialist with a passion for data science. After finishing his master's degree in Artificial Intelligence, he has worked within the financial industry in multiple roles; from consultant and data analyst to DevOps engineer. Besides financials, he also has a history in the software and data center industries. Currently, Bart works at an international analytics team in Germany for the ING Bank. He supports data scientists with their use cases and Hadoop based exploration environments.

Session(s):
Smart Data for a Predictive Bank
April 14, 2016, 14:10 - 14:50
Liffey Hall 1
Aaron Call

Barcelona Supercomputing Center
Aaron Call is a research support engineer at Barcelona Supercomputing Center since 2014. Currently working on the ALOJA project, a project towards characterization and prediction of Big Data cost-effectiveness in the cloud. His research is focused on analysing Hadoop on Platform as a Service and comparing its benefits over IaaS solutions.

Session(s):
Benefits of Hadoop as Platform as a Service
April 14, 2016, 17:00 - 17:40
Liffey Hall 2
Andrea Capodicasa

Capgemini
Andrea is an experienced professional with more than fifteen years of architectural, project and technical leadership experience in complex Big Data and Business Information systems.

Session(s):
No Time to Waste: From Data Warehousing to Modern Data Architecture in 4 Easy Sprints
April 13, 2016, 15:00 - 15:40
Wicklow Hall 2A
Ufuk Celebi

data Artisans GmbH
Ufuk Celebi is a PMC member at Apache Flink and software engineer at data Artisans.

Session(s):
Unified Stream & Batch Processing with Apache Flink
April 13, 2016, 12:20 - 13:00
Liffey Hall 2
Prasad Chalasani

Media Math
Prasad Chalasani is currently the SVP of Data Science at Media Math, leading the development of innovative, proprietary scalable algorithms and analytics that leverage massive amounts of data to power smarter digital marketing for the world's leading advertisers. Prior to joining Media Math, Prasad led Data Science at Yahoo Research, and before that worked for 10 years as a quantitative researcher and portfolio manager of statistical trading strategies at hedge funds and at Goldman Sachs. Prasad holds a PhD in Computer Science from CMU and BTech in Computer Science from IIT.

Session(s):
Fast Distributed Online Classification and Clustering
April 13, 2016, 16:10 - 16:50
Liffey A
Ranadip Chatterjee

Healtrix Ltd
Ranadip is a big data architect with over 13 years of core technology expertise in the industry. He is engaged as a principal architect at Barclays Investment Bank. Previously, he was a senior product engineer at Yahoo, building one of their award winning advertising product, Consumer Direct / Connect. He was a speaker at Hadoop Summit Europe 2014. He is a cofounder of London based digital healthcare startup, Healtrix, that is pioneering the use of big data in healthcare delivery to improve penetration of services in remote areas. He holds an MBA from London Business School.

Session(s):
Hadoop Helps Deliver High Quality, Low Cost Healthcare Services
April 14, 2016, 11:30 - 12:10
Wicklow Hall 1
Paul Codding

Hortonworks
Paul Codding is a Senior Product Manager focussing on Apache Hadoop Operations at Hortonworks. Paul joined Hortonworks four years ago as a Solutions Engineer and was responsible for helping customers successfully deploy Apache Hadoop at scale. With those lessons learned, he currently leads the Hortonworks SmartSense project, and helps with Apache Ambari.

Session(s):
Taming the Elephant: Efficient and Effective Apache Hadoop Management
April 13, 2016, 11:30 - 12:10
Wicklow Hall 2B
Justin Coffey

Criteo
Justin Coffey is a senior staff devlead at Criteo in charge of the Analytics Infrastructure team. He oversees (and even manages the occasional contribution to) the development of better tools to manage the petabytes of analytic data employed by hundreds of Criteo analysts and engineers across the world. With over 15 years of experience working in the Internet, Justin has worked with web technologies since their inception. Prior to working for Criteo, Justin worked in a number of Internet startups as an on-hands engineering manager helping drive explosive growth at the early stages.

Session(s):
Working with the Type Safe Scalding API
April 14, 2016, 17:00 - 17:40
Liffey Hall 1
David Darden

Big Fish Games
David manages the BI Engineering Team at Big Fish, one of the world's largest producers and distributors of casual games. This groups owns all of the massively paralleled processing platforms to support analytics and data warehousing at Big Fish. He has spent 17 years working in the technology industry with companies ranging from start-ups to Fortune 50 companies. David has spent the last 10 of those years focusing on using Agile methodologies to deliver high quality Business Intelligence projects across numerous industries.

Session(s):
It's Not the Size of Your Cluster, It's How You Use It
April 13, 2016, 11:30 - 12:10
Wicklow Hall 2A
Patrick de Vries

KPN
Patrick de Vries is an OSS manager (Demand), IT architect with 8 years experience in the mobile networks. He has a passion for data management and data warehousing. In this time he successfully led many IT architecture, design and implementation activities for operations readiness, assurance, service quality and business continuity projects. Currently, Patrick works at KPN in the Netherlands at further improvements in servicing both customer experience and operational excellence particularly within the even growing digital/online environment.

Session(s):
Connecting Everything
April 13, 2016, 17:00 - 17:40
Wicklow Hall 2A
Robby Dick

BMC Software
I have been working with the workload automation discipline in various capacities since 1994.

Session(s):
Batch is Back: Critical for Agile Application Adoption
April 13, 2016, 11:30 - 12:10
Liffey Hall 1
Sofian Djamaa

Criteo
Sofian is a passionate programmer fond of solving head-scratching problems in challenging environments such as big data architectures, low latency systems and complex algorithmic situations (machine learning, optimizations). At Criteo, Sofian is the chief sponsor of Scalding, Summingbird and other data processing frameworks.

Session(s):
Working with the Type Safe Scalding API
April 14, 2016, 17:00 - 17:40
Liffey Hall 1
Ted Dunning

MapR Technologies
Ted Dunning has been contributing to open source for decades. He likes cool algorithms and plays mandolin poorly, but enthusiastically.

Session(s):
Detecting Persistent Threats Using Sequence Statistics
April 14, 2016, 17:00 - 17:40
Wicklow Hall 1
Joey Echeverria

Rocana
Joey Echeverria is the platform technical lead at Rocana, where he builds applications for scaling IT operations built on the Apache Hadoop platform. Joey is a committer on the Kite SDK, an Apache-licensed data API for the Hadoop ecosystem. Joey was previously a software engineer at Cloudera, where contributed to several ASF projects including Apache Flume, Apache Sqoop, Apache Hadoop, and Apache HBase. Joey is also a coauthor of Hadoop Security, published by O'Reilly Media.

Session(s):
Real-time Search on Terabytes of Data Per Day: Lessons Learned
April 13, 2016, 12:20 - 13:00
Wicklow Hall 1
Simon Elliston Ball

Hortonworks
Simon is a Principal Solutions Engineer at Hortonworks, where he helps customers solve problems with Hadoop. Before that he headed up research and development of tools to make Hadoop easier for Red Gate. His love of data and data tools however came from working with a London hedge fund and a specialist ERP vendor, who taught him just how messy data can be.

Session(s):
Telematics with Hadoop and Nifi
April 14, 2016, 14:10 - 14:50
Liffey A
Gyula Fora

King
Gyula is a committer and PMC member for the Apache Flink project, currently working as a Data Warehouse Engineer at King. His main expertise and interest is real-time distributed data processing frameworks, and their connections to other big data applications. He is a core architect of Apache Flink Streaming. His current work includes research and development on several aspects of stream processing, including fault-tolerance, efficient stateful computations and streaming machine learning. Gyula has been a speaker at ApacheCon, Hadoop Summit and numerous Big Data related conferences and meetups recently on stream processing.

Session(s):
Large-Scale Stream Processing in the Hadoop Ecosystem
April 13, 2016, 16:10 - 16:50
Liffey Hall 1
Alan Gates

Hortonworks
Alan is one of the Co-founders of Hortonworks. He is an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press.

Session(s):
Hive on ACID
April 13, 2016, 17:00 - 17:40
Liffey A

Apache Hive 2.0 SQL Speed Scale
April 14, 2016, 14:10 - 14:50
Liffey B
Guillaume Germaine

EDF R&D
Guillaume is a Data Scientist at EDF R&D, focusing on Hadoop and its evolving ecosystem with a special attention for Apache Spark. He tracks cutting edge technologies and pilot projects or POC to investigate the feasibility of applying machine learning to specific real-world business problems at EDF. Prior to joining EDF, he held technical and managerial positions at AXA, building the Business Intelligence platform of the company.

Session(s):
Exploring Titan and Spark GraphX for Analyzing Time-Varying Electrical Networks
April 13, 2016, 15:00 - 15:40
Wicklow Hall 1
P. Taylor Goetz

Hortonworks
P. Taylor Goetz is the Apache Storm PMC Chair and Streaming Data Tech Lead at Hortonworks. Taylor has over 20 years of software development expertise in projects including financial transaction management, transportation logistics, DoD Command and Control systems, and Master Data Management (MDM). Taylor is an ASF Member, and as a member of the Apache Incubator PMC mentors a number of incubating projects.

Session(s):
The Future of Apache Storm
April 13, 2016, 15:00 - 15:40
Liffey Hall 2
Tugdual Grall

MapR Technologies
Tugdual Grall is Chief Technical Evangelist for EMEA at MapR, an open source advocate and a passionate developer. He currently works with the European developer communities to ease MapR, Hadoop and NoSQL adoption. Before joining MapR, Tug was Technical Evangelist at MongoDB and Couchbase. Tug has also worked as CTO at eXo Plaform and JavaEEproduct manager, and software engineer at Oracle. Tugdual is Co-Founder of the Nantes JUG (Java User Group) that holds since 2008 monthly meeting about Java ecosystem. Tugdual also writes a blog available at http://tgrall.github.io.

Session(s):
Real-World NoSQL Schema Design
April 13, 2016, 16:10 - 16:50
Liffey Hall 2
Scott Gray

IBM
Scott Gray is the lead architect for IBM Open Platform with Apache Hadoop, the heart of IBM's Hadoop distribution, BigInsights and was previously a senior architect for IBM Big SQL for Hadoop. Scott has an extensive career in the software industry focusing heavily on relational database, architecture, design, optimization and internals. Prior to working with IBM, Scott was the chief architect for ANTs Software’s SQL Skin for Sybase, a real time Sybase T-SQL to IBM SQL PL translation engine.

Session(s):
Surviving The Hadoop Revolution
April 14, 2016, 16:10 - 16:50
Wicklow Hall 1
Rashim Gupta

Microsoft Corp.
Rashim Gupta is a Product Manager in the Big Data team at Microsoft. Rashim has been at Microsoft for 4 years working on distributed systems and cloud applications. He has worked on many Big Data technologies including Azure Data Lake, Cosmos and HDInsight. Before working at Microsoft, Rashim has worked on various startups including Digital Chocolate, Magnetic, Live Gamer and Comixology. Rashim started his career as a software developer at Qualcomm. He has a MS in Computer Science from Rensselaer Polytechnic Institute and MBA from New York University Stern School of Business.

Session(s):
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
April 13, 2016, 17:00 - 17:40
Liffey Hall 1
Eshcar Hillel

Yahoo!
Eshcar Hillel is Research Scientist at Yahoo Labs. She works on scalable big data and search platforms. Most recently, she focused on HBase scalability features. Dr. Hillel received her PhD in distributed computing from Technion CS in 2011. Prior to Yahoo, she held a technical position at HP Labs.

Session(s):
HBase on Steroids with In-Memory Compaction
April 13, 2016, 17:50 - 18:30
Wicklow Hall 2A
James Hodge

Splunk
James Hodge is a Principal Product Manager at Splunk responsible for Big Data products. Prior to joining the product management team at Splunk, James led Big Data and Analytics technical sales in EMEA. Before Splunk James carried out similar roles at HP Autonomy and i2 IBM giving him a him a wealth of experience working with data analytics within large public and private sector organisations.

Session(s):
Log, I am Your Father. The Role of Machine Data in the IoT
April 14, 2016, 16:10 - 16:50
Liffey A
Jonathan Hsieh

Cloudera
Jonathan is the Tech Lead and Manager of the Apache HBase Team at Cloudera. He is an Apache HBase committer and PMC member, as well as a founder of the Apache Flume project. Jonathan has an M.S. in Computer Science from University of Washington and also has an M.S. and a B.S. in Electrical and Computer Engineering from Carnegie Mellon University.

Session(s):
HBase and Spark: Leveraging your Non-Relational Datastore in Batch and Streaming Applications
April 13, 2016, 15:00 - 15:40
Liffey Hall 1
Julian Hyde

Hortonworks
Julian Hyde is an expert in query optimization, in-memory analytics, and streaming. He is PMC chair of Apache Calcite, the query planning framework behind Hive, Drill, Kylin and Phoenix. He was the original developer of the Mondrian OLAP engine. He is an architect at Hortonworks.

Session(s):
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
April 13, 2016, 12:20 - 13:00
Wicklow Hall 2B

Planning with Polyalgebra: Bringing Together Relational, Complex and Machine Learning Algebra
April 14, 2016, 14:10 - 14:50
Wicklow Hall 1
Petr Igrevski

eBay
Long experience in the data and information management, gave me a great opportunity to look at how any developer could make his data processing applications easier, without sacrificing the requirements to do it, in the secure and scalable manner. As an architect, I work in different areas related to data processing. These include development of the data processing platform, security, and analytics cloud infrastructure.

Session(s):
Tailored for Spark
April 14, 2016, 12:20 - 13:00
Liffey B
Burak Isikli

Turkcell
Burak IŞIKLI works as a Senior Software Engineer at Turkcell Technology, currently working with Big Data and Oracle technologies, and NoSQL. He's the lead big data developer who is responsible for architectural design and implementation for big data projects. He has a MSc in Computer Science with specialization in social network analysis from Gebze Technology University besides he's also a PhD. candidate in Ozyegin University. He's interested in data science, distributed systems such as Hadoop, Spark and so on. He's also certified R programmer.

Session(s):
How Do You Decide Where Your Customer Was?
April 14, 2016, 16:10 - 16:50
Wicklow Hall 2A
Nikhil Joshi

EMC
In his role as a Consultant Product Manager, Nikhil focuses on enabling Hadoop analytics on EMC’s Cloud Storage and Converged Infrastructure platforms. Previously, as a Software Engineer, Nikhil developed massive distributed systems and helped wrangle data at companies like Informatica, Yahoo! and Pivotal. He holds a Masters degree in Computer Science from the State University of New York at Stony Brook. On a personal front, he enjoys camping, craft beer and re-runs of Frasier.

Session(s):
Hadoop Everywhere: Geo-Distributed Storage for Big Data
April 14, 2016, 15:00 - 15:40
Wicklow Hall 2B
Karthik Karuppaiya

Symantec
Karthik Karuppaiya is a Sr. Engineering Manager / Architect working for the Cloud Platform Engineering team at Symantec. He has been working on hadoop based systems since 2010. As part of the Cloud Platform Engineering group at Symantec, he is helping product teams in re-architecting/re-engineering the legacy systems to move to big data cloud. Currently he is focussed on building 100s of Petabyte scale Data Lake to help move Symantec to next generation analytics engine.

Session(s):
On-Demand HDP Clusters using Cloudbreak and Ambari
April 13, 2016, 15:00 - 15:40
Liffey A
Moon soo Lee

NFLabs
LeeMoonSoo is a creator for Apache Zeppelin (incubating) and a Co-Founder, CTO at NFLabs. For past few years he has been working on bootstrapping Zeppelin project and it’s community. His recent focus is growing Zeppelin’s community and getting adoptions.

Session(s):
Apache Zeppelin, Helium and Beyond
April 14, 2016, 14:10 - 14:50
Liffey Hall 2
Yang Li

Kyligence Inc.
Yang Li is a big data architect and engineer focus on analytics technologies. He is a PMC member of Apache Kylin and the co-founder of Kyligence, a startup that provides intelligent analytics platform at massive scale data. Yang is one of the creators of Apache Kylin and has been working on Kylin since 2014 when he was in eBay, acting as tech leader and architect. Prior to that, Yang has worked eight years at IBM and two years at Morgan Stanley. At IBM, Yang was an Outstanding Technical Achievement Award winner, and was a tech lead of InfoSphere BigInsights, responsible for Hadoop open source infrastructure. At Morgan Stanley, Yang was a Vice President, responsible for global regulatory reporting infrastructure.

Session(s):
The Evolution of Apache Kylin: Realtime and Plugin Architecture in Kylin2
April 14, 2016, 16:10 - 16:50
Liffey Hall 2
Piotr Lusakowski

deepsense.io
Piotr Lusakowski is a Senior Software Engineer at deepsense.io. He graduated from the Faculty of Mathematics, Informatics and Mechanics at University of Warsaw. For several years he worked at GG Network - largest Polish instant messenger, where he was in charge of designing and building highly scalable, low-latency distributed systems. Prior to GG Network, he created complex systems protecting copyrighted content in p2p networks.

Session(s):
Cooperative Data Exploration with IPython Notebook
April 13, 2016, 17:00 - 17:40
Wicklow Hall 2B
Yves Mabiala

Thales
Mr. Yves Mabiala is a Big Data Analytics specialist at Thales Communications & Security. He holds an engineering degree in Signal Processing and a master's degree in Probability and Statistics. He has been involved in many projects dealing with predictive maintenance, credit card fraud or cyber-security threat detection. His interests cover unsupervised/supervised learning, visual analytics and Big Data.

Session(s):
Deep Recurrent Neural Networks for Sequence Learning in Spark
April 14, 2016, 12:20 - 13:00
Liffey Hall 1
Simon Maby

Octo Technology
Simon is working at Octo Technology as a Data Scientist and Big Data architect. His missions consist of designing Data Lakes architectures based on the Hadoop Ecosystem, helping companies to define a Big Data strategy, or training predictive models during short agile projects. He has experience on technologies such as Spark, Storm, Kafka or Redis by building custom real-time analytics. Simon also created a three days Data Science training at Octo, teaching all the essentials about machine learning. He is deeply interested into organizational models and methodologies that can help companies to become more Data driven.

Session(s):
Practical Advice to Build a Data Driven Company
April 14, 2016, 15:00 - 15:40
Wicklow Hall 1
Ted Malaska

Cloudera Inc
Ted Malaska has worked on close to 60 clusters for over two- to three-dozen clients with over hundreds of use cases. He has 18 years of professional experience working for startups, the U.S. government, a number of the world’s largest banks, commercial firms, bio firms, retail firms, hardware appliance firms, and the largest non-profit financial regulator in the U.S. Ted has architecture experience across topics such as Hadoop, Web 2.0, Mobile, SOA (ESB, BPM), and big data. He is a regular committer to Flume, Avro, Pig, and YARN.

Session(s):
Ingest and Stream Processing - What Will You Choose?
April 13, 2016, 14:10 - 14:50
Liffey Hall 2

HBase and Spark: Leveraging your Non-Relational Datastore in Batch and Streaming Applications
April 13, 2016, 15:00 - 15:40
Liffey Hall 1
Arun Manoharan

eBay
Arun Manoharan is a Sr Product manager at eBay responsible of building Hadoop Big data platforms. He is also a PMC member for Apache Eagle. Prior to eBay Arun was a product manager at IBM Data Exporer and Vivisimo helping develop products to explore big data. His interests are in open source data platforms and product management.

Session(s):
Apache Eagle - Monitor Hadoop in Real-time
April 14, 2016, 15:00 - 15:40
Liffey Hall 2
Hessel Miedema

Capgemini
Hessel is a big data architect with 10 years of experience in Big Data and Data Warehousing. After his Master's degree in Applied Physics he started as a software engineer developing dataflows in large enterprises. Recently he designed solutions for large government agencies and global enterprises that focussed on moving away from traditional data warehousing and into the modern data architecture.

Session(s):
No Time to Waste: From Data Warehousing to Modern Data Architecture in 4 Easy Sprints
April 13, 2016, 15:00 - 15:40
Wicklow Hall 2A
Adam Morton

Admiral Insurance
Adam Morton is the Enterprise Data Architect for Admiral Insurance in the UK. With over 10 years' experience in Data Warehousing, Business Intelligence, Analytics arena (and more recently Big Data!), he is responsible for the strategic vision, data integration and architecture to creatively meet the needs of an exciting and data-driven business.

Session(s):
Telematics with Hadoop and Nifi
April 14, 2016, 14:10 - 14:50
Liffey A
Raj Mukherjee

Zurich Insurance
I have over 11 yrs. experience managing complex global programs and delivering high-performance data management solutions from Design to implementation. I am a high-energy leader known for quickly solving problems and delivering results, skilled in building teams and operating models. I am currently driving Architecture, design and delivery of a Big data platform for Zurich insurance which aim to transforming data management and exploitation capabilities with in the organisation.

Session(s):
Using a Data Lake at the Core of a Life Assurance Business: Solutions for Data in one Place and Putting the Customer First
April 13, 2016, 12:20 - 13:00
Liffey A
Chris Murphy

Zurich Life Insurance , UK
Having worked as an IT professional for the last 15 years in the financial services industry, I have a strong history of delivering architectural solutions in a complex technical landscape. I am currently working as the lead Solution Architect on the Zurich UK Life Data Lake (the first key deliverable on Zurich's Big Data Platform) shaping the patterns and frameworks that will be used to realise our strategies, both locally and globally.

Session(s):
Using a Data Lake at the Core of a Life Assurance Business: Solutions for Data in one Place and Putting the Customer First
April 13, 2016, 12:20 - 13:00
Liffey A
Chris Nauroth

Hortonworks
Chris Nauroth is a software engineer on the HDFS team at Hortonworks. He is an Apache Hadoop committer and PMC member, an Apache ZooKeeper committer, an Apache Yetus committer and PMC member, an Apache Incubator PMC member and an Apache Software Foundation member. His most significant contributions include HDFS operational improvements, Hadoop Windows compatibility and HDFS ACLs. He also helped shepherd the Apache contribution of WASB, a Hadoop-compatible file system backed by Azure Storage. Prior to joining Hortonworks, Chris deployed and maintained Disney's Hadoop infrastructure and developed web services and MapReduce jobs.

Session(s):
HDFS: Optimization, Stabilization and Supportability
April 13, 2016, 14:10 - 14:50
Liffey B
Madhan Neethiraj

Hortonworks
Madhan Neethiraj is an Apache committer and currently works at Hortonworks as Engineering Lead in Enterprise Security Team. He is one of the main contributors/author for various Apache Ranger features including audit framework, stack model and tag-based policies. Prior to Hortonworks, Madhan was at Oracle in development of security access management suite, governance and real-time fraud detection/prevention products. Prior to Oracle, he was with Bharosa Inc. responsible for the development of real-time fraud detection solution for Financial Institutes, HealthCare and eCommerce.

Session(s):
Is Your Enterprise Data Lake Metadata-Driven AND Secure?
April 13, 2016, 11:30 - 12:10
Liffey Hall 2
Gilles Noisette

Hewlett Packard Enterprise
Gilles Noisette is a Master Solution Architect at the HPE EMEA Solution Innovation center. He is the technical lead of the HPE EMEA Big Data Center of Excellence, promoting Big Data solutions, and doing the link between HPE Divisions, HPE labs and the field, helping to explore and develop innovative solutions. He is presenting HPE technologies in events like the Hadoop Summit and Strata Hadoop World. Since twenty years that he is with HP, Gilles is working on emerging technologies.

Session(s):
Empower Data-Driven Organizations with HPE and Hadoop
April 13, 2016, 14:10 - 14:50
Liffey A
Owen O'Malley

Hortonworks
Owen O'Malley is a co-founder and technical fellow at Hortonworks, which develops the completely open source Hortonworks Data Platform (HDP). Owen has been working on Hadoop since the beginning of 2006 at Yahoo, was the first committer added to the project, and used Hadoop to set the Gray sort benchmark in 2008 and 2009. In the last 9 years, he has been the architect of MapReduce, Security, and now Hive. Recently he has been driving the development of the ORC file format and adding ACID transactions to Hive.

Session(s):
Protecting Enterprise Data In Hadoop
April 14, 2016, 11:30 - 12:10
Liffey Hall 1
Chris Ottinger

Telstra
Chris Ottinger leads Big Data technology strategy within Telstra's Chief Technology Office. He is responsible for integrating emerging data and analytics technologies into Telstra's enterprise landscape. Prior to his role with the Big Data team, Chris held various positions across Telstra's data and analytics portfolio, including both development and operations. Chris holds a B.S. in Comp Science (H. Honors) from the Georgia Institute of Technology and an MBA from the Melbourne Business School.

Session(s):
Telstra's Tale of Hadoop in the Enterprise
April 14, 2016, 12:20 - 13:00
Wicklow Hall 1
Dhabaleswar K (DK) Panda

The Ohio State University
Dhabaleswar K (DK) Panda is a Professor and University Distinguished Scholar of Computer Science and Engineering at the Ohio State University. He has published over 350 papers in the area of high-end computing and networking. The RDMA-enabled Apache Hadoop libraries, designed and developed by his team to exploit HPC technologies under the High-Performance Big Data (HiBD) project (http://hibd.cse.ohio-state.edu) are currently being used by more than 135 organizations in 20 countries. More than 13,000 downloads of these libraries have taken place from the project's site. He is an IEEE Fellow. More details on Prof. Panda are available at http://www.cse.ohio-state.edu/~panda

Session(s):
Accelerating Apache Hadoop through High-Performance Networking and I/O Technologies
April 13, 2016, 17:00 - 17:40
Wicklow Hall 1
Pat Patterson

StreamSets
Pat Patterson has been working with Internet technologies since 1997, building software and working with communities at Sun Microsystems, Huawei, Salesforce and StreamSets. At Sun, Pat was the community lead for the OpenSSO open source project, while at Huawei he developed cloud storage infrastructure software. Part of the developer evangelism team at Salesforce, Pat focused on identity, integration and the Internet of Things. Now community champion at StreamSets, Pat is responsible for the care and feeding of the StreamSets open source community.

Session(s):
Ingest and Stream Processing - What Will You Choose?
April 13, 2016, 14:10 - 14:50
Liffey Hall 2
Michael Peterson

Rocana
Michael Peterson is a Platform Software Engineer at Rocana working on large-scale data processing, search and analysis of data collected streaming in large data centers. He has 15 years experience in a variety of domains including bioinformatics, medical data processing, high volume ad-serving, and distributed systems.

Session(s):
Real-time Search on Terabytes of Data Per Day: Lessons Learned
April 13, 2016, 12:20 - 13:00
Wicklow Hall 1
Bill Porto

RedPoint Global Inc.
Bill Porto is an expert in applying computational intelligence to solve real-world problems across various problem domains. As senior analytics engineer at RedPoint Global, he develops automated business optimization software that incorporates evolutionary optimization, neural networks, and a host of other non-traditional machine learning techniques. An applied mathematician by trade, Bill has created adaptive solutions to dynamic problems for resource allocation, pattern recognition, drug discovery, and logistics scheduling. Before RedPoint, he was president of Natural Selection, Inc. where he received the 2010 FDA Honor Award for his work on their PREDICT automated risk-assessment system.

Session(s):
Machine Learning in Big Data – Look Forward or Be Left Behind
April 13, 2016, 16:10 - 16:50
Wicklow Hall 1
Sanjay Radia

Hortonworks
Sanjay is founder and chief architect at Hortonworks, and an Apache Hadoop committer and member of the Apache Hadoop PMC. Prior to co-founding Hortonworks, Sanjay was the chief architect of core-Hadoop at Yahoo and part of the team that created Hadoop. In Hadoop he has contributed to several areas including HDFS, MapReduce schedulers, Yarn's design, high availability, compatibility, etc. He has also held senior engineering positions at Sun Microsystems and INRIA, where he developed software for distributed systems and grid/utility computing infrastructures. Sanjay has a PhD in Computer Science from the University of Waterloo in Canada.

Session(s):
Evolving HDFS to a Generalized Distributed Storage Subsystem
April 14, 2016, 11:30 - 12:10
Wicklow Hall 2A
Stefan Radtke

EMC
Stefan has spent 20+ years employed with IT-industry leaders including IBM and EMC. Currently, he spearheads a team of CTOs focused on Multi-Protocol Data Lake Solutions where businesses can create large file and object storage solutions with in-place data analytics using Hadoop. Stefan has project experience from various engagements ranging from Cloud providers, ISPs, Telco, Video Surveillance, Media Entertainment, Finance, EDA, Healthcare and Manufacturing. Author of many technical articles and a regular speaker at events, Stefan also writes technical blogs around Scale Out NAS solutions, Hadoop and backups at scale. He holds the Dipl.-Ing. and Dr.-Ing. degree in Information Technology.

Session(s):
Tame That Beast: How to Bring Operations, Governance and Reliability to Hadoop
April 13, 2016, 12:20 - 13:00
Liffey Hall 1
Raghu Ramakrishnan

Microsoft Corporation

Session(s):
Scale out Resource Management at Microsoft Using Apache YARN
April 13, 2016, 12:20 - 13:00
Liffey B
Ram Ramanathan

Google
Ram Ramanathan is a Product Manager on Google Cloud Platform. He received his undergrad in Computer Engineering from Northwestern University and an MBA from the University of Chicago. Since then Ram has worked for multiple Fortune 500 companies such as PwC, Microsoft and General Electric leading teams that develop cloud based data & analytics services. Most recently Ram has been working on Google's Machine Learning efforts.

Session(s):
TensorFlow: Large-Scale Deep Learning For Intelligent Computer Systems
April 13, 2016, 17:00 - 17:40
Liffey B
Noa Resare

Spotify
Noa Resare is a senior engineer and the Spotify Free Software Ombudsman. Noa is an accomplished public speaker has been giving talks at conferences such as Cloud Open, DevOps days and LinuxCon on a wide variety of technical subjects.

Session(s):
Migrating Hundreds of Pipelines in Docker Containers
April 13, 2016, 17:50 - 18:30
Liffey B
Villu Ruusmann

Openscoring OÜ
Villu Ruusmann is the founder and CTO of a European startup company Openscoring, which provides open source Java technologies for the representation and productionalization of machine learning models. These technologies enable data scientists to effortlessly scale their ideas from the smallest laboratory experiments to the largest industrial Hadoop deployments.

Session(s):
Production Grade Data Science for Hadoop
April 14, 2016, 12:20 - 13:00
Wicklow Hall 2A
John Scheibmeir

eBay
John wants to see analytics make the world a better place (for humans). Currently John works to continue growing eBay’s analytic capabilities. His focus is on expanding capabilities and improving performance at scale within eBay’s analytical data platforms. John seeks to grow the role of analytics by making the underlying systems more powerful, approachable, and performant.

Session(s):
Tailored for Spark
April 14, 2016, 12:20 - 13:00
Liffey B
Sidharta Seethana

Hortonworks
Sidharta is a software engineer at Hortonworks. He works on the YARN team, focussing on bringing new kinds of workloads to YARN. Prior to joining Hortonworks, Sidharta spent 10 years at Yahoo! Inc., working on a variety of large scale distributed systems for core platforms/web services, search and marketplace properties, developer network and personalization.

Session(s):
Apache Hadoop YARN and the Docker Container Runtime
April 14, 2016, 12:20 - 13:00
Liffey A
Venkatesh Sellappa

Teradata UK Limited
My primary interest and technical experience lies in: Designing and developing architecture and fluid proof of concept solutions using a variety of approaches, in line with performance and resource constraints. Technologies dealing with horizontally scalable, high performance server solutions including – BigData , Horizontal Grid computing, In-memory K/V stores, NoSQL. Applying statistical approaches to Data analysis and Design.

Session(s):
How To: A Beginners Guide to Becoming an Apache Contributor
April 13, 2016, 11:30 - 12:10
Wicklow Hall 1
Mikhail Semeniuk

Shift Technologies
Mikhail is the Director of Data Products at TrueCar, where he focuses on algorithm-driven pricing systems. Prior to, and in the early days of TrueCar, Mikhail was a data scientist, turned ML/algorithms engineer, and most recently has taken a product role on the data and pricing side of the business. Mikhail studied Mathematics and Economics at the University of Minnesota, which inspired his mission to bridge the gap between data science and engineering. He grew up in Minneapolis, lived in Venice, CA for 6 years where he pursued skydiving and hopes of being a decent surfer, and now resides in the Bay Area.

Session(s):
MLLeap: Or How to Productionize Data Science Workflows using Spark
April 14, 2016, 11:30 - 12:10
Liffey A
Vishrut Shah

EMC
Vishrut is the Director of Engineering, heading the engineering team for the Elastic Cloud Storage (ECS) system at EMC. His focus includes distributed storage engine and storage services supporting object, file and HDFS protocols with ECS. Before joining EMC, Vishrut worked at Microsoft on SQL Azure, specifically in the area of fail over and disaster recovery.

Session(s):
Hadoop Everywhere: Geo-Distributed Storage for Big Data
April 14, 2016, 15:00 - 15:40
Wicklow Hall 2B
Saisai Shao

Hortonworks
Saisai Shao works as a member of technical staff at Hortonworks, mainly focused on Spark area, especially Spark core, Spark on Yarn and Spark Streaming. He is an active Apache Spark contributor. Prior to Hortonworks, he was a software engineer at Intel working on performance tuning and optimization of Hadoop and Spark.

Session(s):
Running Spark in Production
April 13, 2016, 11:30 - 12:10
Liffey A
Sergey Shelukhin

Hortonworks
Sergey Shelukhin is an active Hive contributor, Hive PMC and HBase committer at Hortonworks with 7 years of distributed systems expertise. He is currently working to help deliver next-generation performance and SQL capabilities to Apache Hive.

Session(s):
LLAP: Sub-Second Analytical Queries in Hive
April 14, 2016, 17:00 - 17:40
Liffey A
Tomer Shiran

Dremio
Tomer Shiran is the CEO and co-founder of Dremio. He is also the founder of the open source Apache Drill project. Prior to Dremio, he was VP product at MapR, where he was responsible for product strategy, roadmap and new feature development. As a member of the executive team, Tomer helped grow the company from five employees to over 300 employees and 700 enterprise customers. Prior to MapR, Tomer held product management and engineering positions at Microsoft and IBM Research. He holds an MS from Carnegie Mellon and a BS from Technion - Israel Institute of Technology.

Session(s):
The Heterogeneous Data Lake: Analytics in the World of Specialized Datastores
April 13, 2016, 15:00 - 15:40
Wicklow Hall 2B
Vinay Shukla

Hortonworks
Vinay Shukla is the Director of Product Management for Spark & Data Science at Hortonworks. Vinay is a veteran of enterprise software. Previously, Vinay has worked as Product Manager, Developer, and Security Architect. When not in front of a computer, Vinay enjoys being on a Yoga mat or on a hiking trail.

Session(s):
Running Spark in Production
April 13, 2016, 11:30 - 12:10
Liffey A
Sumeet Singh

Yahoo!, Inc.
Sumeet Singh is a senior director of products for cloud and big data platforms at Yahoo. In his current role, he leads the Hadoop products team responsible for both Apache open source contributions and Yahoo projects. Sumeet is responsible for introducing several new multitenant cloud services at Yahoo that are now the cornerstone of most of Yahoo's next-generation consumer product offerings and user experiences. Sumeet has 16 years of experience in product management and software development in the technology industry. He earned an MBA from the UCLA Anderson School of Management and an MS from Rensselaer Polytechnic Institute, NY.

Session(s):
Hadoop Platform at Yahoo: A Year in Review
April 14, 2016, 12:20 - 13:00
Liffey Hall 2
Prabhjyot Singh

Hortonworks
Coming...

Session(s):
Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
April 14, 2016, 15:00 - 15:40
Liffey Hall 1
Don Smith

Big Fish Games
Don Smith has been working in data warehousing and business intelligence for over 13 years. He has built systems on Oracle, MySQL, SQL Server, Sybase IQ, IBM Netezza and most recently Hadoop. Don's career has been in e-commerce, telecommunications and now in media and entertainment. Over the last 2 years he has been working on standing up the logical data warehouse by augmenting the existing IBM Netezza Enterprise Data Warehouse with an Hadoop Cluster using the IBM Big Insights ODP distribution.

Session(s):
It's Not the Size of Your Cluster, It's How You Use It
April 13, 2016, 11:30 - 12:10
Wicklow Hall 2A
Enis Soztutar

Hortonworks
Enis Soztutar is a committer and PMC member of Apache HBase, Phoenix and Hadoop projects and a member of the Apache Software Foundation. He has been using and developing Hadoop ecosystem projects since 2007. He is currently working as a lead at Hortonworks, HBase engineering.

Session(s):
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
April 14, 2016, 16:10 - 16:50
Liffey Hall 1
Carl Steinbach

LinkedIn
Carl Steinbach is a Senior Staff Software Engineer at LinkedIn where he leads the Hadoop Platform Team. He is also a member of LinkedIn's Technology Leadership Group and its Open Source Committee. Before joining LinkedIn Carl was an early employee at Cloudera. He is an ASF member and former PMC Chair of the Apache Hive Project.

Session(s):
The Past, Present, and Future of Hadoop at LinkedIn
April 13, 2016, 16:10 - 16:50
Liffey B

Scaling out to 10 Clusters, 1000 Users, and 10,000 Flows: The Dali Experience at LinkedIn
April 14, 2016, 11:30 - 12:10
Liffey B
Casey Stella

Hortonworks
I am a principal architect focusing on Data Science in the consulting organization at Hortonworks. In the past, I've worked as an architect and senior engineer at a healthcare informatics startup spun out of the Cleveland Clinic, as a developer at Oracle and as a Research Geophysicist in the Oil & Gas industry. Before that, I was a graduate student in Math at Texas A&M. I specialize in writing software and solving problems where there are either scalability concerns due to large amounts of traffic or large amounts of data. I have a particular passion for data science

Session(s):
Outlier Analysis and Anomaly Detection for Sensors with Spark and Storm
April 13, 2016, 17:00 - 17:40
Liffey Hall 2

Using Natural Language Processing on Non-Textual Data with MLLib
April 14, 2016, 15:00 - 15:40
Wicklow Hall 2A
Nishant Thacker

Microsoft
Nishant Thacker is a Technical Product Manager for the Big Data Analytics services from Microsoft. With more than 8 years of experience working with on Analytics platform, he's been a speaker at many leading conferences. His work involves working with Engineering and the Field to enable a smooth launch pad for new service and offerings in the Big Data space, and also making sure there is ample technical acumen for partners to have a seamless implementation cycle. He also is a true evangelist of the power of analytics.

Session(s):
Big Data Application Architectures - IoT
April 14, 2016, 15:00 - 15:40
Liffey B
Marcelo Vanzin

Cloudera
Marcelo is a Software Engineer at Cloudera and a committer to the Apache Spark project.

Session(s):
Securing Spark on Production Hadoop Clusters
April 13, 2016, 17:50 - 18:30
Liffey Hall 1
Varun Vasudev

Hortonworks
Vinod Kumar Vavilapalli (reachable at @tshooter) is the Hadoop YARN and MapReduce guy at Hortonworks. He is a long term Hadoop contributor at Apache, Hadoop committer and a member of the Apache Hadoop PMC. He has a Bachelors degree from Indian Institute of Technology Roorkee in Computer Science in Engineering. He has been working on Hadoop for nearly years. Straight out of college, he joined the Hadoop team at Yahoo! Bangalore where he worked on HadoopOnDemand, Hadoop-0.20, CapacityScheduler, and Hadoop security, before Hortonworks happened. He is passionate about using computers to change the world for better, bit by bit.

Session(s):
Apache Hadoop YARN: Past, Present and Future
April 13, 2016, 11:30 - 12:10
Liffey B
Thomas Vial

Octo Technology
Thomas is a Senior Architect at OCTO Technology. He’s had the chance to help many client organizations in their architecture journeys (data, integration, development, BI, …). He is part of the Big Data & Analytics team at OCTO, where the fun stuff happens: data science, machine learning, data-driven strategy and - of course - Hadoop. There, the line between consulting and R&D is somewhat blurry, that’s what makes those topics so interesting. He is also a Hortonworks trainer.

Session(s):
Exploring Titan and Spark GraphX for Analyzing Time-Varying Electrical Networks
April 13, 2016, 15:00 - 15:40
Wicklow Hall 1
Hollin Wilkins

Truecar
Hollin Wilkins is a lead ML engineer at TrueCar. He has been working on machine learning infrastructure since 2015, focusing on platforms for data scientists and engineers to rapidly iterate on ML algorithms and pipeline deployments. Previously he worked in the games industry at LindenLab on Blocksworld and Versu, helping to build everything from game UI, to servers, to custom logic languages that drive user experiences. He holds a degree inBiology from Cornell University and is currently trying to bring the tech industry to Ithaca, NY.

Session(s):
MLLeap: Or How to Productionize Data Science Workflows using Spark
April 14, 2016, 11:30 - 12:10
Liffey A
Yong Zhang

eBay
Yong Zhang is Senior MTS Architect at eBay Analytics Data Infrastructure and author of Apache Eagle. He is actively building large scalable and high-performance monitoring framework Eagle to secure Hadoop in realtime. He has worked and led engineering teams for many years on search engine, cloud computing and big data areas etc. His interests are stream processing, distributed computation and framework abstraction.

Session(s):
Apache Eagle - Monitor Hadoop in Real-time
April 14, 2016, 15:00 - 15:40
Liffey Hall 2
Zhan Zhang

Hortonworks
Zhan Zhang is a member of technical staff at Hortonworks, where he works on Apache Spark and Hadoop Ecosystem. He received his BS/MS degree from Fudan University of China and Ph.D in Computer & Information Science & Engineering from University of Florida. His research interests distributed system and large scale machine learning platform, with results published in top journals/conferences, such as MobiCom, INFOCOM, etc.

Session(s):
Bringing HBase Data Efficiently into Spark with DataFrame Support
April 14, 2016, 11:30 - 12:10
Liffey Hall 2
Jianfeng Zhang

Hortonworks
Jeff has 7 years of experience in big data industry. He has used Hadoop since 2009 and is a PIG committer. My past experience is not only in big data infrastructure, but also on the application level of how to leverage these big data tools to get insight from data. Now I work in hortonworks as member of technical staff and mostly focus on Tez & Spark.

Session(s):
Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
April 14, 2016, 15:00 - 15:40
Liffey Hall 1
Adriana Zubiri

IBM
Adriana Zubiri is a Program Director in the Big Data organization at the IBM Toronto Lab. Adriana is responsible for leading the world wide Big SQL engineering team, IBM's exciting technology that extends the power of SQL to the world of Apache Hadoop as part of the IBM BigInsights offering. Adriana is recognized within the industry as an expert in the area of big data and data warehouses, based on her extensive work with clients, her numerous papers and conference presentations.

Session(s):
Surviving The Hadoop Revolution
April 14, 2016, 16:10 - 16:50
Wicklow Hall 1
Zoltán Zvara

Hungarian Academy of Sciences
Zoltán Zvara is a PhD student at the Eötvös Lorand University, Hungary, and a researcher at the Hungarian Academy of Sciences. His focus of research is the execution and scheduling of general-purpose distributed data processing frameworks. Zoltán lead various Spark-related research projects to success in cooperation with well-known industrial partners.

Session(s):
Advanced Execution Visualization of Spark Jobs
April 13, 2016, 14:10 - 14:50
Liffey Hall 1
sponsor purchase
community partners