Storage & Analytics. Every stone, unturned.


The Fifth Elephant 2013 brings engineering and business professionals under one roof to discuss technology and application challenges involved in managing and analyzing large volumes of data.

Technology Agenda: Gain exposure to storage and analytics technologies. Build a better understanding of why different tools and approaches work for some domains. Interact with decision-makers and product managers to understand how these technologies fit into business processes and goals.

Business Agenda: Understand how technology is applied to target customers better, build products, and improve business processes and decision-making. Interact with technologists and analysts to learn how data collection and processing techniques work in different sectors.

Apart from talks, there will be open house sessions, panel discussions and plenty of opportunities to interact with some of the best minds in technology and business.


The conference covers two major themes: storage and analytics. There will be separate engineering and business tracks covering each theme.

Engineering track on Storage

Learn about databases, OLTP, NoSQL, data integration and warehousing. Find out how developers and tech teams deal with performance, speed, scaling and other challenges associated with storage. The storage track will lay the foundation for a better understanding of data processing and analytics.

Business track on Storage

Understand decision support issues such as the costs of storage infrastructure, maintenance, benefits and limitations. Panel discussions and interactive Q&A sessions between solution architects, decision-makers and engineers will further clarify how the choice of storage technologies affects business processes and goals.

Analytics track for engineers and analysts

Find out about metrics, tools and statistical techniques, and how they apply to different use cases. This track is for the algorithmically curious. It also covers hacky territory such as applications and technologies for better data collection and insight gathering. Case studies of platforms and solutions built for data collection, processing and analytics will be showcased here.

CIO and Product Managers' track on Analytics

How data can be used for decision-making is a pressing question for most organizations. This track will consist of talks around how data has been used effectively for building products, leveraging insights and developing in-house technology capabilities for better business processes.


Friday, July 12, 2013

Time Auditorium 1 (Storage: Technology) Auditorium 2 (Storage: Decision Support)
09:00 – 10:00 Registrations and breakfast
10:00 – 10:15 Introduction to HasGeek and The Fifth Elephant
10:15 – 11:00

Keynote: Agility and Innovation vs IT: how new data platforms can overcome this never-ending struggle (sponsored)

Edouard Servan-Schreiber, 10gen

11:00 – 11:30 Tea break
11:30 – 12:15

Latency and Fault tolerance in OLTP @1.5 billion/day service calls

Regunath Balasubramanian, Flipkart

Cloud based low cost, low maintenance, scalable data platform

Apoorva Gaurav, Myntra

12:15 – 13:00

Analyzing Terabytes of Data with Google BigQuery

Chandramouli Mahadevan, Google

HOWTO run a hadoop cluster on a laptop

Gopal Vijayaraghavan, Hortonworks

13:00 – 14:00 Lunch
14:00 – 14:45

15 Billion value at risk computations in 187 milliseconds

Abinasha Karana, Bizosys Technologies

Strategic advantages of MongoDB (sponsored)

Edouard Servan-Schreiber, 10gen

14:45 – 15:30

SolrCloud and NoSQL

Anshum Gupta, LucidWorks

Neo4j Graphs: What, When, How

Andreas Kollegger, Neo Technology

15:30 – 16:00 Tea break
16:00 – 16:45

Workflow Schedulers: The Heart Beat of a Big Data Stack

Rajat Venkatesh, Qubole

Flash Talks


16:45 – 17:30

Closing Keynote: What Happens When Firefox Crashes?

Erik Rose, Mozilla

19:00 – 23:00 Party at Bak Bak Bar co-sponsored by Active Sphere and Neo4j

Saturday, July 13, 2013

Time Auditorium 1 (Analytics: Technology) Auditorium 2 (Analytics: Metrics and Tools)
09:00 – 10:00 Registrations and breakfast
10:00 – 10:15 Summary of day 1 and introduction to day 2
10:15 – 11:00

Extracting consumer trends in real time using 100 billion tweets

Pankaj Risbood, WalmartLabs

Evaluating SSD Performance for Databases Handling Real-Time Big Data

Brian Bulkowski, Aerospike

11:00 – 11:30 Tea break
11:30 – 12:15

Building large scale Analytics Platform

Prabhu Prakash Ganesh, MEDIA iQ Digital

MapReduce and the 'Art of Thinking Parallel'

Shailesh Kumar, Google

12:15 – 13:00

Analytics using Hadoop ecosystem on AWS (sponsored)

Rajat Venkatesh, Qubole

Product Demos

To be announced

13:00 – 14:00 Lunch
14:00 – 14:45

Visualising networks

Anand S, Gramener

Similar entity detection in large data

Arthi Venkataraman, Wipro Technologies

14:45 – 15:30

Interactive analysis of data live, using Pandas, Matplotlib and IPython

Lakshman Prasad

Julia: A fresh approach to technical computing and data science

Viral Shah, JuliaLang

15:30 – 16:00 Tea break
16:00 – 16:45

A Billion Snapshots- Principles and Processes in the Census of India

Varsha Joshi, Census Delhi, Registrar General of India

Unlocking the Potential of Data for Everyday Developers and Product Managers

Karthik Kastury, PayPal India (eBay Inc.)

16:45 – 17:30

Smart Analytics in Smartphones

Satnam Singh, Samsung Research

Co-occurrence Analytics: A versatile framework for finding interesting needles in crazy haystacks!

Shailesh Kumar, Google

17:30 – 18:15 Feedback and Closing