The preliminary day of The Fifth Elephant 2013 - 11th July - provides the best opportunity to dive into various languages and tools used for data analysis and visualization, and understand how they compare with each other.
Learn about open source frameworks and tools such as Pig, Hive, Hadoop; commercial solutions (sponsored), programming languages such as R, databases and visualization techniques and tricks through slides, demonstrations and partially hands-on activities. There will be ample opportunity to clarify doubts regarding the limitations and possibilities of these tools and frameworks.
This hand-on session will be a crash course on using the Neo4j graph database. Assuming no prior knowledge, participants will learn about working with Neo4j through a progressive series of exercises.
Participants will use Neo4j's query language - Cypher - to:
create a simple graph
import a larger sample graph
run basic queries to get known data
discover new data with graph patterns
By the end of this workshop, attendees will leave with a foundation of how to begin working with Neo4j, ready to explore further with language-specific drivers.
Andreas Kollegger
Neo Technology
Andreas has been part of the Neo4j community since having his own graph epiphany while working on medical informatics in Zambia. He joined as an early member of core engineering, and has now taken on the role of Product Experience Designer, responsible for maturing that fantastic codebase into an industrial strength product.
Big Data, Real-time Processing and Storm
This workshop explains the basics of Storm and its salient features. The instructor will discuss how Storm is similar / different from Hadoop and will run through the source of WordCount example and its demo. Finally, the instructor will show how Hadoop and Storm together can help process Big Data seamlessly.
Participants will learn and understand:
Concepts and salient features of Storm
How Storm can be used for processing Big Data and in real-time
Storm through a simple example
Storm vs. Hadoop
Real-time analysis of tweets using Storm
Prashanth Babu
NTT DATA
Prashanth Babu is a Research Engineer with NTT DATA. He is currently working on an R & D initiative on Big Data using Apache Hadoop Ecosystem and he is also Cloudera Certified Developer for Apache Hadoop [CCDH].