The Fifth Elephant

When: July 26, 2012, 05:00 PM – 07:00 PM
Where: The Energy & Resources Institute, 4th Main, 2nd Cross, Domlur 2nd Stage, Bangalore – 560 071

by Prashanth Babu

Pig is a high-level platform for creating MapReduce programs used with Hadoop for analyzing Big Data. This is a two-hour workshop on intro to Pig. The workshop aims at live-coding introductory session for analyzing Big Data using Pig.

This workshop will include discussion on:

  • Basics of Hadoop
  • Basics of Pig and PigLatin
  • Pig vs MapReduce
  • Pig vs SQL

And also:

  • Live-coding session on Pig for analyzing huge sample data.
  • Checking the visualization of Pig MapReduce Jobs with Twitter Ambrose


  • Basic understanding of Hadoop, HDFS and MapReduce.
  • Laptop with VMware Player or Oracle VirtualBox installed.
  • Please download Cloudera Demo VM from's+Hadoop+Demo+VM
  • Alternatively, a USB flash drive will be distributed with a VMware image of 64 bit Ubuntu Server 12.04 [Precise Pangolin] with Hadoop, HBase, Sqoop, Hive and Pig installed and configured using Apache Bigtop.

Facilitator bio

Prashanth Babu has 9+ years of experience in software development predominantly in Java and JavaEE. He is working with NTT DATA Global Delivery Services (previously Keane India Pvt. Ltd.) on an R & D initiative on Big Data using Apache Hadoop Ecosystem. Also, an avid Android enthusiast with experience in Android App Development.


The venue has limited capacity. Interested in attending? Please login. You can use your existing Twitter or Google account, and if you have previously voted on a session proposal or attended a hacknight, you already have a HasGeek account.

Login with Twitter, Google or HasGeek id