The Fifth Elephant

When: July 26, 2012, 05:00 PM – 07:00 PM
Where: The Energy & Resources Institute, 4th Main, 2nd Cross, Domlur 2nd Stage, Bangalore – 560 071

by Prashanth Babu

Pig is a high-level platform for creating MapReduce programs that run on Hadoop to analyze Big Data. This two-hour workshop is a live-coding introduction to analyzing Big Data with Pig.

This workshop will include discussion on:

  • Basics of Hadoop
  • Basics of Pig and Pig Latin
  • Pig vs MapReduce
  • Pig vs SQL

And also:

  • Live-coding session on analyzing a large sample data set with Pig
  • Visualizing Pig MapReduce jobs with Twitter Ambrose
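
As a taste of the Pig vs MapReduce comparison, here is a minimal sketch (not taken from the workshop material) of a word count written as explicit map, shuffle and reduce steps. Python stands in for a Java MapReduce job, and the function names and sample lines are hypothetical. The comment at the top shows a roughly equivalent Pig Latin script, which Pig compiles into MapReduce jobs.

```python
from collections import defaultdict

# Roughly equivalent Pig Latin (file name 'input.txt' is an assumption):
#   lines   = LOAD 'input.txt' AS (line:chararray);
#   words   = FOREACH lines GENERATE FLATTEN(TOKENIZE(line)) AS word;
#   grouped = GROUP words BY word;
#   counts  = FOREACH grouped GENERATE group, COUNT(words);
#   DUMP counts;


def map_phase(lines):
    """Map step: emit a (word, 1) pair for every word in every line."""
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def shuffle_phase(pairs):
    """Shuffle step: collect all emitted values under their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: sum the counts collected for each word."""
    return {word: sum(values) for word, values in groups.items()}


def word_count(lines):
    """Chain the three steps, as a MapReduce framework would."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    # Hypothetical sample input, standing in for a file on HDFS.
    sample = ["pig runs on hadoop", "hadoop runs mapreduce jobs"]
    print(word_count(sample))
```

The Pig Latin version states only what to load, group and count, while the hand-written version has to spell out each phase. That difference is the focus of the comparison.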

Requirements

  • Basic understanding of Hadoop, HDFS and MapReduce.
  • Laptop with VMware Player or Oracle VirtualBox installed.
  • Please download the Cloudera Demo VM from ccp.cloudera.com/display/SUPPORT/Cloudera's+Hadoop+Demo+VM
  • Alternatively, a USB flash drive will be distributed containing a VMware image of 64-bit Ubuntu Server 12.04 [Precise Pangolin], with Hadoop, HBase, Sqoop, Hive and Pig installed and configured using Apache Bigtop.

Facilitator bio

Prashanth Babu has 9+ years of experience in software development, predominantly in Java and JavaEE. He works at NTT DATA Global Delivery Services (previously Keane India Pvt. Ltd.) on an R&D initiative on Big Data using the Apache Hadoop ecosystem. He is also an avid Android enthusiast with experience in Android app development. gplus.to/Prashanth

RSVP

The venue has limited capacity. Interested in attending? Please log in. You can use your existing Twitter or Google account, and if you have previously voted on a session proposal or attended a hacknight, you already have a HasGeek account.
