122135199-hadoop.pdf

  • Uploaded by: Lokesh Kumar
  • December 2019
  • PDF

Apache Hadoop

Introduction

The Motivation For Hadoop
  • Problems with Traditional Large-Scale Systems
  • Requirements for a New Approach

Hadoop: Basic Concepts
  • An Overview of Hadoop
  • The Hadoop Distributed File System
  • Hands-On Exercise
  • How MapReduce Works
  • Anatomy of a Hadoop Cluster
  • Other Hadoop Ecosystem Components

Writing a MapReduce Program
  • The MapReduce Flow
  • Examining a Sample MapReduce Program
  • Basic MapReduce API Concepts
  • The Driver Code
  • The Mapper
  • The Reducer
  • Hadoop’s Streaming API
  • Using Eclipse for Rapid Development
  • Hands-On Exercise
  • The New MapReduce API

Integrating Hadoop Into The Workflow
  • Relational Database Management Systems
  • Storage Systems
  • Importing Data from RDBMSs With Sqoop
  • Hands-On Exercise
  • Importing Real-Time Data with Flume
  • Accessing HDFS Using FuseDFS and Hoop

Delving Deeper Into The Hadoop API
  • More about ToolRunner
  • Testing with MRUnit
  • Reducing Intermediate Data With Combiners
  • The configure and close Methods for Map/Reduce Setup and Teardown
  • Writing Partitioners for Better Load Balancing
  • Hands-On Exercise
  • Directly Accessing HDFS
  • Using the Distributed Cache

Common MapReduce Algorithms
  • Sorting and Searching
  • Indexing
  • Machine Learning With Mahout
  • Term Frequency – Inverse Document Frequency
  • Word Co-Occurrence
  • Hands-On Exercise

Using Hive and Pig
  • Hive Basics
  • Pig Basics
  • Hands-On Exercise

Practical Development Tips and Techniques
  • Debugging MapReduce Code
  • Using LocalJobRunner Mode For Easier Debugging
  • Retrieving Job Information with Counters
  • Logging
  • Splittable File Formats
  • Determining the Optimal Number of Reducers
  • Map-Only MapReduce Jobs
  • Hands-On Exercise

More Advanced MapReduce Programming
  • Custom Writables and WritableComparables
  • Saving Binary Data Using SequenceFiles and Avro Files
  • Creating InputFormats and OutputFormats
  • Hands-On Exercise

Joining Data Sets in MapReduce
  • Map-Side Joins
  • The Secondary Sort
  • Reduce-Side Joins

Graph Manipulation in Hadoop
  • Introduction to Graph Techniques
  • Representing Graphs in Hadoop
  • Implementing a Sample Algorithm: Single Source Shortest Path

Creating Workflows With Oozie
  • The Motivation for Oozie
  • Oozie’s Workflow Definition Format
  • Hands-On Exercise

Partners :

www.facebook.com/ducateducation

NOIDA
A-43 & A-52, Sector-16, Noida - 201301, (U.P.) INDIA Ph. : 0120-4646464 M. : 09871055180

GREATER NOIDA
E - 35, SITE - 4, Near Swarna Nagari, Adjacent J.P. Golf Course, Greater Noida (U.P.) Ph. : 0120-4345190-91-92 to 97 M. : 09899909738, 09899913475

GHAZIABAD
1, Anand Industrial Estate, Near ITS College, Mohan Nagar, Ghaziabad (U.P.) Ph. : 0120-4835400...98-99 M. : 09810831363 / 9818106660 : 08802288258 - 59-60

FARIDABAD
SCO-32, 1st Floor, Sec.-16, Faridabad (Haryana) Ph. : 0129-4150605-09 M. : 09811612707

GURGAON
1808/2, 2nd Floor, Old DLF, Near Honda Showroom, Sec.-14, Gurgaon (Haryana) Ph. : 0124-4219095-96-97-98 M. : 09873477222-333

JAIPUR
38, Jai Jawan Colony 3rd, Near Gaurav Tower, JLN Marg, Jaipur (Rajasthan) Ph. : 0141-2550077, 2550202 M. : 08824246937

GWALIOR
C-8, 1st Floor, Opposite Aditya College, Near Airtel Office, City Centre, Gwalior (M.P.) Ph. : 0751-4078733-44 M. : 09754478733
