Skip to content

Benchmark of a standard Java vs Hadoop MapReduce implementation of WordCount

Notifications You must be signed in to change notification settings

peteratt/WordCountMR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WordCount: Java vs MapReduce

To run this application in a single node:

  1. Install Hadoop, stable version recommended: http://www.apache.org/dyn/closer.cgi/hadoop/common/

  2. Install Ant. In Ubuntu:

$ sudo apt-get install ant

  1. Set the environmental variable HADOOP_HOME pointing to the root Hadoop installation directory, e.g.:

$ export HADOOP_HOME=~/hadoop-0.20.203.0

  1. Edit the /conf/hadoop-env.sh by uncommenting the following line. In Mac OS X it is:

The java implementation to use. Required.

export JAVA_HOME=/Library/Java/Home

And setting your JAVA_HOME correctly.

  1. Insert the plain text files you want to count words in the input/ directory

  2. Run javaexec.sh and hadoopexec.sh and enjoy!

About

Benchmark of a standard Java vs Hadoop MapReduce implementation of WordCount

Resources

Stars

Watchers

Forks

Packages

No packages published