Hadoop is an Apache framework for processing large amounts of data in parallel. In production, Hadoop mostly runs on Linux-based systems. As developers who want to explore Hadoop, however, we often need to get started on Windows.
Because Hadoop is developed and run on Linux, setting it up on Windows requires a Linux-like environment; the main options are Cygwin or a virtual machine (VM player).
Below are the steps required to install and configure Hadoop on Windows using Cygwin.
What is Cygwin:- Cygwin is a Linux-like environment for Windows-based systems. It provides a command-line interface whose commands are carried out by calling Windows DLLs and APIs, so you need to know the most common Unix commands to use it.
It is free and open source software.
1. Download setup.exe from the Cygwin site and run it. On the Select Packages screen, type openssh in the search box, select the openssh package, and install it along with its required dependencies.
Once Cygwin is installed on your system, make sure that it works.
2. Make sure the JAVA_HOME environment variable is set and points to your JDK 1.5 or JDK 1.6 installation.
3. Unpack the Hadoop distribution archive. Then, from the top-level directory of the unpacked distribution,
type the command below: bin/hadoop
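Step 3 can be sketched in the Cygwin shell roughly as follows. This is only a minimal sketch: it assumes the distribution is a .tar.gz archive with a single top-level directory, and the helper name unpack_hadoop, the archive name, and the target directory are placeholders, not part of Hadoop.

```shell
# Hypothetical helper: unpack a .tar.gz archive ($1) into a directory ($2)
# and print the path of the bin/hadoop launcher found inside it.
unpack_hadoop() {
  mkdir -p "$2" && tar -xzf "$1" -C "$2" || return 1
  # Assumption: the archive contains one top-level directory with bin/hadoop.
  for launcher in "$2"/*/bin/hadoop; do
    if [ -f "$launcher" ]; then
      echo "$launcher"
      return 0
    fi
  done
  return 1
}

# Example usage (archive name and target directory are placeholders):
#   launcher=$(unpack_hadoop hadoop.tar.gz "$HOME/hadoop") && "$launcher"
```

Running the launcher without arguments should print the usage text of the hadoop script, which is a quick sign that the unpacked distribution and JAVA_HOME are being picked up.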
Now you are ready to start a Hadoop node.
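Before starting a node, the prerequisites from steps 1 and 2 can be double-checked with a short script run inside the Cygwin terminal. This is a sketch under a few assumptions: the helper names is_cygwin_kernel and has_java are made up for illustration, and the JDK is assumed to keep its launcher under bin/ inside JAVA_HOME.

```shell
# Hypothetical helper: succeeds when the given kernel name looks like Cygwin.
is_cygwin_kernel() {
  case "$1" in
    CYGWIN*) return 0 ;;
    *)       return 1 ;;
  esac
}

# Hypothetical helper: succeeds when directory $1 contains an executable
# Java launcher (bin/java or bin/java.exe).
has_java() {
  [ -n "$1" ] && { [ -x "$1/bin/java" ] || [ -x "$1/bin/java.exe" ]; }
}

# Step 1: are we inside Cygwin, and was openssh installed?
if is_cygwin_kernel "$(uname -s)"; then
  echo "Running inside Cygwin"
else
  echo "Not a Cygwin shell"
fi

if command -v ssh >/dev/null 2>&1; then
  echo "ssh found at $(command -v ssh)"
else
  echo "ssh not found; rerun setup.exe and select the openssh package"
fi

# Step 2: is JAVA_HOME set, and does it point at a JDK?
if has_java "${JAVA_HOME:-}"; then
  echo "JAVA_HOME looks valid: $JAVA_HOME"
else
  echo "JAVA_HOME is unset or does not contain bin/java"
fi
```

The script only reports what it finds and changes nothing, so it is safe to rerun after each fix.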