Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
hadoop [2020/02/12 13:24] – created andonovj | hadoop [2020/11/11 11:19] (current) – [Overview] andonovj | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | TODO | + | =====Overview===== |
+ | Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. | ||
+ | |||
+ | Hadoop is a whole eco system and deverse a wiki on its own, but here we will address several components: | ||
+ | |||
+ | * HDFS (Hadoop Distributed File system) | ||
+ | * HBase (Hadoop NoSQL Database) | ||
+ | * Yarn (Resource manager) | ||
+ | |||
+ | You can see the whole eco system below: | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | In a nutshell, HDFS on its own is storing the data into datanodes which allow many reads but only once write, where the HBase is suitable for many read-write operation again using the HDFS | ||
+ | |||
+ | |||
+ | |||
+ | =====Management===== | ||
+ | |||
+ | ====Services==== | ||
+ | < | ||
+ | [oracle@edvmr1p0 ~]$ start-dfs.sh | ||
+ | Starting namenodes on [localhost] | ||
+ | localhost: starting namenode, logging to / | ||
+ | localhost: starting datanode, logging to / | ||
+ | Starting secondary namenodes [0.0.0.0] | ||
+ | 0.0.0.0: starting secondarynamenode, | ||
+ | </ | ||
+ | |||
+ | < | ||
+ | [oracle@edvmr1p0 ~]$ start-yarn.sh | ||
+ | starting yarn daemons | ||
+ | starting resourcemanager, | ||
+ | localhost: starting nodemanager, | ||
+ | </ | ||
+ | |||
+ | < | ||
+ | [oracle@edvmr1p0 ~]$ start-hbase.sh | ||
+ | localhost: starting zookeeper, logging to / | ||
+ | |||
+ | starting master, logging to / | ||
+ | Java HotSpot(TM) 64-Bit Server VM warning: ignoring option PermSize=128m; | ||
+ | Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=128m; | ||
+ | starting regionserver, | ||
+ | </ | ||
+ | |||
+ | < | ||
+ | [oracle@edvmr1p0 ~]$ $JAVA_HOME/ | ||
+ | 10498 Jps | ||
+ | 8932 DataNode | ||
+ | 9910 HQuorumPeer | ||
+ | 8791 NameNode | ||
+ | 9112 SecondaryNameNode | ||
+ | 10158 HRegionServer | ||
+ | 10030 HMaster | ||
+ | 9391 NodeManager | ||
+ | 9279 ResourceManager | ||
+ | [oracle@edvmr1p0 ~]$ | ||
+ | </ | ||
+ | |||
+ | |||
+ | ====HDFS==== | ||
+ | < | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -mkdir /usr | ||
+ | ^C[oracle@edvmr1p0 ~]$ | ||
+ | [oracle@edvmr1p0 ~]$ | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -mkdir /user | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -mkdir / | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -ls / | ||
+ | Found 3 items | ||
+ | drwxr-xr-x | ||
+ | drwxr-xr-x | ||
+ | drwxr-xr-x | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -rmdir /usr | ||
+ | [oracle@edvmr1p0 ~]$ hdfs dfs -ls / | ||
+ | Found 2 items | ||
+ | drwxr-xr-x | ||
+ | drwxr-xr-x | ||
+ | [oracle@edvmr1p0 ~]$ | ||
+ | </ | ||
+ | |||
+ |