大家好,又见面了,我是你们的朋友全栈君。
安装Hadoop(伪分布式环境)namenode和datanode无法启动解决方案
先附上我参考的安装教程链接
10.1.88.4/index_1.php?url=http://www.msftconnecttest.com/redirect
我在执行./start-all.sh之后发现,没有任何错误提示,输入jps得到如下结果:
[hadoop@localhost sbin]$ ./start-all.sh
This script is Deprecated. Instead use start-dfs.sh and start-yarn.sh
Starting namenodes on [localhost]
localhost: starting namenode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-namenode-localhost.localdomain.out
localhost: starting datanode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-datanode-localhost.localdomain.out
Starting secondary namenodes [0.0.0.0]
0.0.0.0: starting secondarynamenode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-secondarynamenode-localhost.localdomain.out
starting yarn daemons
resourcemanager running as process 21995. Stop it first.
localhost: nodemanager running as process 22133. Stop it first.
[hadoop@localhost sbin]$ jps
22133 NodeManager
23848 Jps
21995 ResourceManager
明显没有datanode和namenode,上网找了很多方法都没用。
按照网上的方法,我就查看文件夹data/tmp/data发现我根本没有这个目录。一脸懵逼。
我只好查看$HADOOP_HOME/log里面的文件,查看有关于datanode和namenode的日志,
我先查看的是datanode的日志,
有点多,直接划到最后,(看我加粗字体)
2019-11-02 17:35:59,401 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: registered UNIX signal handlers for [TERM, HUP, INT]
2019-11-02 17:36:00,195 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: Invalid dfs.datanode.data.dir /usr/software/hadoop_install/hadoop/data/dfs/data :
java.io.FileNotFoundException: File file:/usr/software/hadoop_install/hadoop/data/dfs/data does not exist
at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:635)
at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:861)
at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:625)
at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:442)
at org.apache.hadoop.util.DiskChecker.mkdirsWithExistsAndPermissionCheck(DiskChecker.java:233)
at org.apache.hadoop.util.DiskChecker.checkDirInternal(DiskChecker.java:141)
at org.apache.hadoop.util.DiskChecker.checkDir(DiskChecker.java:116)
at org.apache.hadoop.hdfs.server.datanode.DataNode$DataNodeDiskChecker.checkDir(DataNode.java:2580)
at org.apache.hadoop.hdfs.server.datanode.DataNode.checkStorageLocations(DataNode.java:2622)
at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:2604)
at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:2497)
at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:2544)
at org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:2729)
at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:2753)
2019-11-02 17:36:00,207 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: Exception in secureMain
java.io.IOException: All directories in dfs.datanode.data.dir are invalid: “/usr/software/hadoop_install/hadoop/data/dfs/data”
at org.apache.hadoop.hdfs.server.datanode.DataNode.checkStorageLocations(DataNode.java:2631)
at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:2604)
at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:2497)
at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:2544)
at org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:2729)
at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:2753)
2019-11-02 17:36:00,208 INFO org.apache.hadoop.util.ExitUtil: Exiting with status 1
2019-11-02 17:36:00,216 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down DataNode at localhost/127.0.0.1
************************************************************/
[hadoop@localhost logs]$
我顿时恍然大悟,,肯定是权限不够,看不到data,我立马回到hadoop的安转目录下查看文件的权限情况
[hadoop@localhost hadoop]$ ls -l
总用量 128
drwxr-xr-x. 2 hadoop hadoop 194 11月 2 17:50 bin
drwxr-xr-x. 2 root root 6 11月 2 16:58 data
drwxr-xr-x. 3 hadoop hadoop 20 11月 2 16:57 etc
drwxr-xr-x. 2 hadoop hadoop 106 9月 10 2018 include
drwxr-xr-x. 3 hadoop hadoop 20 9月 10 2018 lib
drwxr-xr-x. 2 hadoop hadoop 239 9月 10 2018 libexec
-rw-r–r–. 1 hadoop hadoop 99253 9月 10 2018 LICENSE.txt
drwxrwxr-x. 3 hadoop hadoop 4096 11月 2 17:36 logs
-rw-r–r–. 1 hadoop hadoop 15915 9月 10 2018 NOTICE.txt
-rw-r–r–. 1 hadoop hadoop 1366 9月 10 2018 README.txt
drwxr-xr-x. 2 hadoop hadoop 4096 9月 10 2018 sbin
drwxr-xr-x. 4 hadoop hadoop 31 9月 10 2018 share
drwxr-xr-x. 2 root root 27 11月 2 16:23 test
果然 ,根据红色字体能发现,data的权限所有者是root的,hadoop根本就不能操作,我就想肯定是一开始创建的时候滥用了root用户
到这里就很简单了,两行命令即可:
# 修改文件权限拥有者,hadoop是我的用户名,data是文件夹名字
sudo chown -R hadoop data
# 修改文件权限组
sudo chgrp -R hadoop data
修改过后,查看一下修改结果,可以看到修改成功:
[hadoop@localhost hadoop]$ ls -l
总用量 128
drwxr-xr-x. 2 hadoop hadoop 194 11月 2 17:50 bin
drwxr-xr-x. 2 hadoop hadoop 6 11月 2 16:58 data
drwxr-xr-x. 3 hadoop hadoop 20 11月 2 16:57 etc
drwxr-xr-x. 2 hadoop hadoop 106 9月 10 2018 include
drwxr-xr-x. 3 hadoop hadoop 20 9月 10 2018 lib
drwxr-xr-x. 2 hadoop hadoop 239 9月 10 2018 libexec
-rw-r–r–. 1 hadoop hadoop 99253 9月 10 2018 LICENSE.txt
drwxrwxr-x. 3 hadoop hadoop 4096 11月 2 17:36 logs
-rw-r–r–. 1 hadoop hadoop 15915 9月 10 2018 NOTICE.txt
-rw-r–r–. 1 hadoop hadoop 1366 9月 10 2018 README.txt
drwxr-xr-x. 2 hadoop hadoop 4096 9月 10 2018 sbin
drwxr-xr-x. 4 hadoop hadoop 31 9月 10 2018 share
drwxr-xr-x. 2 root root 27 11月 2 16:23 test
然后再回去停止刚才执行的所有node
[hadoop@localhost sbin]$ ./stop-all.sh
This script is Deprecated. Instead use stop-dfs.sh and stop-yarn.sh
Stopping namenodes on [localhost]
localhost: no namenode to stop
localhost: no datanode to stop
Stopping secondary namenodes [0.0.0.0]
0.0.0.0: no secondarynamenode to stop
stopping yarn daemons
stopping resourcemanager
localhost: stopping nodemanager
localhost: nodemanager did not stop gracefully after 5 seconds: killing with kill -9
no proxyserver to stop
最后就是启动所有node
[hadoop@localhost sbin]$ ./start-all.sh
This script is Deprecated. Instead use start-dfs.sh and start-yarn.sh
Starting namenodes on [localhost]
localhost: starting namenode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-namenode-localhost.localdomain.out
localhost: starting datanode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-datanode-localhost.localdomain.out
Starting secondary namenodes [0.0.0.0]
0.0.0.0: starting secondarynamenode, logging to /usr/software/hadoop_install/hadoop/logs/hadoop-hadoop-secondarynamenode-localhost.localdomain.out
starting yarn daemons
starting resourcemanager, logging to /usr/software/hadoop_install/hadoop/logs/yarn-hadoop-resourcemanager-localhost.localdomain.out
localhost: starting nodemanager, logging to /usr/software/hadoop_install/hadoop/logs/yarn-hadoop-nodemanager-localhost.localdomain.out
输入jps命令查看启动情况:
[hadoop@localhost sbin]$ jps
36534 DataNode
36343 NameNode
37097 NodeManager
36762 SecondaryNameNode
36954 ResourceManager
37422 Jps
可以看到所有的DataNode和NameNode都已经成功启动。
激动万分,终于弄出来了,哈哈大家要是哪里对不上或者是有其他问题,可以留言问我,我最近装这个装了好几遍哈哈。
发布者:全栈程序员-用户IM,转载请注明出处:https://javaforall.cn/129408.html原文链接:https://javaforall.cn
【正版授权,激活自己账号】: Jetbrains全家桶Ide使用,1年售后保障,每天仅需1毛
【官方授权 正版激活】: 官方授权 正版激活 支持Jetbrains家族下所有IDE 使用个人JB账号...