I have been working with Hadoop 2.9.1 for over a year and have learned a lot about installing Hadoop in a multi-node cluster environment. I would like to share what I have learned and applied, in the hope that it will help someone else configure their system.

The deployment I have done has one NameNode and one or more DataNodes on Ubuntu 16.04, assuming 5 CPUs and 13GB RAM. I will list every command used in this tutorial, right down to the very basics, for those who are new to Ubuntu.

NOTE: Sometimes you may have to put "sudo" in front of a command. I use nano in this article for the benefit of beginners, but you can use any editor you prefer (e.g. vi). This article also does not cover SSL, Kerberos, etc. For all purposes here, Hadoop will be open, with no login required.

Additional Setup/Configurations to Consider:

ZooKeeper: It is also a good idea to use ZooKeeper to synchronize your configuration.

Secondary NameNode: This should run on a separate server. Its function is to take checkpoints of the NameNode's file system.
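As a rough illustration of the Secondary NameNode point, here is a minimal sketch of the hdfs-site.xml properties that typically control where it runs and how often it checkpoints. The hostname `snn.example.com` and the directory `/usr/local/hadoop/data/namesecondary` are placeholders I made up for this example, not values from my cluster. The port and checkpoint period shown are the Hadoop 2.x defaults, so adjust them to your own setup.

```xml
<!-- hdfs-site.xml (fragment): a sketch only, not a complete configuration -->
<configuration>
  <!-- Host and HTTP port of the Secondary NameNode.
       snn.example.com is a hypothetical hostname; 50090 is the Hadoop 2.x default port. -->
  <property>
    <name>dfs.namenode.secondary.http-address</name>
    <value>snn.example.com:50090</value>
  </property>

  <!-- Seconds between two checkpoints; 3600 (one hour) is the default. -->
  <property>
    <name>dfs.namenode.checkpoint.period</name>
    <value>3600</value>
  </property>

  <!-- Local directory on the Secondary NameNode server where checkpoint images are stored.
       This path is an assumption for illustration. -->
  <property>
    <name>dfs.namenode.checkpoint.dir</name>
    <value>file:///usr/local/hadoop/data/namesecondary</value>
  </property>
</configuration>
```

Running the Secondary NameNode on its own server keeps the memory-heavy checkpoint merge off the NameNode machine.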