Spark RnD


The following stack of tools need to be configured once you downloaded and setup java 8 and Scala on your laptop. Remember to have a laptop with a higher RAM above 8 GB to set up this Big Data cluster on your machine.
  1. Configure Hadoop
  2. Configure Hbase
  3. Configure Kafka with Zookeeper
  4. Download Spark version which prebuilt with your hadoop version
  5. extract it and set %SPARK_HOME%\bin to Path variable
  6. create spark-env.cmd file in %SPARK_HOME%\conf folder
  7. Take the classpaths of your hadoop system by executing command on your windows command prompt [%HADOOP_HOME%\bin\hadoop classpath]
  8. copy the output and add that path to [set SPARK_DIST_CLASSPATH=] section of spark-env.cmd file
  9. Once you set with these all you are ready to execute spark-shell command


Comments

Popular posts from this blog

Let's try to build scrum masters/project managers/software architects/even a company with training AI models

TCP Ports list

Climate Change News from all over the world through API