Thursday, November 19, 2015

Working with RHadoop

working with RHadoop

working with RHadoop

Error

hdfs.ls("/tamil")

Error in .jcall("java/lang/Class",
"Ljava/lang/Class;", "forName", cl, :
No running JVM detected. Maybe .jinit() would help. Error in .jfindClass(as.character(class)) : No running JVM detected. Maybe .jinit() would help.

Solution


hdfs.init()

hdfs.ls("/")

Error

hdfs.init()
sh: 1: /media/bdalab/bdalab/sw/hadoop-2.7.1/bin: Permission denied
Error in .jnew("org/apache/hadoop/conf/Configuration") : java.lang.ClassNotFoundException In addition: Warning message: running command '/media/bdalab/bdalab/sw/hadoop-2.7.1/bin classpath' had status 126
Solution


Sys.setenv(HADOOP_CMD='/hadoop-2.7.1/bin/hadoop')
Sys.setenv(JAVA_HOME='/jdk1.8.0_60/')
to know the environment varibale
Sys.getenv("HADOOP_CMD")
hdfs.init()


Next :

1 comment:

Vale Co Xenia said...

RHadoop is a bridge between the statistical power of R and the distributed processing capabilities of Hadoop. It allows users to leverage the strengths of both platforms for complex data analysis and machine learning tasks on large datasets.

Big Data Projects For Final Year


Key Components
R: The statistical programming language for data manipulation, analysis, and visualization.
Hadoop: The distributed computing framework for storing and processing large datasets.
RHadoop: The integration layer that enables R to interact with Hadoop.
Core Functionalities
Data Import/Export: Reading and writing data to/from HDFS using R.
MapReduce: Writing custom MapReduce jobs in R for data processing.
Integration with Hadoop Ecosystem: Interacting with other Hadoop components like Hive, Pig, and HBase.

Deep Learning Projects for Final Year Students


Distributed Computing: Executing R code across multiple nodes in a Hadoop cluster.
Scalability: Handling large datasets and complex computations efficiently.

Cloud Computing Projects Final Year Projects