How to set up a custom Hadoop single node cluster in the pseudo-distributed mode

In this article, we’ll explore how to set up a custom Hadoop single node cluster in the pseudo-distributed mode. Apache Hadoop provides a software framework for distributed storage and processing of big data using the MapReduce programming model. The core of Apache Hadoop consists of a storage part, known as Hadoop Distributed File System (HDFS), and […]

Read More

Hive Installation on ubuntu 18.04 | MySQL Metastore

Apache Hive is a data warehouse infrastructure that facilitates querying and managing large data sets which resides in distributed storage system. It is developed on top of Hadoop. Hive has its own SQL-like query language called HiveQL (Hive Query Language). Hive query language is similar to SQL wherein it supports subqueries. With Hive query language, it is possible […]

Read More