Other Posts

How to set up a custom Hadoop single node cluster in the pseudo-distributed mode

In this article, we’ll explore how to set up a custom Hadoop single-node cluster in pseudo-distributed mode. Apache Hadoop provides a software framework for distributed storage and processing of big data using the MapReduce programming model. The core of Apache Hadoop consists of a storage part, known as Hadoop Distributed File System (HDFS), and […]


Hive Installation on Ubuntu 18.04 | MySQL Metastore

Apache Hive is a data warehouse infrastructure that facilitates querying and managing large data sets that reside in distributed storage systems. It is built on top of Hadoop. Hive has its own SQL-like query language called HiveQL (Hive Query Language). Like SQL, HiveQL supports subqueries. With Hive query language, it is possible […]


How to conduct vector similarity semantic search with Elasticsearch

Here, we’ll learn how to do vector similarity search using Elasticsearch, with an example. Before jumping into the tutorial, let’s brush up on Elasticsearch semantic search, vector similarity, similarity search, etc.
