Other Posts

Getting started with Julia language- Part 3, Variables and types

Welcome to the third part in the article series – ‘Getting started with Julia language’. Julia language is growing in popularity among data scientists, many surveys has listed it as one in top 5 programming languages for data scientists. Mostly due to the factors such as the faster execution time and speedier development process facilitated by […]

Read More

How to conduct vector similarity search with Elasticsearch

In this article, we’ll learn how to do vector similarity search using elasticsearch with an example. Before jumping into the tutorial, let’s brush up our knowledge a bit a familiarise the basics of semantic search, vector similarity, similarity search, etc. You’re welcome to skip the intro and jump to topics that interest you from the index […]

Read More

Getting started with Julia language- Part 2, REPL and Packages

This is the second article in the ‘Getting started with Julia language’ series. In the previous article, we discussed the strengths and limitations of Julia. We also covered the environment setup of Julia on Ubuntu 20.04. Now that you have a basic idea about the language and how to set it up, it’s time to dive […]

Read More

Getting started with Julia language – Part 1

In the this article, we’ll discuss how to get started with Julia programming language.  Whenever a new programming language makes an appearance, we have to ask ourselves whether learning it is worth it or not. This is what happened when Julia programming language was first released. However, over the years, Julia proved itself to be […]

Read More

How to set up a custom Hadoop single node cluster in the pseudo-distributed mode

In this article, we’ll explore how to set up a custom Hadoop single node cluster in the pseudo-distributed mode. Apache Hadoop provides a software framework for distributed storage and processing of big data using the MapReduce programming model. The core of Apache Hadoop consists of a storage part, known as Hadoop Distributed File System (HDFS), and […]

Read More

Hive Installation on ubuntu 18.04 | MySQL Metastore

Apache Hive is a data warehouse infrastructure that facilitates querying and managing large data sets which resides in distributed storage system. It is developed on top of Hadoop. Hive has its own SQL-like query language called HiveQL (Hive Query Language). Hive query language is similar to SQL wherein it supports subqueries. With Hive query language, it is possible […]

Read More