Big Data

MapReduce Algorithm

In this tutorial, we will focus on the MapReduce algorithm, how it works, an example, the Word Count problem, implementation of the Word Count problem in…

10 months ago

Airflow Operators

Airflow operators are core components of any workflow defined in Airflow. An operator represents a single task that runs independently…

2 years ago

Big Data Interview Questions and Answers

Top 20 frequently asked Big Data interview questions and answers for freshers and experienced Data Engineers, ETL engineers, Data Scientists,…

2 years ago

Apache Airflow: A Workflow Management Platform

Apache Airflow is a workflow management platform that schedules and monitors data pipelines. We can also describe Airflow as…

3 years ago

Apache Sqoop

In this tutorial, we will focus on the data ingestion tool Apache Sqoop for processing big data. Most of the…

3 years ago

Apache Hive Hands-On

In this tutorial, we will focus on Hadoop Hive for processing big data. What is Hive? Hive is a component in…

4 years ago

Apache Pig Hands-On

In this tutorial, we will focus on the scripting language Apache Pig for processing big data. Apache Pig is a scripting…

4 years ago

Introduction to Apache Spark

In this tutorial, we will focus on Spark, the Spark framework, its architecture, how it works, Resilient Distributed Datasets (RDDs), RDD operations, Spark programming…

4 years ago

Hadoop Distributed File System

Hadoop is a Big Data computing platform for handling large datasets. Hadoop has two core components: HDFS and MapReduce.…

4 years ago

Understanding Big Data: Its Characteristics, Challenges, and Benefits

In this tutorial, we will focus on what big data is, its characteristics, types, benefits, barriers, and job roles. In…

4 years ago