Skip to content
February 6, 2023
Latest:
  • Python Generators
  • Python Iterators Examples
  • Big Data Interview Questions and Answers
  • Predicting Employee Churn in Python
  • Explain Machine Learning Model using SHAP
Machine Learning Geek

Machine Learning Geek

Boost Your Machine Learning Knowledge

  • Machine Learning
  • Interview
  • NLP
  • Python
  • Statistics
  • Optimization Techniques
  • Big Data
  • Books

Big Data

Big Data Interview 

Big Data Interview Questions and Answers

November 26, 2022November 26, 2022 Avinash Navlani big data, hadoop, hadoop big data, interview, interview questions

Top 20 frequently asked Big Data interview questions and answers for freshers and experienced Data Engineers, ETL engineers, Data Scientists,

Read more
Big Data Data Engineering 

Apache Airflow: A Workflow Management Platform

June 2, 2022June 1, 2022 Avinash Navlani Apache Airflow, data engineering, python, workflow management

Apache Airflow is a workflow management platform that schedules and monitors the data pipelines. We can also describe airflow as

Read more
Big Data 

Apache Sqoop

July 26, 2021April 30, 2022 Avinash Navlani apache sqoop, big data, hadoop, hadoop big data, sqoop

In this tutorial, we will focus on the data ingestion tool Apache Sqoop for processing big data. Most of the

Read more
Big Data 

Apache Hive Hands-0n

June 18, 2021November 23, 2022 Avinash Navlani 0 Comments big data, big data analytics, hadoop, Hive

In this tutorial, we will focus on Hadoop Hive for processing big data. What is Hive? Hive is a component in

Read more
Big Data 

Apache Pig Hands-On

June 11, 2021May 29, 2021 Avinash Navlani 0 Comments big data, big data analytics, hadoop, Pig

In this tutorial, we will focus on scripting language Apache PIG for processing big data.  Apache Pig is a scripting

Read more
Big Data 

Introduction to Apache Spark

October 27, 2020October 27, 2020 Avinash Navlani 0 Comments big data, big data analytics, hadoop, in-memory computation, spark

In this tutorial, we will focus on Spark, Spark Framework, its Architecture, working, Resilient Distributed Datasets, RDD operations, Spark programming

Read more
Big Data 

MapReduce Algorithm

October 26, 2020February 9, 2021 Avinash Navlani 0 Comments big data, hadoop, mapreduce, pyspark, spark, word count problem

In this tutorial, we will focus on MapReduce Algorithm, its working, example, Word Count Problem, Implementation of wordcount problem in

Read more
Big Data 

Hadoop Distributed File System

October 21, 2020October 21, 2020 Avinash Navlani 0 Comments big data, big data analytics, hadoop, HDFS

Hadoop is a Big Data computing platform for handling large datasets. Hadoop has a core two components: HDFS and MapReduce.

Read more
Big Data 

Understanding BigData: Its Characteristics, Challenges, and Benefits

October 21, 2020October 21, 2020 Avinash Navlani 0 Comments big data, big data analytics

In this tutorial, we will focus on what is big data, its characteristics, types, benefits, barriers, and job roles. In

Read more
Big Data 

Introduction to Hadoop

October 21, 2020October 21, 2020 Avinash Navlani 0 Comments big data, big data analytics, hadoop

In this tutorial, we will focus on what is Hadoop, its features, components, job trends, architecture, ecosystem, applications, and disadvantage.

Read more

Latest Posts

  • Python Generators
  • Python Iterators Examples
  • Big Data Interview Questions and Answers
  • Predicting Employee Churn in Python
  • Explain Machine Learning Model using SHAP
  • Text Clustering: Grouping News Articles in Python
  • Apache Airflow: A Workflow Management Platform
  • Recurrent Neural Networks
  • Git and GitHub for Data Scientists
  • Understanding Convolutional Neural Network (CNN) using Python

About Us


We love Data Science and we are here to provide you Knowledge on Machine Learning, Text Analytics, NLP, Statistics, Python, and Big Data. We focus on simple, elegant, and easy to learn tutorials.

Resources

  • AWS
  • Big Data
  • Business Analytics
  • Data Engineering
  • Deep Learning
  • Essentials Skills
  • Interview
  • Julia
  • Machine Learning
  • Mathematics
  • NLP
  • Optimization Techniques
  • Python
    • pandas
  • Recommender System
  • Statistics
  • Text Analytics

Archives

  • January 2023
  • November 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • September 2021
  • July 2021
  • June 2021
  • May 2021
  • April 2021
  • March 2021
  • February 2021
  • January 2021
  • December 2020
  • November 2020
  • October 2020
  • September 2020

Data Science Deals

  • DataCamp
  • UpGrad
  • Edureka Data Science
  • Dataquest
Copyright © 2023 Machine Learning Geek. All rights reserved.
Theme: ColorMag by ThemeGrill. Powered by WordPress.