Hadoop Archives

More articles in Hadoop

Beginner Big data Data Engineering Hadoop

Introduction to Apache Kafka: Fundamentals and Working

Apache Kafka is a distributed, real-time streaming platform for large-scale data processing used by organizations.

Sunil Kumar 30 Dec, 2022
Beginner Data Engineering Hadoop

Hadoop Ecosystem

Hadoop is an open-source Apache framework written in Java that enables distributed processing of large data sets.

ANURAG SINGH CHOUDHARY 09 Oct, 2022
AWS Beginner Big data Cloud Computing Data Engineering

Basic Concept and Backend of AWS Elasticsearch

Amazon Elasticsearch Service is now called Amazon OpenSearch Service. Amazon OpenSearch supports both OpenSearch and Legacy Elasticsearch OSS.

Trupti Dekate 14 Jun, 2023
Beginner Big data Data Engineering Guide Hadoop

A Comprehensive Guide On Apache Sqoop

Apache Sqoop is data ingestion and migration technology for exporting and importing data from external sources.

Rahul 10 Oct, 2022
Beginner Big data Data Engineering Hadoop

Apache Sqoop: Features, Architecture and Operations

Apache SQOOP is a tool designed to aid in the large-scale export and import of data into HDFS from structured data repositories.

Prateek Majumder 04 Aug, 2023
Big data Data Engineering Hadoop Intermediate

Frequent Itemset Mining Using MapReduce on Hadoop

This article tries to solve the Hands-on practical Frequent Itemset Mining using the MapReduce algorithm on Hadoop.

Nitin 14 Sep, 2022
Beginner Big data Cloud Computing Data Engineering Hadoop

Basic Concept Behind Apache Hive and Elasticsearch

Elasticsearch is a RESTful search engine based on Lucene, a high-performance text search library that is in turn based on inverted indexes.

Trupti Dekate 04 Sep, 2022
Big data Data Engineering Hadoop Intermediate

Apache Impala- Features and Architecture

Apache Impala runs several systems in an Apache Hadoop cluster. Unlike traditional storage systems, it is not tied to its storage core.

Trupti Dekate 02 Sep, 2022
Beginner Data Engineering Hadoop

Apache Zookeeper Architecture and Installation

Apache Zookeeper is a data model. Zookeeper Architecture goes through the master node, so all writes are guaranteed to be sequential.

Trupti Dekate 03 Aug, 2022
Beginner Data Engineering Hadoop Interview Prep Interviews

Apache Flume Interview Questions

Apache Flume is a data ingestion mechanism for gathering, aggregating, and transmitting huge amounts of streaming data.

Prashant 08 Aug, 2022

Popular Categories

Generative AI Tools and Techniques

Popular GenAI Models

AI Development Frameworks

Data Science Tools and Techniques

More articles in Hadoop

Introduction to Apache Kafka: Fundamentals and Working

Hadoop Ecosystem

Basic Concept and Backend of AWS Elasticsearch

A Comprehensive Guide On Apache Sqoop

Apache Sqoop: Features, Architecture and Operations

Frequent Itemset Mining Using MapReduce on Hadoop

Basic Concept Behind Apache Hive and Elasticsearch

Apache Impala- Features and Architecture

Apache Zookeeper Architecture and Installation

Apache Flume Interview Questions

Popular in Hadoop

Want to Become a Data Engineer? Here’s a Comprehensive List of Resources to get Started

Top 10 Data Analytics Projects for 2026

Data Engineer Roadmap for 2026

Must Read Books for Beginners on Big Data, Hadoop and Apache Spark

Flagship Programs

Free Courses

Popular Categories

Generative AI Tools and Techniques

Popular GenAI Models

AI Development Frameworks

Data Science Tools and Techniques