hadoop-cluster

Star

Here are 129 public repositories matching this topic...

big-data-europe / docker-hadoop

Star

Apache Hadoop docker image

docker hadoop hadoop-cluster hadoop-docker docker-hadoop

Updated Feb 1, 2024
Shell

groda / big_data

Star

Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.

docker big-data spark apache-spark hadoop bigdata jupyter-notebook pyspark hadoop-cluster mapreduce gutenberg-ebooks hadoop-mapreduce spark-sql mrjob bigtop hadoop-hdfs testdfsio mapreduce-bash apache-sedona

Updated Jul 27, 2025
Jupyter Notebook

Impetus / jumbune

Star

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

yarn hadoop apm developer-tools data-analysis hadoop-cluster devops-tools data-quality optimization-framework cluster-monitoring monitoring-tool hadoop-monitor yarn-hadoop-cluster aiops hadoop-monitoring

Updated Jan 1, 2023
Java

Segence / docker-hadoop

Star

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

docker spark hadoop hadoop-cluster zeppelin-notebook

Updated Feb 2, 2020
Shell

sergevs / ansible-cloudera-hadoop

Star

ansible playbook to deploy cloudera hadoop components to the cluster

kafka impala hbase hadoop-cluster oozie cloudera-hadoop

Updated Sep 8, 2018
Shell

rainmaple / WIFI_BussinessBigDataAnalyseSystem

Star

A System is designed to analyse BigData collect from Wifi probe

spark realtime hbase hadoop-cluster echarts

Updated Dec 31, 2018
JavaScript

hokstack / hok-helm

Star

HokStack - Run Hadoop Stack on Kubernetes

kubernetes automation hadoop bigdata dataops operator hadoop-cluster devops-tools hdp hadoop-hdfs

Updated May 10, 2020
Shell

waltherg / distributable_docker_sql_on_hadoop

Star

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Updated Nov 16, 2017
Shell

hyeonsangjeon / dataplatform

Star

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

hive hadoop hadoop-cluster hadoop-mapreduce hadoop-docker pyspark-notebook zeppelin-notebook hadoop-ecosystem

Updated Nov 7, 2019
Shell

manuparra / MasterDegreeCC_Practice

Star

Taller del Máster Profesional de Informática UGR. Curso de CloudComputing.

docker practice hadoop docker-container virtual-machine cluster hdfs hadoop-cluster opennebula cloudcomputing docker-cluster

Updated May 6, 2019

pfisterer / apache-knox-docker

Star

Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker

dockerfile hadoop rest-api hadoop-cluster hadoop-ecosystem apache-knox gateway-server

Updated Mar 21, 2022
Dockerfile

lyingbo / hadoop-cluster-docker

Star

Run Hadoop Cluster within Docker Containers

hadoop-cluster hadoop-docker hadoop-3-2-0

Updated Jan 19, 2020
Shell

Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows

Star

A storage reference to a comprehensive guide on installing Hadoop on Windows

hadoop-cluster hadoop-mapreduce hadoop-framework

Updated Jun 11, 2018
Shell

MitaliBhiwande / Clustering-Algorithms

Star

Colelction of various clustering algorithms including K means, HAC, DBscan. Also includes Hadoop, MapReduce, implementation of K mean algorithm

hadoop-cluster mapreduce kmeans-clustering hierarchical-clustering density-based-clustering

Updated Mar 4, 2018
Python

jinho-yoo-jack / HadoopCluster

Star

based Docker

hadoop docker-compose hadoop-cluster hadoop-docker

Updated Jul 18, 2023
Shell

aimanamri / raspberry-pi4-hadoop-spark-cluster

Star

This is a self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark and Raspbian OS. In this project, 3-node cluster will be setup using Raspberry Pi 4, install HDFS and run Spark processing jobs via YARN.

big-data yarn pyspark hdfs distributed-storage hadoop-cluster parallel-processing spark-shell spark-cluster raspberry-pi-4

Updated Jul 13, 2024
Shell

roboxue / YarnVision

Star

UI for Hadoop Resource Manager

hadoop-cluster resource-manager

Updated Mar 1, 2018
Vue

malabz / HAlign-2

Star

a multiple sequence alignment tool

hadoop-cluster multiple-sequence-alignment

Updated Jul 15, 2021
HTML

tugrulhkarabulut / hadoop-movie-rating-prediction

Star

Movie rating prediction application

flask machine-learning natural-language-processing hadoop hadoop-cluster hadoop-mapreduce mrjob

Updated Jun 30, 2021
CSS

AnalyticsApps / LogAnalyzer

Star

Analyses the customer logs for bigdata components like HDFS, Hive, HBase, Yarn, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, Nifi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas & Zookeeper.

docker hadoop-cluster ambari loganalyzer

Updated Jun 18, 2018
Shell

Improve this page

Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hadoop-cluster

Here are 129 public repositories matching this topic...

big-data-europe / docker-hadoop

groda / big_data

Impetus / jumbune

Segence / docker-hadoop

sergevs / ansible-cloudera-hadoop

rainmaple / WIFI_BussinessBigDataAnalyseSystem

hokstack / hok-helm

waltherg / distributable_docker_sql_on_hadoop

hyeonsangjeon / dataplatform

manuparra / MasterDegreeCC_Practice

pfisterer / apache-knox-docker

lyingbo / hadoop-cluster-docker

Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows

MitaliBhiwande / Clustering-Algorithms

jinho-yoo-jack / HadoopCluster

aimanamri / raspberry-pi4-hadoop-spark-cluster

roboxue / YarnVision

malabz / HAlign-2

tugrulhkarabulut / hadoop-movie-rating-prediction

AnalyticsApps / LogAnalyzer

Improve this page

Add this topic to your repo