Apache Hadoop docker image
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Jumbune, an open source Big Data APM and Data Quality Management platform for data clouds. An enterprise offering is available at http://jumbune.com. More details of the open source offering are at,
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Ansible playbook to deploy Cloudera Hadoop components to the cluster
A system designed to analyse big data collected from Wi-Fi probes
HokStack - Run Hadoop Stack on Kubernetes
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop 3.2 in single-node or cluster mode with the gotty web terminal, Spark, Jupyter with PySpark, Hive, eco, etc.
Workshop for the Professional Master's in Computer Science at UGR. Cloud Computing course.
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
Run Hadoop Cluster within Docker Containers
A reference repository for a comprehensive guide on installing Hadoop on Windows
Collection of various clustering algorithms, including k-means, HAC, and DBSCAN. Also includes a Hadoop MapReduce implementation of the k-means algorithm.
This is a self-documented record of learning distributed data storage, parallel processing, and Linux using Apache Hadoop, Apache Spark, and Raspbian OS. In this project, a 3-node cluster is set up on Raspberry Pi 4 boards, HDFS is installed, and Spark processing jobs are run via YARN.
a multiple sequence alignment tool
Movie rating prediction application
Analyses customer logs for big data components such as HDFS, Hive, HBase, YARN, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, NiFi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas, and ZooKeeper.