-
Updated
Jan 22, 2019 - Jupyter Notebook
#
big-data-essentials
Here are 6 public repositories matching this topic...
big-data spark apache-spark hadoop coursera mapreduce distributed-file-system hadoop-mapreduce big-data-essentials coursera-big-data yandex-big-data
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
kafka big-data spark hive hadoop architecture bigdata hbase zookeeper spark-streaming hdfs sqoop hadoop-ecosystem architecture-components yarn-hadoop-cluster bigdata-module hbase-cluster big-data-essentials hadooparchitecture
-
Updated
Sep 10, 2019 - Shell
Predict trips fare of New York city taxi using machine learning in Google BigQuery.
bigquery machine-learning big-data linear-regression google-bigquery prediction-model big-data-essentials
-
Updated
Apr 6, 2020
Simplified Hadoop Setup and Configuration Automation
data-science big-data hdfs ec2-instance big-data-analytics apache-hadoop big-data-projects hdfs-cluster big-data-essentials
-
Updated
Sep 2, 2023 - Shell
Basic commands used of Hadoop and it's Ecosystems
hadoop architecture bigdata terminologies basics commands-cheat-sheet big-data-essentials basic-hadoop
-
Updated
Jul 31, 2021 - Shell
Big Data system predicts pandemic risk (COVID-19) via data analysis, ML modeling, and real-time dashboard.
aws scala hive architecture hbase pyspark cloud-computing hdfs hadoop-ecosystem architecture-components bigdata-module big-data-essentials delta-lake lakehouse
-
Updated
Sep 23, 2025 - Python
Improve this page
Add a description, image, and links to the big-data-essentials topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the big-data-essentials topic, visit your repo's landing page and select "manage topics."