This repository demonstrates big data processing, visualization, and machine learning using tools such as Hadoop, Spark, Kafka, and Python.
python big-data apache-spark data-visualization spark-streaming apache-kafka hadoop-mapreduce spark-mllib-library spark-mllib big-data-analytics hiveql hadoop-installation spark-graphx hadoop-hdfs spark-rdd hadoop-hive data-preprocessing-and-cleaning big-data-analytics-techniques data-stratification
-
Updated
Dec 5, 2024 - Jupyter Notebook