Apache Spark
Apache Spark is an open-source, general-purpose, distributed cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
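To make "implicit data parallelism" concrete, here is a minimal plain-Python sketch of Spark's classic word-count data flow. This is an analogue, not actual Spark code: the list comprehensions stand in for Spark's `flatMap`, `map`, and `reduceByKey` transformations, which Spark would instead run in parallel across cluster partitions.

```python
# Plain-Python analogue of Spark's word-count pipeline.
# The real PySpark version would be roughly:
#   sc.textFile(path).flatMap(str.split) \
#     .map(lambda w: (w, 1)).reduceByKey(lambda a, b: a + b)
from collections import defaultdict

def word_count(lines):
    # flatMap: split each line into individual words
    words = [w for line in lines for w in line.split()]
    # map: pair each word with an initial count of 1
    pairs = [(w, 1) for w in words]
    # reduceByKey: sum the counts for each distinct word
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

print(word_count(["spark is fast", "spark is distributed"]))
# → {'spark': 2, 'is': 2, 'fast': 1, 'distributed': 1}
```

In Spark, each of these stages operates on a partitioned dataset (an RDD or DataFrame), so the same three-step pipeline scales across a cluster without the programmer writing any explicit parallel code.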
Here are 328 public repositories matching this topic...
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
Updated Oct 8, 2024 - Shell
[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Updated Feb 21, 2022 - Shell
A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark
Updated May 26, 2019 - Shell
Large tech knowledge base from 20 years in DevOps, Linux, Cloud, Big Data, AWS, GCP, etc. - gradually porting my large private knowledge base to public
Updated Oct 13, 2024 - Shell
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Updated Feb 2, 2020 - Shell
🪐 1-click Kubeflow using ArgoCD
Updated Aug 8, 2024 - Shell
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
Updated Feb 27, 2023 - Shell
Created by Matei Zaharia
Released May 26, 2014
- Followers: 427
- Repository: apache/spark
- Website: spark.apache.org
- Wikipedia