    Repositories list

    • Repository for the Scala Spark workshop held by the Tenaris Data Science Department at universities
      MIT License
      2400 Updated Jan 12, 2021
    • This repo contains a notebook which shows how to use PySpark in Google Colab.
      Jupyter Notebook
      MIT License
      0000 Updated Mar 16, 2020
    • A simple PySpark notebook meant to describe Spark SQL to high school students.
      Jupyter Notebook
      MIT License
      0000 Updated Mar 18, 2019
    • A set of tools to manage Parquet files
      Java
      MIT License
      2000 Updated Jun 9, 2017
    • Apache Airflow (Incubating)
      Python
      Apache License 2.0
      15k000 Updated Oct 2, 2016
    • flume
      Adding includePath option to SpoolDir source
      Java
      Apache License 2.0
      1.6k100 Updated Sep 11, 2016
    • hadoop
      Contributing to Apache Hadoop to implement hadoop dfs -rename command
      Java
      Apache License 2.0
      9k000 Updated Aug 6, 2016
    • sqoop
      Mirror of Apache Sqoop
      Java
      Apache License 2.0
      586000 Updated Aug 3, 2016