A capstone project covering several aspects of data engineering: data exploration, cleaning, modeling, pipelining, and processing.
sql bigdata pandas dataset datapipeline datalake dataprocessing dataengineering capstone-project apachespark datacleaning bigdataproject datamodeling datawherehouse dataschema bigdataprocessing
Updated Dec 25, 2022 - Jupyter Notebook