Skip to content

asktushar/Generic-Data-Recon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Complaints Report

Overview

  • This repository contains batch processing code for generating customer complaints report. Code is written in scala 2.12.10 and spark 3.0.1

Author

Author of the code is Tushar Kesarwani tushar.kesarwani2@gmail.com

How to develop locally

Develop locally. Tests are good to create mock data and run tests against it. Developing against Hadoop clusters is possible but generally not necessary and reduces the velocity.

How to run from local

  • Use intelliJ run application to run the program locally
  • Use the below CLI Input in intelliJ run configuration as it is :
ComplaintReport
local[1]
file:///absoluteFilePath/complaints.jsonl
file:///absoluteFilePath/category_names.json
file:///absoluteFilePath/service_names.json
file:///absoluteFilePath/output

How to test

mvn test

How to build

mvn clean compile install

How to run on cluster

Use the below unix script to execute code in cluster

sh deploy/run.sh ComplaintReport local[1] file:///absoluteFilePath/complaints.jsonl file:///absoluteFilePath/category_names.json file:///absoluteFilePath/service_names.json file:///absoluteFilePath/output

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published