Skip to content

Latest commit

 

History

History
41 lines (23 loc) · 2 KB

README.MD

File metadata and controls

41 lines (23 loc) · 2 KB

#Feeder Service Monitors a directory for new files, file thresholds may be set to the number of files, size in bytes, or age of files. Files will be moved to a target directory and posted to a message topic.

One service per directory to be monitored. Each directory will have a single associated dataType and InputFormatClass

##FileScannerProperties (prefix: file) file.inputDir - Directory to monitor for new files

file.frequency - time in MS to scan file.inputDir for files to process

file.ignorePrefix - Files that match this prefix will not be processed

file.maxAge - (default -1) max age in MS before a file should be processed regardless of count or size

file.maxSize - (default -1) max size in bytes an aggregated set of files should be before they should be processed regardless of count or age

file.maxFiles - (default -1) max number of files before files should be processed regardless of age or size

file.recursive - (default false) if true recursively search inputDir for files

file.errorRetryInterval - (default 60000) interval that a file should not be retried if it resulted in a processing error

file.errorRetryTimeUnit - (default MILLISECONDS) default time unit for errorRetryInterval

file.fsConfigResources - List of files to be added to Configuration, applied in order. This should include hadoop core/site

##FeederProperties (prefix: feeder) feeder.targetDir - location to move files once they have been fed

feeder.inputFormatClass - expected class of files the feeder is monitoring in file.inputDir

feeder.dataType - expected dataType of files the feeder is monitoring in file.inputDir

feeder.preservePath - if true, any part of the path beyond the file.inputDir will be preserved when moving the file to feeder.targetDir

##Additional Configuration (Spring Boot) spring.cloud.stream.bindings.feedSource-out-0.destination - target exchange for messages

###Environment: DEST_QUEUE - environment variable, defaults to ingest. May override to direct feeder messages to another queue