Gene-Finder

This is a playground exercise for finding substrings within a long character sequence.

Getting Started

Installing & Running

Going Forward

  • Writing the prefix tree to a DB instead of keeping it in memory
  • Splitting the tree into partitions by prefix and storing each partition separately
  • Using map-reduce to process different portions of the data in a distributed manner
  • Handling edge cases: coinciding prefixes
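
As a rough illustration of the ideas in the list above, here is a minimal in-memory sketch of a prefix tree used to find substrings, together with one possible way to split patterns into partitions by prefix. It is not taken from this repository's code: the names `TrieNode`, `build_trie`, `find_matches`, and `partition_by_prefix`, and the sample patterns, are all hypothetical and chosen only for the example.

```python
from collections import defaultdict


class TrieNode:
    """One node of a prefix tree (hypothetical helper, not the repo's class)."""

    def __init__(self):
        self.children = {}   # next character -> TrieNode
        self.pattern = None  # set to the full pattern when one ends here


def build_trie(patterns):
    """Insert every non-empty pattern into a fresh prefix tree."""
    root = TrieNode()
    for pattern in patterns:
        if not pattern:
            continue
        node = root
        for ch in pattern:
            node = node.children.setdefault(ch, TrieNode())
        node.pattern = pattern
    return root


def find_matches(sequence, root):
    """Return (offset, pattern) for every occurrence of a pattern in sequence.

    Coinciding prefixes (e.g. "AT" and "ATG") are both reported, because the
    walk keeps going after it passes a node that ends a shorter pattern.
    """
    matches = []
    for start in range(len(sequence)):
        node = root
        i = start
        while i < len(sequence) and sequence[i] in node.children:
            node = node.children[sequence[i]]
            i += 1
            if node.pattern is not None:
                matches.append((start, node.pattern))
    return matches


def partition_by_prefix(patterns, prefix_len=1):
    """Group patterns by their first prefix_len characters."""
    partitions = defaultdict(list)
    for pattern in patterns:
        partitions[pattern[:prefix_len]].append(pattern)
    return dict(partitions)


patterns = ["ATG", "AT", "GGC"]
sequence = "CATGGCAT"
matches = find_matches(sequence, build_trie(patterns))
partitions = partition_by_prefix(patterns)
```

Because each partition holds a disjoint set of patterns, a tree could be built and searched per partition independently and the results merged afterwards, which is the shape a map-reduce job would need. This sketch scans from every offset, so it is meant to show the structure rather than to be fast on very long sequences.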