Information-Retrieval Text Search Engine Tokenization Term weighting Building Inverted Index Command Line Retrieval Engine Document Clustering