AU: 4.0
Programme: CSC(CE)

Use models: Document search, document clustering, automatic topic hierarchy generation, document classification. Performance evaluation: Precision versus recall, experiment design. Vector Space Model. Latent Semantic Indexing. Features: Word stemming, case folding, stop words, thesauri, N-grams. Relevance ranking: Cosine, IDF, link-based scoring. Implementation issues: Inverted indexes, dictionaries, parsing, compression.



Comments