Course description:
This course introduces the basic techniques for text-based information systems: efficient text indexing, Boolean and vector space retrieval models, evaluation and interface issues; Web search including crawling, link-based algorithms, and Web metadata; text/Web clustering, classification, text mining. This course has a lab where the student will apply the studied techniques and do a project using appropriate programming language.