
TF-IDF
TF-IDF (Term Frequency-Inverse Document Frequency) is a metric used in text analysis to evaluate how important a word is in a specific document compared to a collection of documents. It combines two measures: Term Frequency (TF), which counts how often a word appears in a document, and Inverse Document Frequency (IDF), which reduces the weight of common words that appear across many documents. Together, TF-IDF highlights words that are unique and significant within a particular document, helping to identify key topics or keywords for tasks like search or classification efficiently.