Inverse Document Frequency
• Extract the vocabulary from the wiki text (including stemming and stop-list filtering).
• Compute TF-IDF for the wiki pages ("documents").
• This gives a ranked term list for every document.
• Do the top-ranked terms ‘characterize’ the document?
• Are the top-ranked terms important for all documents?
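The steps above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the three toy "documents", the short stop list, and the names `tokenize` and `tf_idf` are hypothetical and not from the original card; stemming is omitted; and the weighting used (relative term frequency times the natural log of N / df) is just one common TF-IDF variant.

```python
import math
import re
from collections import Counter

# Hypothetical toy stop list; a real one would be much longer.
STOP_WORDS = {"the", "a", "is", "of", "and", "in", "on", "are", "to"}


def tokenize(text):
    """Lowercase, split on non-letters, and drop stop words (no stemming)."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def tf_idf(docs):
    """Return {doc_name: [(term, score), ...]}, ranked by descending TF-IDF."""
    tokens = {name: tokenize(text) for name, text in docs.items()}
    n_docs = len(docs)
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for toks in tokens.values() for term in set(toks))
    ranked = {}
    for name, toks in tokens.items():
        counts = Counter(toks)
        scores = {
            # Relative term frequency times log-scaled inverse document frequency.
            term: (count / len(toks)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
        # Sort by score (highest first); break ties alphabetically.
        ranked[name] = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked


# Assumed stand-ins for wiki pages.
docs = {
    "koblenz": "Koblenz is a city in Germany on the Rhine",
    "berlin": "Berlin is the capital city of Germany",
    "rhine": "The Rhine is a river in Europe",
}

ranked = tf_idf(docs)
for name, terms in ranked.items():
    print(name, [term for term, _ in terms[:3]])
```

In this sketch a term that occurs in every document gets an IDF of log(1) = 0, so it sinks to the bottom of every ranked list. That suggests one way to read the two questions: the top-ranked terms tend to be the ones specific to a single document, not the ones shared by all documents.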
Flashcard info:
Author: CoboCards-User
Main topic: PTT
Topic: PTT
School / Univ.: Uni Koblenz
City: Koblenz
Published: 08.07.2016