Zu dieser Karteikarte gibt es einen kompletten Satz an Karteikarten. Kostenlos!
53
Inverse Document Frequency
• Extract vocabulary from Wiki text (including stemming and
stop list application)
• Compute TF-IDF from wiki pages („documents“).
• We get ranked lists for every document.
• Do the first ranked terms ‘characterize’ the document?
• Are the first ranked terms important for all documents?
stop list application)
• Compute TF-IDF from wiki pages („documents“).
• We get ranked lists for every document.
• Do the first ranked terms ‘characterize’ the document?
• Are the first ranked terms important for all documents?
Karteninfo:
Autor: CoboCards-User
Oberthema: PTT
Thema: PTT
Schule / Uni: Uni Koblenz
Ort: Koblenz
Veröffentlicht: 08.07.2016