Difference between revisions of "Orange: Word Cloud"

From OnnoWiki
Jump to navigation Jump to search
 
(5 intermediate revisions by the same user not shown)
Line 11: Line 11:
  
 
  Corpus: Documents that match the selection.
 
  Corpus: Documents that match the selection.
  Selected Word: Selected word that can be used as query in Concordance.
+
  Selected Word: Selected word that can be used as query in [[Concordance]].
 
  Word Counts: Words and their weights.
 
  Word Counts: Words and their weights.
  
Line 30: Line 30:
 
==Contoh==
 
==Contoh==
  
Word Cloud is an excellent widget for displaying the current state of the corpus and for monitoring the effects of preprocessing.
+
WIdget Word Cloud adalah widget yang sangat baik untuk menampilkan keadaan saat ini dari corpus dan untuk memantau efek dari widget Preprocess Text.
  
Use Corpus to load the data. Connect Preprocess Text to it and set your parameters. We’ve used defaults here, just to see the difference between the default preprocessing in the Word Cloud widget and the Preprocess Text widget.
+
Gunakan widget [[Corpus]] untuk memuat data. Dengan menyambungkan widget [[Preprocess Text]] ke dalamnya dan atur parameter. Kita menggunakan default di sini, hanya untuk melihat perbedaan antara preprocessing default di widget Word Cloud dan widget Preprocess Text.
  
[[File:Word-Cloud-Example.png|center|200px|thumb]]
+
[[File:Word-Cloud-Example.png|center|600px|thumb]]
  
We can see from the two widgets, that Preprocess Text displays only words, while default preprocessing in the Word Cloud tokenizes by word and punctuation.
+
Kita dapat melihat dari dua widget, bahwa widget Preprocess Text hanya menampilkan kata-kata, sementara preprocessing default di Word Cloud tokenizes oleh kata dan tanda baca.
 +
 
 +
==Youtube==
 +
 
 +
* [https://youtu.be/Xbxo6gWTp_8 YOUTUBE: Word Cloud dari Wikipedia]
  
 
==Referensi==
 
==Referensi==

Latest revision as of 19:12, 11 April 2020

Sumber: https://orange3-text.readthedocs.io/en/latest/widgets/wordcloud.html

Membuat word cloud dari corpus.

Input

Topic: Selected topic.
Corpus: A collection of documents.

Output

Corpus: Documents that match the selection.
Selected Word: Selected word that can be used as query in Concordance.
Word Counts: Words and their weights.

Word Cloud menampilkan token di corpus, besar font menunjukkan frekuensi kata dalam corpus atau rata-rata jumlah Kata, ketika fitur bag of words berada di input widget. Kata-kata terdaftar berdasarkan seringnya muncul (weight) dalam widget. Widget mengeluarkan dokumen, berisi token yang dipilih dari word cloud.

Word-Cloud-stamped.png
  • Information on the input.
    • number of words (tokens) in a topic
    • number of documents and tokens in the corpus
  • Adjust the plot.
    • If Color words is ticked, words will be assigned a random color. If unchecked, the words will be black.
    • Word tilt adjust the tilt of words. The current state of tilt is displayed next to the slider (‘no’ is the default).
    • Regenerate word cloud plot the cloud anew.
  • Words & weights displays a sorted list of words (tokens) by their frequency in the corpus or topic. Clicking on a word will select that same word in the cloud and output matching documents. Use Ctrl to select more than one word. Documents matching ANY of the selected words will be on the output (logical OR).
  • Save Image saves the image to your computer in a .svg or .png format.

Contoh

WIdget Word Cloud adalah widget yang sangat baik untuk menampilkan keadaan saat ini dari corpus dan untuk memantau efek dari widget Preprocess Text.

Gunakan widget Corpus untuk memuat data. Dengan menyambungkan widget Preprocess Text ke dalamnya dan atur parameter. Kita menggunakan default di sini, hanya untuk melihat perbedaan antara preprocessing default di widget Word Cloud dan widget Preprocess Text.

Word-Cloud-Example.png

Kita dapat melihat dari dua widget, bahwa widget Preprocess Text hanya menampilkan kata-kata, sementara preprocessing default di Word Cloud tokenizes oleh kata dan tanda baca.

Youtube

Referensi

Pranala Menarik