Orange: Wikipedia

From OnnoWiki
Latest revision as of 08:34, 12 April 2020

Source: https://orange3-text.readthedocs.io/en/latest/widgets/wikipedia-widget.html

The Wikipedia widget fetches data from the MediaWiki RESTful web service API.

Input

None

Output

Corpus: A collection of documents from Wikipedia.

The Wikipedia widget retrieves text from the MediaWiki RESTful web service API. It is useful mostly for teaching and demonstrations.

Wikipedia-stamped.png
  • Query parameters:
    • Query word list, where each query is listed in a new line.
    • Language of the query. English is set by default.
    • Number of articles to retrieve per query (range 1-25). Note that querying is done recursively and that disambiguation pages are also retrieved, so the number of queries actually run can be larger than the value set on the slider.
  • Select which features to include as text features.
  • Information on the output.
  • Produce a report.
  • Run query.
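
The query parameters above can be sketched in plain Python. This is only an illustration, not the widget's own implementation: it assumes the standard MediaWiki Action API search endpoint (`/w/api.php` with `action=query` and `list=search`), and the function names and the default of 10 articles per query are hypothetical. It builds one request URL per query line and clamps the article count to the 1-25 range of the slider, without sending any request.

```python
from urllib.parse import urlencode


def build_search_url(query, lang="en", articles_per_query=10):
    """Build a MediaWiki search URL for one query (hypothetical helper)."""
    # Clamp to the 1-25 range described for the widget's slider.
    limit = max(1, min(25, int(articles_per_query)))
    params = {
        "action": "query",   # MediaWiki Action API module
        "list": "search",    # full-text search
        "srsearch": query,   # the query word
        "srlimit": limit,    # number of results to return
        "format": "json",
    }
    # The language code selects the Wikipedia edition; English by default.
    return f"https://{lang}.wikipedia.org/w/api.php?" + urlencode(params)


def build_search_urls(query_text, lang="en", articles_per_query=10):
    """Treat each non-empty line of query_text as a separate query."""
    queries = [line.strip() for line in query_text.splitlines() if line.strip()]
    return [build_search_url(q, lang, articles_per_query) for q in queries]


if __name__ == "__main__":
    for url in build_search_urls("Slovenia\nGermany", lang="en"):
        print(url)
```

Each URL could then be fetched with any HTTP client. Following disambiguation pages recursively, as the widget does, is left out of this sketch.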

Example

In the following simple workflow, we use the Wikipedia widget to retrieve the articles on ‘Slovenia’ and ‘Germany’. We then apply preprocessing with the Preprocess Text widget and observe the most frequent words in those articles with the Word Cloud widget.

The Wikipedia widget works just like any other corpus widget (NY Times, Twitter) and can be used in the same way.


Wikipedia-Example.png
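
For readers who want to see what the workflow computes, here is a minimal sketch in plain Python. It is not how Preprocess Text or Word Cloud are implemented. It assumes a simplified pipeline (lowercasing, splitting on letters only, dropping a small stop-word list), and the `STOPWORDS` set and the `most_frequent_words` function are made up for illustration.

```python
import re
from collections import Counter

# A tiny, hypothetical stop-word list; real preprocessing uses a much larger one.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "is", "to", "it"}


def most_frequent_words(documents, top_n=5):
    """Count word frequencies across documents after simple preprocessing."""
    counts = Counter()
    for text in documents:
        # Lowercase the text and keep runs of ASCII letters as tokens.
        tokens = re.findall(r"[a-z]+", text.lower())
        # Drop stop words before counting.
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(top_n)


if __name__ == "__main__":
    docs = [
        "Slovenia is a country in Europe.",
        "Germany is a country in Europe and Europe is a continent.",
    ]
    print(most_frequent_words(docs, top_n=2))
```

The most frequent words found this way are the ones a word cloud would draw largest.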

References

Related links