Difference between revisions of "Orange: Pubmed"

From OnnoWiki
Jump to navigation Jump to search
Line 3: Line 3:
 
Fetch data from PubMed journals.
 
Fetch data from PubMed journals.
  
Inputs
+
==Input==
  
    None
+
None
  
Outputs
+
==Output==
  
    Corpus: A collection of documents from the PubMed online service.
+
Corpus: A collection of documents from the PubMed online service.
  
 
PubMed comprises more than 26 million citations for biomedical literature from MEDLINE, life science journals, and online books. The widget allows you to query and retrieve these entries. You can use regular search or construct advanced queries.
 
PubMed comprises more than 26 million citations for biomedical literature from MEDLINE, life science journals, and online books. The widget allows you to query and retrieve these entries. You can use regular search or construct advanced queries.
Line 15: Line 15:
 
[[File:Pubmed-stamped.png|center|200px|thumb]]
 
[[File:Pubmed-stamped.png|center|200px|thumb]]
  
    Enter a valid e-mail to retrieve queries.
+
* Enter a valid e-mail to retrieve queries.
    Regular search:
+
* Regular search:
        Author: queries entries from a specific author. Leave empty to query by all authors.
+
** Author: queries entries from a specific author. Leave empty to query by all authors.
        From: define the time frame of publication.
+
** From: define the time frame of publication.
        Query: enter the query. Advanced search: enables you to construct complex queries. See PubMed’s website to learn how to construct such queries. You can also copy-paste constructed queries from the website.
+
** Query: enter the query. Advanced search: enables you to construct complex queries. See PubMed’s website to learn how to construct such queries. You can also copy-paste constructed queries from the website.
    Find records finds available data from PubMed matching the query. Number of records found will be displayed above the button.
+
* Find records finds available data from PubMed matching the query. Number of records found will be displayed above the button.
    Define the output. All checked features will be on the output of the widget.
+
* Define the output. All checked features will be on the output of the widget.
    Set the number of record you wish to retrieve. Press Retrieve records to get results of your query on the output. Below the button is an information on the number of records on the output.
+
* Set the number of record you wish to retrieve. Press Retrieve records to get results of your query on the output. Below the button is an information on the number of records on the output.
  
 
==Contoh==
 
==Contoh==

Revision as of 10:00, 29 January 2020

Sumber: https://orange3-text.readthedocs.io/en/latest/widgets/pubmed.html

Fetch data from PubMed journals.

Input

None

Output

Corpus: A collection of documents from the PubMed online service.

PubMed comprises more than 26 million citations for biomedical literature from MEDLINE, life science journals, and online books. The widget allows you to query and retrieve these entries. You can use regular search or construct advanced queries.

Pubmed-stamped.png
  • Enter a valid e-mail to retrieve queries.
  • Regular search:
    • Author: queries entries from a specific author. Leave empty to query by all authors.
    • From: define the time frame of publication.
    • Query: enter the query. Advanced search: enables you to construct complex queries. See PubMed’s website to learn how to construct such queries. You can also copy-paste constructed queries from the website.
  • Find records finds available data from PubMed matching the query. Number of records found will be displayed above the button.
  • Define the output. All checked features will be on the output of the widget.
  • Set the number of record you wish to retrieve. Press Retrieve records to get results of your query on the output. Below the button is an information on the number of records on the output.

Contoh

PubMed can be used just like any other data widget. In this example we’ve queried the database for records on orchids. We retrieved 1000 records and kept only ‘abstract’ in our meta features to limit the construction of tokens only to this feature.

Pubmed-Example.png

We used Preprocess Text to remove stopword and words shorter than 3 characters (regexp \b\w{1,2}\b). This will perhaps get rid of some important words denoting chemicals, so we need to be careful with what we filter out. For the sake of quick inspection we only retained longer words, which are displayed by frequency in Word Cloud.



Referensi

Pranala Menarik