Difference between revisions of "Orange: t-SNE"

From OnnoWiki
Jump to navigation Jump to search
Line 3: Line 3:
 
Two-dimensional data projection with t-SNE.
 
Two-dimensional data projection with t-SNE.
  
Inputs
+
==Input==
  
    Data: input dataset
+
Data: input dataset
    Data Subset: subset of instances
+
Data Subset: subset of instances
  
Outputs
+
==Output==
  
    Selected Data: instances selected from the plot
+
Selected Data: instances selected from the plot
    Data: data with an additional column showing whether a point is selected
+
Data: data with an additional column showing whether a point is selected
  
 
The t-SNE widget plots the data with a t-distributed stochastic neighbor embedding method. t-SNE is a dimensionality reduction technique, similar to MDS, where points are mapped to 2-D space by their probability distribution.
 
The t-SNE widget plots the data with a t-distributed stochastic neighbor embedding method. t-SNE is a dimensionality reduction technique, similar to MDS, where points are mapped to 2-D space by their probability distribution.
Line 17: Line 17:
 
[[File:TSNE-stamped.png|center|200px|thumb]]
 
[[File:TSNE-stamped.png|center|200px|thumb]]
  
    Number of iterations for optimization and the measure of perplexity. Press Start to (re-)run the optimization.
+
* Number of iterations for optimization and the measure of perplexity. Press Start to (re-)run the optimization.
    Select the number of PCA components used for projection.
+
* Select the number of PCA components used for projection.
    Set the color of the displayed points (you will get colors for discrete values and grey-scale points for continuous). Set shape, size and label to differentiate between points. Set symbol size and opacity for all data points. Set jittering to randomly disperse data points.
+
* Set the color of the displayed points (you will get colors for discrete values and grey-scale points for continuous). Set shape, size and label to differentiate between points. Set symbol size and opacity for all data points. Set jittering to randomly disperse data points.
    Adjust plot properties:
+
* Adjust plot properties:
        Show legend displays a legend on the right. Click and drag the legend to move it.
+
** Show legend displays a legend on the right. Click and drag the legend to move it.
        Show all data on mouse hover enables information bubbles if the cursor is placed on a dot.
+
** Show all data on mouse hover enables information bubbles if the cursor is placed on a dot.
        Show class density colors the graph by class.
+
** Show class density colors the graph by class.
        Label only selected points allows you to select individual data instances and label them.
+
** Label only selected points allows you to select individual data instances and label them.
    If Send selected automatically is ticked, changes are communicated automatically. Alternatively, press Send Selected.
+
* If Send selected automatically is ticked, changes are communicated automatically. Alternatively, press Send Selected.
    Select, zoom, pan and zoom to fit are the options for exploring the graph. The manual selection of data instances works as an angular/square selection tool. Double click to move the projection. Scroll in or out for zoom.
+
* Select, zoom, pan and zoom to fit are the options for exploring the graph. The manual selection of data instances works as an angular/square selection tool. Double click to move the projection. Scroll in or out for zoom.
    Access help, save image or produce a report.
+
* Access help, save image or produce a report.
  
 
==Contoh==
 
==Contoh==

Revision as of 08:45, 29 January 2020

Sumber: https://docs.biolab.si//3/visual-programming/widgets/unsupervised/tsne.html

Two-dimensional data projection with t-SNE.

Input

Data: input dataset
Data Subset: subset of instances

Output

Selected Data: instances selected from the plot
Data: data with an additional column showing whether a point is selected

The t-SNE widget plots the data with a t-distributed stochastic neighbor embedding method. t-SNE is a dimensionality reduction technique, similar to MDS, where points are mapped to 2-D space by their probability distribution.

TSNE-stamped.png
  • Number of iterations for optimization and the measure of perplexity. Press Start to (re-)run the optimization.
  • Select the number of PCA components used for projection.
  • Set the color of the displayed points (you will get colors for discrete values and grey-scale points for continuous). Set shape, size and label to differentiate between points. Set symbol size and opacity for all data points. Set jittering to randomly disperse data points.
  • Adjust plot properties:
    • Show legend displays a legend on the right. Click and drag the legend to move it.
    • Show all data on mouse hover enables information bubbles if the cursor is placed on a dot.
    • Show class density colors the graph by class.
    • Label only selected points allows you to select individual data instances and label them.
  • If Send selected automatically is ticked, changes are communicated automatically. Alternatively, press Send Selected.
  • Select, zoom, pan and zoom to fit are the options for exploring the graph. The manual selection of data instances works as an angular/square selection tool. Double click to move the projection. Scroll in or out for zoom.
  • Access help, save image or produce a report.

Contoh

We will use Single Cell Datasets widget to load Bone marrow mononuclear cells with AML (sample) data. Then we will pass it through k-Means and select 2 clusters from Silhouette Scores. Ok, it looks like there might be two distinct clusters here.

But can we find subpopulations in these cells? Let us load Bone marrow mononuclear cells with AML (markers) with Single Cell Datasets. Now, pass the marker genes to Data Table and select, for example, natural killer cells from the list (NKG7).

Pass the markers and k-Means results to Score Cells widget and select geneName to match markers with genes. Finally, add t-SNE to visualize the results.

In t-SNE, use Scores attribute to color the points and set their size. We see that killer cells are nicely clustered together and that t-SNE indeed found subpopulations.

TSNE-Example.png



Referensi

Pranala Menarik