Orange: PCA

From OnnoWiki

Revision as of 07:39, 24 January 2020

Source: https://docs.biolab.si//3/visual-programming/widgets/unsupervised/PCA.html


PCA linear transformation of input data.

Inputs

   Data: input dataset

Outputs

   Transformed Data: PCA transformed data
   Components: eigenvectors

Principal Component Analysis (PCA) computes the PCA linear transformation of the input data. It outputs both a transformed dataset, containing the weights of individual instances, and the weights of the principal components.
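
As a rough illustration of the transformation described above, the following sketch computes PCA with NumPy by centering the data and taking a singular value decomposition. The function name `pca`, its parameters, and the toy data are assumptions made for this example; this is not the widget's actual implementation.

```python
import numpy as np


def pca(X, n_components):
    """Return (scores, components, explained_variance_ratio) for data matrix X."""
    X = np.asarray(X, dtype=float)
    # Center each column; PCA is defined on mean-centered data.
    centered = X - X.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    # Project each instance onto the kept principal directions.
    scores = centered @ components.T
    # Variance per component is proportional to the squared singular value.
    variance = singular_values ** 2
    ratio = variance[:n_components] / variance.sum()
    return scores, components, ratio


# Hypothetical toy data: 6 instances, 3 features.
X = [[2.5, 2.4, 1.0], [0.5, 0.7, 0.9], [2.2, 2.9, 1.1],
     [1.9, 2.2, 1.0], [3.1, 3.0, 1.2], [2.3, 2.7, 0.8]]
scores, components, ratio = pca(X, n_components=2)
print(scores.shape, components.shape)
```

Here `scores` plays the role of the transformed data (one row per instance) and `components` plays the role of the eigenvectors (one row per principal component).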

PCA-stamped.png


   Select how many principal components you want in the output. Choose as few components as possible while covering as much variance as possible. Alternatively, set the share of variance the principal components should cover.
   Normalize the data to bring the values to a common scale.
   When Apply Automatically is ticked, the widget will automatically communicate all changes. Alternatively, click Apply.
   Press Save Image if you want to save the created image to your computer.
   Produce a report.
   Principal components graph, where the red (lower) line is the variance covered per component and the green (upper) line is cumulative variance covered by components.
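
The normalization option in the list above can be pictured with a small sketch. It assumes that normalizing means standardizing every column to zero mean and unit standard deviation; the exact scaling used by the widget may differ. The helper `standardize` and the sample values are hypothetical.

```python
from statistics import mean, pstdev


def standardize(rows):
    """Scale every column to zero mean and unit (population) standard deviation."""
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stdevs = [pstdev(col) for col in columns]
    return [
        # Constant columns (zero spread) are mapped to 0.0 to avoid division by zero.
        [(value - m) / s if s > 0 else 0.0
         for value, m, s in zip(row, means, stdevs)]
        for row in rows
    ]


# Hypothetical data: height in metres, weight in grams.
rows = [[1.6, 55000.0], [1.7, 72000.0], [1.8, 80000.0], [1.9, 91000.0]]
scaled = standardize(rows)
for row in scaled:
    print(row)
```

Without such scaling, the weight column would dominate the variance simply because its numbers are larger.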

The number of components of the transformation can be selected either in the Components Selection input box or by dragging the vertical cutoff line in the graph.
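
Choosing the number of components from a variance target, as described above, can be sketched as a cumulative sum over the per-component variance shares. The function name `components_for_variance` and the variance ratios are made up for illustration.

```python
def components_for_variance(variance_ratios, target):
    """Return the smallest number of leading components covering `target` variance."""
    cumulative = 0.0
    for count, ratio in enumerate(variance_ratios, start=1):
        cumulative += ratio
        if cumulative >= target:
            return count
    # Target not reachable: keep every component.
    return len(variance_ratios)


# Illustrative per-component variance ratios (made up, in decreasing order).
ratios = [0.6, 0.25, 0.1, 0.05]
print(components_for_variance(ratios, 0.8))  # 2
print(components_for_variance(ratios, 0.9))  # 3
```

The running total corresponds to the cumulative variance line in the graph, and the returned count corresponds to the position of the cutoff line.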

Example

PCA can simplify visualizations of large datasets. Below, we use the Iris dataset to show how PCA improves the visualization. The transformed data in the Scatter Plot show a much clearer distinction between classes than the plot with default settings.

PCAExample.png

The widget provides two outputs: transformed data and principal components. Transformed data are the weights of individual instances in the new coordinate system, while components are the system descriptors (weights for principal components). When fed into a Data Table, both outputs can be viewed in numerical form. We used two data tables to keep the workflow visualization cleaner, but you can also edit the links to display the data in a single data table. To do so, create two links and connect the Transformed Data and Components outputs to the Data input.
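
How the two outputs relate can be shown with a short NumPy sketch: when all components are kept, the transformed data and the components together recover the original values. The toy data and variable names are assumptions for this example, not output of the widget.

```python
import numpy as np

# Hypothetical toy data: 5 instances, 3 features.
X = np.array([[1.0, 2.0, 0.5],
              [2.0, 4.1, 0.4],
              [3.0, 6.2, 0.6],
              [4.0, 7.9, 0.5],
              [5.0, 10.1, 0.7]])
column_means = X.mean(axis=0)
centered = X - column_means

# Rows of vt are the principal components (one weight per original feature).
_, _, vt = np.linalg.svd(centered, full_matrices=False)
components = vt                        # shape: (n_components, n_features)
transformed = centered @ components.T  # shape: (n_instances, n_components)

# With all components kept, the two tables together recover the input.
reconstructed = transformed @ components + column_means
print(transformed.shape, components.shape)
print(np.allclose(reconstructed, X))
```

The `transformed` table has one row per instance, while the `components` table has one row per principal component and one column per original feature.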

PCAExample2.png



References

Interesting Links