Difference between revisions of "Orange: Sieve Diagram"

From OnnoWiki
Jump to navigation Jump to search
Line 5: Line 5:
 
Plots a sieve diagram for a pair of attributes.
 
Plots a sieve diagram for a pair of attributes.
  
Inputs
+
==Input==
  
    Data: input dataset
+
Data: input dataset
  
 
A Sieve Diagram is a graphical method for visualizing frequencies in a two-way contingency table and comparing them to expected frequencies under assumption of independence. It was proposed by Riedwyl and Schüpbach in a technical report in 1983 and later called a parquet diagram (Riedwyl and Schüpbach 1994). In this display, the area of each rectangle is proportional to the expected frequency, while the observed frequency is shown by the number of squares in each rectangle. The difference between observed and expected frequency (proportional to the standard Pearson residual) appears as the density of shading, using color to indicate whether the deviation from independence is positive (blue) or negative (red).
 
A Sieve Diagram is a graphical method for visualizing frequencies in a two-way contingency table and comparing them to expected frequencies under assumption of independence. It was proposed by Riedwyl and Schüpbach in a technical report in 1983 and later called a parquet diagram (Riedwyl and Schüpbach 1994). In this display, the area of each rectangle is proportional to the expected frequency, while the observed frequency is shown by the number of squares in each rectangle. The difference between observed and expected frequency (proportional to the standard Pearson residual) appears as the density of shading, using color to indicate whether the deviation from independence is positive (blue) or negative (red).
Line 13: Line 13:
 
[[File:SieveDiagram-stamped.png|center|200px|thumb]]
 
[[File:SieveDiagram-stamped.png|center|200px|thumb]]
  
    Select the attributes you want to display in the sieve plot.
+
* Select the attributes you want to display in the sieve plot.
    Score combinations enables you to fin the best possible combination of attributes.
+
* Score combinations enables you to fin the best possible combination of attributes.
    Save Image saves the created image to your computer in a .svg or .png format.
+
* Save Image saves the created image to your computer in a .svg or .png format.
    Produce a report.
+
* Produce a report.
  
 
The snapshot below shows a sieve diagram for the Titanic dataset and has the attributes sex and survived (the latter is a class attribute in this dataset). The plot shows that the two variables are highly associated, as there are substantial differences between observed and expected frequencies in all of the four quadrants. For example, and as highlighted in the balloon, the chance for surviving the accident was much higher for female passengers than expected (0.06 vs. 0.15).
 
The snapshot below shows a sieve diagram for the Titanic dataset and has the attributes sex and survived (the latter is a class attribute in this dataset). The plot shows that the two variables are highly associated, as there are substantial differences between observed and expected frequencies in all of the four quadrants. For example, and as highlighted in the balloon, the chance for surviving the accident was much higher for female passengers than expected (0.06 vs. 0.15).

Revision as of 08:57, 28 January 2020

Sumber: https://docs.biolab.si//3/visual-programming/widgets/visualize/sievediagram.html


Plots a sieve diagram for a pair of attributes.

Input

Data: input dataset

A Sieve Diagram is a graphical method for visualizing frequencies in a two-way contingency table and comparing them to expected frequencies under assumption of independence. It was proposed by Riedwyl and Schüpbach in a technical report in 1983 and later called a parquet diagram (Riedwyl and Schüpbach 1994). In this display, the area of each rectangle is proportional to the expected frequency, while the observed frequency is shown by the number of squares in each rectangle. The difference between observed and expected frequency (proportional to the standard Pearson residual) appears as the density of shading, using color to indicate whether the deviation from independence is positive (blue) or negative (red).

SieveDiagram-stamped.png
  • Select the attributes you want to display in the sieve plot.
  • Score combinations enables you to fin the best possible combination of attributes.
  • Save Image saves the created image to your computer in a .svg or .png format.
  • Produce a report.

The snapshot below shows a sieve diagram for the Titanic dataset and has the attributes sex and survived (the latter is a class attribute in this dataset). The plot shows that the two variables are highly associated, as there are substantial differences between observed and expected frequencies in all of the four quadrants. For example, and as highlighted in the balloon, the chance for surviving the accident was much higher for female passengers than expected (0.06 vs. 0.15).

SieveDiagram-Titanic.png

Pairs of attributes with interesting associations have a strong shading, such as the diagram shown in the above snapshot. For contrast, a sieve diagram of the least interesting pair (age vs. survival) is shown below.

SieveDiagram-Titanic-age-survived.png

Contoh

Below, we see a simple schema using the Titanic dataset, where we use the Rank widget to select the best attributes (the ones with the highest information gain, gain ratio or Gini index) and feed them into the Sieve Diagram. This displays the sieve plot for the two best attributes, which in our case are sex and status. We see that the survival rate on the Titanic was very high for women of the first class and very low for female crew members.

SieveDiagram-Example2.PNG

The Sieve Diagram also features the Score Combinations option, which makes the ranking of attributes even easier.

SieveDiagram-Example1.PNG


Referensi

Riedwyl, H., and Schüpbach, M. (1994). Parquet diagram to plot contingency tables. In Softstat ‘93: Advances in Statistical Software, F. Faulbaum (Ed.). New York: Gustav Fischer, 293-299.


Referensi

Pranala Menarik