Orange: Misclassifications
Sumber: https://orange.biolab.si/workflows/
Cross-validation dari, misalnya, logistic regression dapat mengekspos instance data yang salah klasifikasi. Ada enam contoh untuk dataset iris dan ridge-regularized logistic regression. Kita dapat memilih berbagai jenis kesalahan klasifikasi dalam Confusion Matrix dan menayangkannya dalam Scatter Plot. Tidak mengherankan: contoh kesalahan klasifikasi berada dekat / wilayah daerah yang class-nya berbatasan terlihat di scatter plot projection.
The image represents an Orange Data Mining workflow designed for evaluating a classification model using logistic regression and visualizing misclassifications.
Workflow Breakdown:
1. File (Data Input)
- This node loads the dataset (likely the Iris dataset) as input for further processing.
2. Learner (Logistic Regression)
- The logistic regression model is used as the classification algorithm.
- The note in green suggests that this model can be replaced with any other classification method.
3. Test & Score
- This node is responsible for evaluating the logistic regression model.
- It measures the model’s performance using various metrics like accuracy, precision, recall, and F1-score.
4. Confusion Matrix
- Displays the types of misclassifications in the dataset.
- The note in red highlights that for the Iris dataset, Iris virginica and Iris versicolor are often confused with each other.
5. Scatter Plot
- Visualizes the data distribution and classification results.
- The note suggests that misclassifications are best seen in petal length vs. petal width projection.
Purpose of the Workflow:
- The workflow trains a logistic regression classifier on the Iris dataset.
- It evaluates performance using a confusion matrix.
- Misclassifications are visualized using a scatter plot.
- Users can swap the classification method to compare different algorithms.
This workflow helps in understanding classification errors and improving model performance by selecting better feature projections or classification methods.
Source
- https://service.biolab.si/download/workflow?name=470-misclassification-scatterplot.ows&domain=orange