Quality Metrics for Multi-class Classification Algorithms
For $l$ classes $C_1, \ldots, C_l$, given a vector $X = (x_1, \ldots, x_n)$ of class labels computed at the prediction stage of the classification algorithm and a vector $Y = (y_1, \ldots, y_n)$ of expected class labels, the problem is to evaluate the classifier by computing the confusion matrix and the related quality metrics: precision, error rate, and so on.
QualityMetricsId for multi-class classification is confusionMatrix.
Details
Further definitions use the following notations:
| Notation | Term | Definition |
|---|---|---|
| $tp_i$ | true positive | the number of correctly recognized observations for class $C_i$ |
| $tn_i$ | true negative | the number of correctly recognized observations that do not belong to the class $C_i$ |
| $fp_i$ | false positive | the number of observations that were incorrectly assigned to the class $C_i$ |
| $fn_i$ | false negative | the number of observations that were not recognized as belonging to the class $C_i$ |
The library uses the following quality metrics for multi-class classifiers:
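| Quality Metric | Definition |
|---|---|
| Average accuracy | $\frac{1}{l} \sum_{i=1}^{l} \frac{tp_i + tn_i}{tp_i + fn_i + fp_i + tn_i}$ |
| Error rate | $\frac{1}{l} \sum_{i=1}^{l} \frac{fp_i + fn_i}{tp_i + fn_i + fp_i + tn_i}$ |
| Micro precision ($\mathrm{Precision}_\mu$) | $\frac{\sum_{i=1}^{l} tp_i}{\sum_{i=1}^{l} (tp_i + fp_i)}$ |
| Micro recall ($\mathrm{Recall}_\mu$) | $\frac{\sum_{i=1}^{l} tp_i}{\sum_{i=1}^{l} (tp_i + fn_i)}$ |
| Micro F-score ($\mathrm{F\text{-}score}_\mu$) | $\frac{(\beta^2 + 1)\, \mathrm{Precision}_\mu \, \mathrm{Recall}_\mu}{\beta^2 \, \mathrm{Precision}_\mu + \mathrm{Recall}_\mu}$ |
| Macro precision ($\mathrm{Precision}_M$) | $\frac{1}{l} \sum_{i=1}^{l} \frac{tp_i}{tp_i + fp_i}$ |
| Macro recall ($\mathrm{Recall}_M$) | $\frac{1}{l} \sum_{i=1}^{l} \frac{tp_i}{tp_i + fn_i}$ |
| Macro F-score ($\mathrm{F\text{-}score}_M$) | $\frac{(\beta^2 + 1)\, \mathrm{Precision}_M \, \mathrm{Recall}_M}{\beta^2 \, \mathrm{Precision}_M + \mathrm{Recall}_M}$ |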
For more details of these metrics, including the evaluation focus, refer to [Sokolova09].
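As an illustration of the metric definitions, the following Python sketch computes the eight metrics from per-class counts. It is not the library's implementation: the function name `multiclass_metrics`, the `(tp, tn, fp, fn)` tuple layout, and the sample counts are assumptions made for this example, and the sketch does not guard against zero denominators (for instance, a class that is never predicted).

```python
def multiclass_metrics(counts, beta=1.0):
    """Compute multi-class quality metrics.

    counts: list of (tp, tn, fp, fn) tuples, one tuple per class
            (hypothetical layout chosen for this sketch).
    beta:   the beta parameter of the F-score.
    """
    l = len(counts)

    def f_score(precision, recall):
        b2 = beta ** 2
        return (b2 + 1) * precision * recall / (b2 * precision + recall)

    # Per-class ratios averaged over the l classes.
    average_accuracy = sum(
        (tp + tn) / (tp + tn + fp + fn) for tp, tn, fp, fn in counts) / l
    error_rate = sum(
        (fp + fn) / (tp + tn + fp + fn) for tp, tn, fp, fn in counts) / l

    # Micro averaging: sum the counts first, then take the ratio.
    tp_total = sum(tp for tp, _, _, _ in counts)
    micro_precision = tp_total / sum(tp + fp for tp, _, fp, _ in counts)
    micro_recall = tp_total / sum(tp + fn for tp, _, _, fn in counts)

    # Macro averaging: take the per-class ratio first, then average.
    macro_precision = sum(tp / (tp + fp) for tp, _, fp, _ in counts) / l
    macro_recall = sum(tp / (tp + fn) for tp, _, _, fn in counts) / l

    return {
        "average_accuracy": average_accuracy,
        "error_rate": error_rate,
        "micro_precision": micro_precision,
        "micro_recall": micro_recall,
        "micro_fscore": f_score(micro_precision, micro_recall),
        "macro_precision": macro_precision,
        "macro_recall": macro_recall,
        "macro_fscore": f_score(macro_precision, macro_recall),
    }


# Illustrative counts for three classes over eight observations.
sample_counts = [(2, 6, 0, 0), (2, 4, 1, 1), (2, 4, 1, 1)]
metrics = multiclass_metrics(sample_counts)
```

Micro averaging pools the counts of all classes before dividing, so frequent classes dominate the result; macro averaging gives every class the same weight regardless of its size.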
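The confusion matrix has the following form, where $c_{ij}$ is the number of observations of actual class $C_i$ that were classified as class $C_j$:
| | Classified as Class $C_1$ | $\ldots$ | Classified as Class $C_i$ | $\ldots$ | Classified as Class $C_l$ |
|---|---|---|---|---|---|
| Actual Class $C_1$ | $c_{11}$ | $\ldots$ | $c_{1i}$ | $\ldots$ | $c_{1l}$ |
| $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ |
| Actual Class $C_i$ | $c_{i1}$ | $\ldots$ | $c_{ii}$ | $\ldots$ | $c_{il}$ |
| $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ | $\ldots$ |
| Actual Class $C_l$ | $c_{l1}$ | $\ldots$ | $c_{li}$ | $\ldots$ | $c_{ll}$ |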
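A minimal Python sketch of how such a matrix could be filled from the two label vectors is shown here. It assumes integer labels in the range 0 to l - 1 (0-based, unlike the 1-based class indices in the text); the function name `confusion_matrix` and the sample labels are hypothetical and are not part of the library.

```python
def confusion_matrix(predicted, expected, n_classes):
    """Return an n_classes x n_classes matrix c in which c[i][j] counts
    the observations of actual class i that were classified as class j."""
    if len(predicted) != len(expected):
        raise ValueError("label vectors must have the same length")
    c = [[0] * n_classes for _ in range(n_classes)]
    for x, y in zip(predicted, expected):
        c[y][x] += 1  # row: actual class, column: predicted class
    return c


# Illustrative label vectors for eight observations and three classes.
predicted = [0, 1, 2, 2, 0, 1, 1, 2]
expected = [0, 1, 2, 1, 0, 2, 1, 2]
matrix = confusion_matrix(predicted, expected, 3)
# matrix == [[2, 0, 0], [0, 2, 1], [0, 1, 2]]
```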
The positives and negatives are defined through elements of the confusion matrix as follows:
$$tp_i = c_{ii}$$

$$fp_i = \sum_{j=1}^{l} c_{ji} - tp_i$$

$$fn_i = \sum_{j=1}^{l} c_{ij} - tp_i$$

$$tn_i = \sum_{j=1}^{l} \sum_{k=1}^{l} c_{jk} - tp_i - fp_i - fn_i$$
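These relations can be sketched in Python as follows, assuming the matrix is stored as a list of rows with actual classes in rows and predicted classes in columns. The function name `per_class_counts` and the sample matrix are illustrative assumptions, not library code.

```python
def per_class_counts(c):
    """Return a list of (tp, tn, fp, fn) tuples, one per class,
    derived from the square confusion matrix c."""
    l = len(c)
    total = sum(sum(row) for row in c)
    counts = []
    for i in range(l):
        tp = c[i][i]                               # diagonal element
        fp = sum(c[j][i] for j in range(l)) - tp   # column sum minus tp
        fn = sum(c[i]) - tp                        # row sum minus tp
        tn = total - tp - fp - fn                  # everything else
        counts.append((tp, tn, fp, fn))
    return counts


# Illustrative 3 x 3 confusion matrix.
sample_matrix = [[2, 0, 0], [0, 2, 1], [0, 1, 2]]
class_counts = per_class_counts(sample_matrix)
# class_counts == [(2, 6, 0, 0), (2, 4, 1, 1), (2, 4, 1, 1)]
```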
Batch Processing
Algorithm Input
The quality metric algorithm for multi-class classifiers accepts the input described below. Pass the Input ID as a parameter to the methods that provide input for your algorithm. For more details, see Algorithms.
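| Input ID | Input |
|---|---|
| predictedLabels | Pointer to the $n \times 1$ numeric table ($n$ is the number of observations) that contains the class labels computed at the prediction stage of the classification algorithm. This input can be an object of any class derived from NumericTable except PackedSymmetricMatrix, PackedTriangularMatrix, and CSRNumericTable. |
| groundTruthLabels | Pointer to the $n \times 1$ numeric table that contains the expected class labels. This input can be an object of any class derived from NumericTable except PackedSymmetricMatrix, PackedTriangularMatrix, and CSRNumericTable. |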
Algorithm Parameters
The quality metric algorithm has the following parameters:
| Parameter | Default Value | Description |
|---|---|---|
| algorithmFPType | float | The floating-point type that the algorithm uses for intermediate computations. Can be float or double. |
| method | defaultDense | Performance-oriented computation method, the only method supported by the algorithm. |
| nClasses | 0 | The number of classes ($l$). |
| useDefaultMetrics | true | A flag that indicates whether to compute the default metrics provided by the library. |
| beta | 1 | The $\beta$ parameter of the F-score. |
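To show what the beta parameter controls, the short Python sketch below evaluates the F-score formula for a fixed precision and recall at several beta values. The helper `f_beta` and the sample values are assumptions for illustration only and do not reflect how the library computes the score internally.

```python
def f_beta(precision, recall, beta=1.0):
    """F-score: (beta^2 + 1) * P * R / (beta^2 * P + R)."""
    b2 = beta ** 2
    return (b2 + 1) * precision * recall / (b2 * precision + recall)


# Illustrative values: recall is higher than precision.
precision, recall = 0.5, 0.8
scores = {beta: f_beta(precision, recall, beta) for beta in (0.5, 1.0, 2.0)}

for beta, score in scores.items():
    print(f"beta={beta}: F-score={score:.4f}")
```

With beta equal to 1, precision and recall are weighted equally. A beta above 1 moves the score toward recall, and a beta below 1 moves it toward precision.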
Algorithm Output
The quality metric algorithm calculates the result described below. Pass the Result ID as a parameter to the methods that access the results of your algorithm. For more details, see Algorithms.
| Result ID | Result |
|---|---|
| confusionMatrix | Pointer to the $l \times l$ numeric table with the confusion matrix. NOTE: By default, this result is an object of the HomogenNumericTable class, but you can define the result as an object of any class derived from NumericTable except PackedTriangularMatrix, PackedSymmetricMatrix, and CSRNumericTable. |
| multiClassMetrics | Pointer to the numeric table that contains the quality metrics listed in the Details section. NOTE: By default, this result is an object of the HomogenNumericTable class, but you can define the result as an object of any class derived from NumericTable except PackedTriangularMatrix, PackedSymmetricMatrix, and CSRNumericTable. |