DEV Community

Azad Kshitij
Azad Kshitij

Posted on

Machine Learning: Basic Terms

In these series of articles we will talk about ML algorithms and implement them in python and the main focus of these articles is to understand math and logic behind these algorithms. We will start by understanding basic terms of ML.

1. Classification Algorithms

As the name suggest these are classification algorithms that categorizes the data into different category. It is a Supervised Learning technique.

2. Supervised Learning

Supervised learning is associated with learning with supervision or training. In supervised learning, the algorithms are trained with data which is labeled or classified.

3. Unsupervised learning

Unsupervised learning is associated with learning without supervision or training. In unsupervised learning, the algorithms are trained with data which is neither labeled nor classified. In unsupervised learning, the agent needs to learn from patterns without corresponding output values.

4. Cross-Entropy Loss

Is a metric used to measure how well a classification model in machine learning performs. The loss (or error) is measured as a number between 0 and 1, with 0 being a perfect model.

5. Condition positive (P)

The number of real positive cases in the data

6. Condition negative (N)

The number of real negative cases in the data

7. True positive (TP)

A test result that correctly indicates the presence of a condition or characteristic

8. True negative (TN)

A test result that correctly indicates the absence of a condition or characteristic

9. False positive (FP)

A test result which wrongly indicates that a particular condition or attribute is present

10. False negative (FN)

A test result which wrongly indicates that a particular condition or attribute is absent

11. Confusion Matrix

It provides us the result of our model in matrix form (as shown below). We use this to measure accuracy of the mode.

Actual positive Actual Negative
Predicted Positive True Positive False Negative
Predicted Negative False Negative True Negative

12. Area Under the Curve (AUC)

Is the area encapsulated between the curve and x-axis.

Area Under the Curve

13. Receiver Operating Characteristic (ROC)

The receiver operating characteristic (ROC) curve, which is defined as a plot of test sensitivity as the y coordinate versus its 1-specificity or false positive rate (FPR) as the x coordinate

ROC=hitshits+misses ROC = \frac{hits}{hits + misses}

Receiver Operating Characteristic

This image from Wikipedia shows exact definition of ROC.

If you want me to include any other terms in this article comment them and I will do my best to include them.

References

Top comments (0)