Skip to main content

Types of Machine Learning problems

In the previous blog, we had discussed brief about What is Machine Learning? In this blog, we are going to learn about the types of ML. 

ML is broadly classified into four types:
  • Supervised Learning
  • Unsupervised Learning
  • Semi-supervised Learning
  • Reinforcement Learning
1. Supervised Learning
  • Supervised learning is where there are input variables, say X and there are corresponding output variables, say Y. We use a particular algorithm to map a function from input(X) to output(Y).
  • Mathematically, Y=f(X).
  • Majority of the ML models use this type of learning to feed itself and learn.
  • The goal of supervised learning is to approximate the said function so well that whenever we enter any new input, it's output is accurately predicted.
  • Here, we can say that there is a teacher who guides the model if it generates incorrect results and hence, the machine will keep on learning until it performs to desired results.
  • Supervised Learning can be further classified into:
    • Classification: Here, the output variable is a category. Example: diseased or not diseased, cooking or not cooking, etc.
    • Regression: Here, the output variable is a real value. Example: rupee, dollar, etc.
  • Few supervised learning algorithms:
    • Linear Regression (For regression)
    • Random Forest (For regression and classification)
    • Support Vector Machine (For classification) 
Figure 1: Supervised Learning

Figure 2: Classification and Regression in ML


2. Unsupervised Learning
  • Unsupervised Learning is where we have input variable(X) but no output variable(Y).
  • The goal is to model the underlying structure in data in order to learn more about the data.
  • Here, as there is no output pre-determined, there is no teacher and no correct values unlike supervised learning.
  • The algorithm is own it's own to discover the correct outputs and learn own it's own.
  • Unsupervised learning can be further classified into:
    • Clustering: Used when we want to find the group of the date. Example: Group of stocks listed in Nifty50, Group of people who are Corona positive, etc.
    • Association: Used when we wan to find rules that describes a large portion of data. Example: People who buys stock X will tend to buy stock Y, People who purchases X will tend to purchase Y, etc.
  • Few unsupervised Learning algorithms:
    • K-means (For clustering)
    • Apriori Algorithm (For association)
Figure 3: Unsupervised Learning

Figure 4: Clustering and Association ML


3. Semi-supervised Learning
  • In this type, we have a large collection of input data(X) but few collection of output data(Y).
  • This type sits between supervised learning and unsupervised learning.
  • Example: Photo gallery, where few are labelled pictures and rest are unlabeled. 
  • Labeling data is time-consuming, tedious and expensive while unlabeled data are cheap and easy to get.
Figure 5: Semi-supervised Learning

 4. Reinforcement Learning:
  • Here, the program interacts with dynamic environment, in which it plays certain role.
  • Program provides feedback of output if it is correct or not and machine learns through it.
Figure 6: Reinforcement Learning

That's all about the types of ML and it's learning.
Hope it was worth investing time!

For further queries related to the blog, you can ask in the comments.

Comments

  1. Thank you for sharing such a useful article. I had a great time. This article was fantastic to read. Continue to publish more articles on

    Data Engineering Solutions 

    Data Analytics Service Provider

    Data Modernization Services

    Machine Learning Services

    ReplyDelete
  2. I appreciate you taking the time and effort to share your knowledge. This material proved to be really efficient and beneficial to me. Continue to publish more articles on
    Low Code App Development Company
    Mendix Solutions
    Software Testing Services
    Test Automation Services
    Product Engineering Services

    ReplyDelete
  3. I recommend everyone to read this blog, as it contains some of the best ever content you will find on data science. The best part is that the writer has presented the information in an attractive and engaging manner. Every line gives you something new to learn, and this itself tells volumes about the quality of the information presented here.
    data science certification course in hyderabad

    ReplyDelete

Post a Comment

Popular posts from this blog

Statistics in Data Science

Introduction Statistics is one of the popularly regarded disciplines this is particularly centered on records collection, records organization, records analysis, records interpretation and records visualization. Earlier, facts become practiced through statisticians, economists, enterprise proprietors to calculate and constitute applicable records of their field. Nowadays, facts have taken a pivotal position in diverse fields like records technology, system learning, records analyst position, enterprise intelligence analyst position, pc technology position, and plenty more. Statistics is a type of mathematical analysis that uses quantified models and representations to analyze a set of experimental data or real-world research. The fundamental benefit of statistics is that information is provided in an easy-to-understand style. Statistical & Non-Statistical Analysis Statistical analysis is used to better understand a wider population by analyzing data from a sample. Statistical analy

COVID19 Analysis using Power BI Desktop

Analysis of the data before running into predictions is very important. Understand a few rows and a few columns is very nominal task and we can easily examine the data. However, with a little larger data, suppose 10,000 rows with 50 columns, we really need to do analysis of the data so that we can come to know which factors are going to affect our prediction. Data Analysis with Python is a bit tedious task as we have to prepare the data i.e. cleaning, pre-processing and normalization. We use Seaborn and Matplotlib for our data visualization. But before plotting the graphs, we need to know which columns are inter-related. For that, we need a co-relation matrix which we can create using Python. However, PPS matrix is more better than co-relation matrix. Fig1: Co-relation Matrix of Covid19 dataset Fig2: PPS Matrix of Covid19 dataset It is always a tedious task when we code for Data Analysis. So, we have certain tools available in the market for it like Power BI, Tableau, etc. I have done