Unlocking Insights: Constructing a Scorecard with Logistic Regression | by Vassily Morozov

Machine Learning

Unlocking Insights: Constructing a Scorecard with Logistic Regression | by Vassily Morozov | Feb, 2024

hhhhm

2024年2月15日

Unlocking Insights: Constructing a Scorecard with Logistic Regression | by Vassily Morozov | Feb, 2024

[ad_1]

After a bank card? An insurance coverage coverage? Ever puzzled concerning the three-digit quantity that shapes these choices?

Introduction

Scores are utilized by numerous industries to make choices. Monetary establishments and insurance coverage suppliers are utilizing scores to find out whether or not somebody is true for credit score or a coverage. Some nations are even utilizing social scoring to find out a person’s trustworthiness and choose their behaviour.

For instance, earlier than a rating was used to make an computerized resolution, a buyer would go right into a financial institution and communicate to an individual relating to how a lot they need to borrow and why they want a mortgage. The financial institution worker might impose their very own ideas and biases into their decision-making course of. The place is that this individual from? What are they sporting? Even, how do I really feel at this time?

A rating ranges the enjoying subject and permits everybody to be assessed on the identical foundation.

Just lately, I’ve been participating in a number of Kaggle competitions and analyses of featured datasets. The primary playground competitors of 2024 aimed to find out the probability of a buyer leaving a financial institution. It is a frequent process that’s helpful for advertising departments. For this competitors, I assumed I might put apart the tree-based and ensemble modelling strategies usually required to be aggressive in these duties, and return to the fundamentals: a logistic regression.

Right here, I’ll information you thru the event of the logistic regression mannequin, its conversion right into a rating, and its presentation as a scorecard. The purpose of doing that is to indicate how this may reveal insights about your information and its relationship to a binary goal. The benefit of this sort of mannequin is that it’s easier and simpler to clarify, even to non-technical audiences.

My Kaggle pocket book with all my code and maths could be discovered right here. This text will concentrate on the highlights.

What’s a Rating?

The rating we’re describing right here relies on a logistic regression mannequin. The mannequin assigns weights to our enter options and can output a chance that we are able to convert by a calibration step right into a rating. As soon as we now have this, we are able to symbolize it with a scorecard: displaying how a person is scoring based mostly on their accessible information.

Let’s undergo a easy instance.

Mr X walks right into a financial institution in search of mortgage for a brand new enterprise. The financial institution makes use of a easy rating based mostly on earnings and age to find out whether or not the person must be accepted.