Management team > Responsible for developing LINE Score This session will benefit those who are interested in data analytics, especially credit score modeling Hello Developers and Data Scientists!
predictive model usually used in non-financial companies to achieve a specific purpose. > Mostly focusing on efficiency or effectiveness > Indirectly affects businesses or companies Credit Scoring Model The credit scoring model usually used in financial institutions or credit bureaus to evaluate a customer’s creditworthiness. > Mostly focusing on default > Influenced by various regulations > Directly affects the profit or losses of financial institutions
credit scoring model is a traditional evaluation method that usually uses financial data. > Uses financial data for scoring > Mainly used in countries with advanced financial industries > Only for those who have a history of financial transactions in the past Alternative Credit Scoring Model The alternative credit scoring model is an evaluation method that is in the limelight recently. It usually uses non-financial data. > Uses non-financial data for scoring > Available in all countries > Possible for those who don't have a history of financial transactions in the past
can be utilized in various businesses at the same time Creating the LINE Score Platform Data source Financial Institutions Sharing Economy C2C Platforms Reservation Services Rental Businesses Score platform
for loans Application Scoring System Behavior Scoring System > A credit scoring model for evaluating the behavior of existing loan customers Credit Scoring System > A statistical model that utilizes a customer's past history to predict creditworthiness in the future Key Terminology
(Input) ‘Build’ or ‘Develop’ credit scoring model > A process of developing a statistical model that predicts 'defaults' using 'features' Default (Target) > Mainly uses a loan delinquency history of more than 60 days or 90 days as target information Key Terminology
Credit Scoring Model Past default record (y) Past transaction history (x) Fitting the best algorithm to predict default y = f (x) How to Operate a Credit Scoring Model Funtion Loan applicant's future possibility of default Loan applicant's past transaction history Selected features (x) Probability of default (y)
divided into 5 steps as follows: Establishing the purpose of the model and selecting groups or subjects to be used for model development Feature creation & selection Model fitting Scaling & calibration Purpose of model Target definition
divided into 5 steps as follows: Feature creation & selection Model fitting Scaling & calibration Purpose of model Target definition Selecting the target value that needs to be predicted
divided into 5 steps as follows: Feature creation & selection Model fitting Scaling & calibration Purpose of model Target definition Creating and selecting features that are highly predictable and highly correlated with the target
divided into 5 steps as follows: Feature creation & selection Model fitting Scaling & calibration Purpose of model Target definition Exploring the highly predictable and optimal algorithms suitable for the purpose of model development
divided into 5 steps as follows: Feature creation & selection Model fitting Scaling & calibration Purpose of model Target definition Converting to a score or grade form that is easy for customers to understand
Definition > Behavior Scoring Model > Collection Scoring Model > Application Scoring Model > Fraud Detection Model > Early Warning Model → Event of default → Event of default → Possibility of collection → Event of fraud → Event of issue related to credit risk
Type of mobile phone > Number of received or sent calls in a day/week/month/year > The ratio between sending and receiving calls/messages > Main time zone
has also been used to increase predictability Model Fitting Logistic Regression A model usually can be derived in the form of a highly-explainable scorecard > Higher Interpretability > Simple > Relatively small amounts of data > Relatively low performance Machine Learning Algorithm A model usually can be derived in the form of a highly-complex and hard-to- interpret black box > Lower interpretability > Complicated > Requires a large amount of data > Relatively high performance
and discriminability How to Measure Model Performance Stability Indicator The difference in the distribution between the development data set and validation data set, which is used to verify the stability of the features, score, and grade > PSI Discriminative Power A numerical measure of how well rank ordering was done on predicting the target, which is used to show the discriminant power of the features, score, and grade > AUROC (AR) > K-S
100 customers, and 5 customers in default are of the 5th, 10th, 20th, 25th, and 30th lowest scores Example of AUROC Calculation False Positive Rate True Positive Rate Random Model A
100 customers, and 5 customers in default are of the 5th, 10th, 20th, 25th, and 30th lowest scores Example of AUROC Calculation False Positive Rate True Positive Rate Developed Model Random Model A Perfect Model 5 10 20 25 30
100 customers, and 5 customers in default are of the 5th, 10th, 20th, 25th, and 30th lowest scores Example of AUROC Calculation False Positive Rate True Positive Rate Developed Model Area Under the ROC = A+B Random Model A Perfect Model 5 10 20 25 30 B