Machine Learning full-time. Contributes to Julia, Python and Go’s libraries as volunteer work, along with moderating the Devops and DataScience sites of StackOverflow
of Regression Analysis • How can one leverage the age-old practice of regression analysis in the Deep Learning world • Doing data-centric data science, and not model-centric.
data. Models become obsolete everyday • Advanced models require expensive resources • Model might become obsolete, but the data-driven domain knowledge doesn’t
relationships among variables. • It helps one understand how the typical value of the dependant variable changes when any of the independent variables is varied, while the other independent variables are fixed What exactly is Regression Analysis?
Lesser the value, greater the statistical significance of the variable. Probability that the variable is not relevant in explaining the independent variable • T-value: The higher the t-value, the higher the confidence in rejecting the null hypothesis. It is the difference between population’s mean and the null mean • F-value: The higher the test value. The farthest the distribution is from the null hypothesis distribution. It is more like a scaled out version of a T-value because it can take in more than 1 variable at a time for testing EXPLAINABILITY (GOOD AND UNTOLD)
Polynomial Regression, Spline-based regression analysis • One can also model nonlinear relationships with these advanced algorithms HIGH FLEXIBILITY AND VARIETY
• IT IS A CRITICAL MISSING STEP IN MODERN DAY DATA SCIENCE PIPELINE • MAKING DATA-CENTRIC DATA SCIENCE GREAT AGAIN • BECOMING BETTER DATA SCIENTISTS/ML ENGINEERS TAKEAWAYS