OS@IBM - The Machine in Sheep’s Clothing: Trust and Transparency in the ML Lifecycle

Tools/Communities:
AI Fairness 360 Toolkit: http://aif360.mybluemix.net/
Model Asset Exchange: http://ibm.biz/model-exchange
IBM's Center for Open Source Data and AI Technologies: http://ibm.biz/codait-projects
IBM's Internal Call for Code Competition: https://w3.ibm.com/developer/callforcode/
IBM's Public Call for Code Competition: https://callforcode.org/

Talk Sources:

Podcasts/Tweets
https://leanin.org/podcast-episodes/siri-is-artificial-intelligence-biased
https://art19.com/shows/the-ezra-klein-show/episodes/663fd0b7-ee60-4e3e-b2cb-4fcb4040eef1
https://twitter.com/alexisohanian/status/1087973027055316994

Amazon
https://www.aclu.org/blog/privacy-technology/surveillance-technologies/amazons-face-recognition-falsely-matched-28
https://www.openmic.org/news/2019/1/16/halt-rekognition

Google
https://motherboard.vice.com/en_us/article/j5jmj8/google-artificial-intelligence-bias

COMPAS
https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
https://www.technologyreview.com/s/612775/algorithms-criminal-justice-ai/

Data for Black Lives
http://d4bl.org/about.html
2019 Conference Notes: https://docs.google.com/document/d/1E1mfgTp73QFRmNBunl8cIpyUmDos28rekidux0voTsg/edit?ts=5c39f92e

Gender Shades Project
http://news.mit.edu/2018/study-finds-gender-skin-type-bias-artificial-intelligence-systems-0212
https://www.youtube.com/watch?time_continue=1&v=TWWsW1w-BVo
https://www.ajlunited.org/fight

Other resources referenced in this talk:
https://www.nytimes.com/2018/02/12/business/computer-science-ethics-courses.html
https://www.vox.com/science-and-health/2017/4/17/15322378/how-artificial-intelligence-learns-how-to-be-racist
https://www.engadget.com/2019/01/24/pinterest-skin-tone-search-diversity/

Maureen McElaney

May 08, 2019

Transcript

  1. Open Source @ IBM
    Blueprint Talk
    The Machine in Sheep’s
    Clothing
    Trust and Transparency in
    the ML Lifecycle
    May 8th, 2019

  2. Open Source @ IBM
    May 8, 2019 / © 2019 IBM Corporation 2
    JEFFREY BOREK
    WW Program Director, Open Technology
    & IP Mgt.
    IBM Cognitive Applications
    [email protected]
    @jeffborek

  3. The Machine in Sheep’s Clothing
    Trust and Transparency in the ML
    Lifecycle
    Maureen McElaney
    Developer Advocate
    IBM Center for Open Source Data and AI
    Technologies
    [email protected]
    @Mo_Mack

  4. The Machine in Sheep’s
    Clothing
    Building Trust and Transparency
    into the ML Lifecycle


  6. “A cognitive bias is a systematic pattern
    of deviation from norm or rationality in
    judgment. Individuals create their own
    "subjective social reality" from their
    perception of the input.”
    - Wikipedia


  8. Examples of bias in machine
    learning.

  9. Google’s Cloud
    Natural
    Language API
    Image Credit: #WOCinTech

  10. October 2017 - Google Natural Language API
    https://cloud.google.com/natural-language/
    Source: https://motherboard.vice.com/en_us/article/j5jmj8/google-artificial-intelligence-bias

  13. “We will correct this
    specific case, and, more
    broadly, building more
    inclusive algorithms is
    crucial to bringing the
    benefits of machine
    learning to everyone.”

  14. Northpointe’s
    COMPAS
    Algorithm
    Image Credit: #WOCinTech

  15. Source: https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
    May 2016 - Northpointe’s COMPAS Algorithm
    http://www.equivant.com/solutions/inmate-classification

  19. Source: https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
    Black Defendants’ Risk Scores

  20. Source: https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
    White Defendants’ Risk Scores

  21. BLACK VS. WHITE
    DEFENDANTS
    ○ Falsely labeled black defendants as likely
    to commit future crimes at twice the rate
    of white defendants.
    ○ Mislabeled white defendants as low risk
    more often than black defendants.
    ○ Flagged black defendants as 77% more likely
    to commit future violent crimes.
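The first disparity above is a gap in false positive rates between groups. A minimal sketch of that calculation, using made-up toy numbers rather than the real COMPAS data:

```python
# Toy illustration of a ProPublica-style analysis: compare how often
# defendants who did NOT reoffend were nonetheless labeled high risk.
# All numbers below are invented for the example.

def false_positive_rate(predicted_high_risk, reoffended):
    """Share of non-reoffenders (reoffended == 0) labeled high risk."""
    preds_for_negatives = [
        p for p, y in zip(predicted_high_risk, reoffended) if y == 0
    ]
    return sum(preds_for_negatives) / len(preds_for_negatives)

# 1 = labeled high risk (predictions) / 1 = actually reoffended (truth)
group_a_pred, group_a_true = [1, 1, 1, 0, 1], [0, 1, 0, 0, 1]
group_b_pred, group_b_true = [0, 1, 0, 0, 1], [0, 1, 0, 1, 1]

print(false_positive_rate(group_a_pred, group_a_true))  # ≈ 0.67
print(false_positive_rate(group_b_pred, group_b_true))  # 0.0
```

A large gap between the two groups' rates is exactly the kind of disparity the ProPublica analysis reported.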


  23. Amazon
    Rekognition
    Image Credit: #WOCinTech

  24. July 2018 - Amazon Rekognition
    https://aws.amazon.com/rekognition/
    Source: https://www.aclu.org/blog/privacy-technology/surveillance-technologies/amazons-face-recognition-falsely-matched-28

  25. April 2019 - Amazon Rekognition
    https://aws.amazon.com/rekognition/
    Source: https://www.openmic.org/news/2019/4/4/a-win-for-shareholders-amazon

  26. Joy Buolamwini,
    Algorithmic Justice League
    Gender Shades Project
    Released February 2018

  28. “If we fail to make
    ethical and inclusive
    artificial intelligence
    we risk losing gains
    made in civil rights
    and gender equity
    under the guise of
    machine neutrality.”
    - Joy Buolamwini
    @jovialjoy

  29. Solutions?
    What can we do
    to combat bias
    in AI?

  30. https://www.vox.com/ezra-klein-show-podcast

  31. “Coders are the
    most empowered
    laborers that have
    ever existed.”
    - Anil Dash
    @anildash

  32. EDUCATION IS
    KEY
    Image Credit: #WOCinTech

  33. https://www.nytimes.com/2018/02/12/business/computer-science-ethics-courses.html

  34. Questions posed to students in these courses:
    ○ Is the technology fair?
    ○ How do you make sure that the data is not biased?
    ○ Should machines be judging humans?

  35. https://twitter.com/Neurosarda/status/1084198368526680064

  36. FIX THE
    PIPELINE?
    Image Credit: #WOCinTech

  37. “Cognitive bias in
    machine learning is
    human bias on
    steroids.”
    - Rediet Abebe
    @red_abebe

  38. January 2019 - New Search Feature on...
    https://www.pinterest.com
    Source: https://www.engadget.com/2019/01/24/pinterest-skin-tone-search-diversity/

  39. “By combining the
    latest in machine
    learning and inclusive
    product development,
    we're able to directly
    respond to Pinner
    feedback and build a
    more useful product.”
    - Candice Morgan
    @Candice_MMorgan

  40. TOOLS TO
    COMBAT BIAS
    Image Credit: #WOCinTech

  41. Tool #1:
    AI Fairness
    360 Toolkit
    Open Source Library

  42. http://aif360.mybluemix.net/


  44. TYPES OF METRICS
    ○ Individual vs. Group Fairness, or Both
    ○ Group Fairness: Data vs Model
    ○ Group Fairness: We’re All Equal vs What
    You See is What You Get
    ○ Group Fairness: Ratios vs Differences
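The difference and ratio flavors of group fairness mentioned above are simple arithmetic over favorable-outcome rates per group. A plain-Python sketch of two common metrics, with invented data (this is not the AIF360 API itself):

```python
# Group fairness over favorable-outcome rates:
#   statistical parity difference: P(fav | unpriv) - P(fav | priv)
#   disparate impact:              P(fav | unpriv) / P(fav | priv)

def favorable_rate(outcomes, groups, group):
    """Fraction of members of `group` receiving the favorable outcome (1)."""
    selected = [o for o, g in zip(outcomes, groups) if g == group]
    return sum(selected) / len(selected)

def statistical_parity_difference(outcomes, groups, privileged, unprivileged):
    return (favorable_rate(outcomes, groups, unprivileged)
            - favorable_rate(outcomes, groups, privileged))

def disparate_impact(outcomes, groups, privileged, unprivileged):
    return (favorable_rate(outcomes, groups, unprivileged)
            / favorable_rate(outcomes, groups, privileged))

# 1 = favorable outcome (e.g. "low risk"); toy example data
outcomes = [1, 1, 1, 0, 1, 0, 0, 1, 0, 0]
groups = ["A", "A", "A", "A", "A", "B", "B", "B", "B", "B"]

print(statistical_parity_difference(outcomes, groups, "A", "B"))  # ≈ -0.6
print(disparate_impact(outcomes, groups, "A", "B"))               # ≈ 0.25
```

A statistical parity difference of 0 (or a disparate impact of 1) means both groups receive the favorable outcome at the same rate; the common "four-fifths rule" flags a disparate impact below 0.8.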


  47. Machine Learning Pipeline
    ○ Pre-Processing: modifying the training data.
    ○ In-Processing: modifying the learning algorithm.
    ○ Post-Processing: modifying the predictions (or outcomes).

  49. http://aif360.mybluemix.net/
    Demos

  50. https://github.com/IBM/AIF360
    AI Fairness 360 Toolkit Public Repo

  51. http://aif360.mybluemix.net/community
    AI Fairness 360 Toolkit Slack

  52. Tool #2:
    Model Asset
    Exchange
    Open Source Pre-Trained
    Deep Learning Models

  53. Step 1: Find a model
    ...that does what you need
    ...that is free to use
    ...that is performant enough

  54. Step 2: Get the code
    Is there a good implementation available?
    ...that does what you need
    ...that is free to use
    ...that is performant enough

  55. Step 3: Verify the
    model
    ○ Does it do what you need?
    ○ Is it free to use (license)?
    ○ Is it performant enough?
    ○ Accuracy?

  56. Step 4: Train the model

  58. Step 5: Deploy your
    model
    ○ Adjust inference code (or write from
    scratch)
    ○ Package inference code, model code, and
    pre-trained weights together
    ○ Deploy your package
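A minimal sketch of what the packaged service can look like, using only the Python standard library. The predict() function and the /model/predict route are illustrative stand-ins (MAX model services expose a similar POST /model/predict endpoint, but this is not their actual code):

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def predict(text):
    # Stand-in for real inference code plus pre-trained weights.
    return {"label": "positive" if "good" in text.lower() else "negative"}

class PredictHandler(BaseHTTPRequestHandler):
    """Wraps the packaged model behind a small JSON-over-HTTP endpoint."""

    def do_POST(self):
        if self.path != "/model/predict":
            self.send_error(404)
            return
        body = self.rfile.read(int(self.headers["Content-Length"]))
        result = predict(json.loads(body)["text"])
        payload = json.dumps({"status": "ok", "predictions": [result]}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(payload)

# To deploy: HTTPServer(("0.0.0.0", 5000), PredictHandler).serve_forever()
```

A client then consumes the model (step 6 below) with a single POST of JSON to /model/predict.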

  59. Step 6: Consume your
    model

  60. Model Asset
    Exchange
    The Model Asset Exchange (MAX) is a
    one-stop shop for developers and data
    scientists to find and use free and open
    source deep learning models.
    ibm.biz/model-exchange

  61. Model Asset Exchange
    ○ Wide variety of domains (text, audio,
    images, etc.)
    ○ Multiple deep learning frameworks
    ○ Vetted and tested code/IP
    ○ Build and deploy a model web service in
    seconds

  62. ibm.biz/model-exchange

  63. http://ibm.biz/model-exchange
    http://ibm.biz/max-slack
    Model Asset eXchange (MAX)

  64. Dedicated Open
    Source Efforts
    IBM Center for
    Open-Source Data and AI
    Technologies (CODAIT)

  65. Center for Open
    Source Data and AI
    Technologies
    CODAIT aims to make AI
    solutions dramatically easier
    to create, deploy, and manage
    in the enterprise.
    40 open source developers!
    codait.org

  66. Improving Enterprise AI lifecycle in Open
    Source

  67. That’s a lot of open source developers!
    Or is it?

  68. Active IBM Contributors to Open Source
    (Committed code on Github in 2018)

  69. Active IBM Users of Open Source
    (Certified to consume and/or contribute open source
    in 2018)

  70. http://ibm.biz/codait-projects
    IBM Center for Open Source Data and
    AI Technologies (CODAIT)

  71. UPDATE TO
    THE GENDER
    SHADES
    PROJECT
    Image Credit: #WOCinTech

  72. http://www.aies-conference.com/wp-content/uploads/2019/01/AIES-19_paper_223.pdf

  73. https://www.ajlunited.org/fight

  74. Photo by rawpixel on Unsplash
    No matter what, it is our
    responsibility to build
    systems that are fair.

  75. https://w3.ibm.com/developer/callforcode/

  76. THANKS!
    Any questions?
    @Mo_Mack