Machine Learning + Symbolic Reasoning: a Quarkus story on Artificial Intelligence

Slide 1

Slide 1 text

Machine Learning + Symbolic Reasoning: A Quarkus story on Artificial Intelligence by Mario Fusco + Nicole Prentzas +

Slide 2

Slide 2 text

Why not Machine Learning alone?

Slide 3

Slide 3 text

What is Machine Learning? ML is the thing suggesting to buy laundry detergent together with your new washing machine because they're statistically related ...

Slide 4

Slide 4 text

ML is the thing suggesting to buy laundry detergent together with your new washing machine because they're statistically related ... … or more often the one thinking that, since you want to buy a washing machine, you decided to start up a whole new laundry. What is Machine Learning?

Slide 5

Slide 5 text

No content

Slide 6

Slide 6 text

It’s all about (fancy) naming

Slide 7

Slide 7 text

Why not Machine Learning + Symbolic Reasoning ? ML to find statistical correlations and discover new patterns SR to filter away the ones not making sense under a business point of view

Slide 8

Slide 8 text

Why not Machine Learning + Symbolic Reasoning ? ML to find statistical correlations and discover new patterns SR to filter away the ones not making sense under a business point of view

Slide 9

Slide 9 text

Why not Machine Learning + Symbolic Reasoning ? ML to find statistical correlations and discover new patterns SR to filter away the ones not making sense under a business point of view Your business domain knowledge lives here!

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

No content

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

Would you allow an hallucinated blue Caterpillar to decide about your business?

Slide 16

Slide 16 text

No content

Slide 17

Slide 17 text

General Machine Learning algorithms problems ❖ Accountability ➢ Who is responsible for the decision taken by a ML algorithm? The creator of the ML-model? The service provider? The final user?

Slide 18

Slide 18 text

General Machine Learning algorithms problems ❖ Accountability ➢ Who is responsible for the decision taken by a ML algorithm? The creator of the ML-model? The service provider? The final user? ❖ Transparency ➢ While traditional algorithms can be examined line by line, arriving at an exact understanding of their internal dynamics and the reasons for a certain result, ML algorithms automatically extracts knowledge from the data, forming an idea of the problem based on statistical correlations known only to it. ➢ Trying to understand the reason that drove to a decision simply looking at the ML-model would be like trying to know why someone thinks in a certain way by dissecting their brain.

Slide 19

Slide 19 text

General Machine Learning algorithms problems ❖ Accountability ➢ Who is responsible for the decision taken by a ML algorithm? The creator of the ML-model? The service provider? The final user? ❖ New definition of malfunctioning ➢ Given their statistical nature, ML algorithms can hallucinate or make other spectacular mistakes even when they work very well. ➢ These mistakes cannot be fixed in the same way of a software bug. ❖ Transparency ➢ While traditional algorithms can be examined line by line, arriving at an exact understanding of their internal dynamics and the reasons for a certain result, ML algorithms automatically extracts knowledge from the data, forming an idea of the problem based on statistical correlations known only to it. ➢ Trying to understand the reason that drove to a decision simply looking at the ML-model would be like trying to know why someone thinks in a certain way by dissecting their brain.

Slide 20

Slide 20 text

Why not fixing LLM’s hallucinations using another LLM to validate the outcome of the first?

Slide 21

Slide 21 text

Why not fixing LLM’s hallucinations using another LLM to validate the outcome of the first? … this reminds me of a 20 years old joke …

Slide 22

Slide 22 text

Slide 23

Slide 23 text

Mixing LLM and rule engine for a safer airline chatbot LLM - (Mistral 7B) Implement a chatbot to collect information about the customer and his flight Rule engine - (Drools) Calculate customer’s refund based on precise business rules session data Extract customer data from chat Extract flight data from chat Data gathering complete data incomplete data incomplete State machine

Slide 24

Slide 24 text

DEMO TIME! https://github.com/mariofusco/quarkus-drools-llm

Slide 25

Slide 25 text

Mixing LLM and rule engine for a safer airline chatbot

Slide 26

Slide 26 text

No content

Slide 27

Slide 27 text

Machine Learning is a new paradigm Rule Engine Rules Data Answers A rule based program produces Answers matching Data against Rules

Slide 28

Slide 28 text

Machine Learning is a new paradigm Rule Engine Rules Data Answers A rule based program produces Answers matching Data against Rules Machine Learning Answers Data Rules A machine learning system discovers Rules comparing Data and expected Answers

Slide 29

Slide 29 text

Why not mixing them? Rule Engine Rules Data Answers Machine Learning Answers Data Rules

Slide 30

Slide 30 text

Leveraging the strengths of both entities ML (sub-symbolic AI) + High adaptability and flexibility + Handling complexity, vast amounts of data + Automatic rule extraction - Lack of interpretability - Data dependence Rule-engine (symbolic AI) + Speed and reliability + Effective with well-structured problems + Interpretable - Scalability - Lack of adaptability - Knowledge acquisition Machine Learning Rule-engine Hybrid AI + →

Slide 31

Slide 31 text

Symbolic Reasoning Knowledge base Inference engine Question Answer Symbolic AI (i.e. Drools) Human input

Slide 32

Slide 32 text

Slide 33

Slide 33 text

Hybrid AI = Symbolic Reasoning + Machine Learning Knowledge base Inference engine Question Answer Symbolic AI (i.e. Drools) Knowledge base Inference engine Question Answer Data ML learning process Hybrid AI (Drools + ML) Human input Human input Hybrid AI: more versatile AI system that can efficiently handle complex scenarios, adjust to changing conditions, and improve performance and accuracy.

Slide 34

Slide 34 text

ML learning process overview ● Business rules (KB) from data. ● Step 1 - “Initialize-rules”: ○ Add decision-rules to cover all instances in the input dataset. ● Step 2 - “Improve-rules”: ○ Identify rules that erroneously support some data. ○ Add decision-rules to fix the errors. ● We used a revised version of the OneR algorithm to learn decision-rules from data.

Slide 35

Slide 35 text

The OneR algorithm ● OneR, short for "One Rule", is a simple, yet accurate, classification algorithm that generates one rule for each predictor/feature in a dataset, and then selects the rule with the smallest total error as its "one rule" (Berry M and Linoff, 2000)(*). ● Revised version of OneR: the rule with the highest accuracy (smallest error) and highest coverage is selected. We have introduced a “score” metric as the product of the accuracy and coverage of a rule, to ease this selection. (*) F. Alam and S. Pachauri, “Comparative Study of J48, Naive Bayes and One-R Classification Technique for Credit Card Fraud Detection using WEKA,” Adv. Comput. Sci. Technol. , vol. 10, no. 6, 2017.

Slide 36

Slide 36 text

OneR example ● One frequency table for each predictor against the target variable ● For each condition (i.e. outlook=sunny) the most frequent (majority) class is assigned to the rule: ● The rules with the highest score is selected as the one rule for the predictor

Slide 37

Slide 37 text

ML learning process ● Step 1 - “Initialize-rules”: ○ Decision-rules to cover all instances in the input dataset ● Step 2 - “Improve-rules”: ○ Find rules that erroneously support some data. ○ Find a new rule for this subset of data. ○ Fix existing rule and create a new rule for this subset of data. step 1 step 2

Slide 38

Slide 38 text

Drools KB from data Example ● Dataset: diabetes early stage ● Feature selection: ○ polyuria ○ polydipsia ○ gender where age ~ middle age ● Prediction: positive / negative Source: https://www.kaggle.com/code/therealsampat/early-stage-diabetes-prediction

Slide 39

Slide 39 text

Step 1: initial set of rules rule "rule 0" when $a : DiabetesDatapoint( polydipsia == 'Yes' ) then $a.setPrediction( 'positive' ); update( $a ); end rule "rule 1" when $a : DiabetesDatapoint( polydipsia == 'No' ) then $a.setPrediction( 'negative' ); update( $a ); end KB Assessment: >> Coverage = 100% >> Errors = 28/186 (15%)

Slide 40

Slide 40 text

Step 2: improve initial ruleset rule "rule 0" when $a : DiabetesDatapoint( polydipsia == 'Yes' ) then $a.setPrediction( 'positive' ); update( $a ); end rule "rule 1" when $a : DiabetesDatapoint( polydipsia == 'No' ) then $a.setPrediction( 'negative' ); update( $a ); end KB Assessment: >> Coverage = 100% >> Errors = 17/186 (7%) rule "rule 2" when $a : DiabetesDatapoint( gender == ‘Female’, polydipsia == 'No' ) then $a.setPrediction( 'positive' ); update( $a ); end ... rules “rule N”

Slide 41

Slide 41 text

Conclusions ❖ ML is only a fraction of the whole AI story ➢ The general goal of artificial intelligence is mimicking the human intellect and behaviors ➢ There are different ways to achieve this goal, letting the machine learning it from data is only one of them ❖ Machine Learning doesn’t replace Symbolic Reasoning: they are complementary technologies ➢ ML and SR are different technologies with their own pros and cons ➢ Often both the flexibility and human friendliness of ML and the strictness and transparency of SR are required in the same intelligent application and you can leverage and integrate the strengths of both

Slide 42

Slide 42 text

Our projects ● Quarkus + LLM + Drools - https://github.com/mariofusco/quarkus-drools-llm ● Rules creation from data - https://github.com/nprentza/droolsML ● Neuro-symbolic AI - https://en.wikipedia.org/wiki/Neuro-symbolic_AI ● Approaches in Using Generative AI for Business Automation: The Path to Comprehensive Decision Automation - https://medium.com/@pierrefeillet/approaches-in-using-generative-ai-for-business-automat ion-the-path-to-comprehensive-decision-3dd91c57e38f ● Explainable Machine Learning via Argumentation - https://www.researchgate.net/publication/372688199_Explainable_Machine_Learning_via_ Argumentation References