Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Get AI Ready - Prepare for AI Cloud

Get AI Ready - Prepare for AI Cloud

Slide deck from the BrightGen webinar to help organisations start thinking about what they need to do to get ready for AI Cloud, or generative AI in general.

Keir Bowden

July 19, 2023
Tweet

More Decks by Keir Bowden

Other Decks in Technology

Transcript

  1. Welcome Your host • Keir Bowden, CTO, CTA, MVP •

    @bob_buzzard • www.linkedin.com/in/keirbowden What we'll cover • Right Now • Responsible AI • Technology • People • Now, what?! 19/07/2023 2
  2. Popularity Time to one million users: • Netflix – 3.5

    years • Twitter – 2 years • Facebook – 10 months • Spotify – 5 months • Instagram – 2.5 months • Chat GPT – 5 days 19/07/2023 5 https://www.statista.com/chart/29174/time-to-one-million-users/ Threads – 1 hour
  3. Consumers Love AI • 73% of consumers trust content produced

    by Generative AI • 51% are aware of the latest AI trends • 49% are not concerned about the potential for fake news • 34% are worried about use in phishing attacks • 33% worried about copyright issues • 27% worried it will be used to copy competitors designs 19/07/2023 6 Capgemini Research Institute - Why consumers love generative AI : https://www.capgemini.com/insights/research-library/creative-and-generative-ai/
  4. Leadership Senior IT Leaders: • 67% prioritising generative AI withing

    the next 18 months • 33% consider it a top priority • 33% believe it is over-hyped • 80% who believe it is over-hyped still believe it will help • 71% believe it will introduce new security risks • 66% believe their employees don't have the skills • 99% believe their business must take measures to be successful 19/07/2023 7 Salesforce – IT Leaders Call Generative AI a ‘Game Changer’ but Seek Progress on Ethics and Trust : https://www.salesforce.com/uk/news/stories/generative-ai-research/
  5. Get Control Put someone / group in charge • If

    nobody is in charge, everybody is in charge • Who is accountable/responsible? Publish guidelines • Data controls • Approval • Acceptable use cases • Training Encourage collaboration 19/07/2023 9
  6. Questions to ask Are you using existing AI? • If

    not, why not? Why Generative AI? Do you have a use case? • Low hanging fruit • Define success • What will you measure 19/07/2023 10
  7. Examine Your Data Quality data beats clever algorithms • Quality

    in = Quality out • Garbage in = Garbage out Identify dirty data • Incomplete • Inaccurate • Redundant • Irrelevant Fix • Modify • Replace • Delete 19/07/2023 11
  8. Examine Your Data Too little data • Underfitting – simplistic

    model, unable to detect nuance • Overfitting – finely tuned to training data, memorizes rather than learns Too much data • Concept drift – outdated view • Curse of dimensionality – fields that aren't relevant skewing the model • Data saturation – all values present Bias • We'll cover this later. 19/07/2023 12
  9. Trust • AI generates confident and plausible answers • Answers

    are guilty until proven innocent o How did it arrive at that answer? o Do citations/references exist, are they relevant • Create a clear explanation • Share this with your stakeholders – leadership/users/customers • Encourage engagement, so interested parties can: o Understand how it works o Understand how it benefits them o Provide input and guidance to improve 19/07/2023 15
  10. Transparency • Identify where AI is used • Be clear

    about how you will use people's data o Allow them to opt out • Explain how your algorithms work o Identify limitations o Call out potential bias 19/07/2023 16 "Humans need to be aware that they are interacting with an AI system, and must be informed of the system’s capabilities and limitations." https://digital-strategy.ec.europa.eu/en/library/ethics-guidelines-trustworthy-ai
  11. Bias / Fairness "the absence of prejudice or preference for

    an individual or group based on their characteristics" https://towardsdatascience.com/understanding-bias-and-fairness-in-ai-systems-6f7fbfe267f3 Define what fair means to your organisation Weed out bias • Bias – certain elements are overweighted or overrepresented Bias leads to • Skewed outcomes • Low accuracy • Systematic prejudice 19/07/2023 17
  12. Historical Bias Already established by previous activity/data Reflects existing systemic

    bias Reflects social inequities Amplified by machine learning 19/07/2023 18 Example - Amazon CV screening project, 2014 • 10 years of historic data • Most employees male • Almost all technical employees male • Men flagged as suitable candidates • Scrapped in 2015
  13. Selection Bias AKA representation or sample bias Not representative of

    the population you intend to serve Sample set not randomised Limited set of volunteers (self-selection) 19/07/2023 19 Example – Nottingham NHS maternity enquiry • Letters sent in English only • Technical legal language • 250 white respondents • 20 Black/Asian respondents
  14. Implicit Bias Reflecting opinions of the trainer Process data to

    confirm pre-existing hypothesis (confirmation bias) Keep tuning the model until desired outcome achieved (experimenter's bias) 19/07/2023 20 Example – training model for positive sentiment • Thumbs up considered positive • True for large parts of the world • Offensive in parts of West Africa and Middle East
  15. Aggregation Bias One size fits all Incorrectly grouping data Categories

    are too broad 19/07/2023 21 Example – aggregated salary versus years of experience • True for IT, Legal, Finance, Consulting – large cohort • False for professional sport – small cohort • Aggregation biased against pro sports
  16. Removing Bias Scrub/exclude data that identifies gender, race, ethnicity, age,

    schools, location Give all potential participants an equal chance to respond Look for outliers that will have an outsized impact Identify use of default values Internal and external review Don't oversanitise • Stops reflecting original values 19/07/2023 22
  17. Hallucination AKA response not supported by training data • According

    to humans! Impact depends on context • Blog on rugby tactics – low • Self-driving car – high • Medical diagnosis - catastrophic Mitigation: • Verify! • Incorporate context into prompts • Restrict the response – simple yes/no 19/07/2023 23 Lawyer Steven Schwartz, lawsuit vs Avianca airlines • Used ChatGPT for research • 6 cases cited • Included LexisNexis/Weslaw references • Did not exist
  18. Safe and Reliable Protect your data/models from unauthorised access/attack •

    Even AI powered! • Secure data, model and inputs At least as reliable as existing systems: • Scalable • Performant Dependable • Consistent output, even under stress/attack • Fail gracefully and predictably 19/07/2023 24
  19. Sustainable GPT-3 Training • 1064 MWh electricity • 700,000 litres

    of fresh water • 4,000 – 200,000 Kg CO2 GPT-3 Query • 10-20 ml water • 0.002 - 0.004 KWh electricity 19/07/2023 25 https://interestingengineering.com/innovation/training-chatgpt-consumes-water https://towardsdatascience.com/chatgpts-electricity-consumption-7873483feac4 https://shrinkthatfootprint.com/carbon-footprint-of-training-gpt-3-and-large-language-models/ "By 2025, without sustainable AI practices, AI will consume more energy than the human workforce, significantly offsetting carbon-zero gains." https://www.gartner.com/en/articles/gartner-top-10-strategic-predictions-for-2023-and-beyond
  20. AI Ethics Board Responsibilities • Define policy for ethical use

    of AI • Audit use of data and AI • Create training program • Handle issues or complaints raised • Document how AI is being used Diverse membership Executive sponsorship 19/07/2023 26
  21. Ownership Legal implications are complex and uncertain Who owns generated

    content? • Prompt writer? • Developer/trainer of model? • AI itself? • Nobody? Read terms and conditions of service 19/07/2023 28
  22. Patent Reverse of ownership – who infringes? • Original author

    • Trainer • Operator • AI, after additional learning Patent trolls powered by AI? • Generate vast amounts of code • Publish under proprietary license • Sue and profit 19/07/2023 29 https://www.jonesday.com/en/insights/2017/12/catch-me-if-you-can-litigating-artificial-intelligence-patents https://www.todaysgeneralcounsel.com/ai-trolls-will-mirror-patent-trolls/
  23. Copyright USA, only if "substantial human involvement" UK, "the person

    by whom the arrangements necessary for the creation of the work are undertaken" Does copyright on input training data give rights on output? • Maybe, if it replicates and competes with the original creator Diligently document your output • Make clear what was generated by humans vs AI Legal implications are complex and uncertain 19/07/2023 30
  24. Primer - GPT Generative • Creates (generates) new data •

    Based on existing data • Identifies patterns, predicts words Pre-trained • Trained on existing, large dataset • Tuned for specific tasks Transformer • Neural net architecture • Good at context, parallelisation 19/07/2023 33
  25. Primer – Machine Learning Learn from past data Solve problems

    without custom built algorithms No need to program the model Machine learns from data • Supervised learning – human gives examples/labels input • Unsupervised learning – finds patterns in data • Reinforcement learning – "earns rewards" for achieving goals Targets specific tasks – predict / classify 19/07/2023 34
  26. Primer – Artificial Intelligence Still machine learning, plus • Simulate

    human intelligence • Perform complex tasks • Learning, reasoning, self-correction • Handles unstructured data GPT • Predicts the next word in a sentence • Based on provided words and training data • Appends to text • Repeat until stopping condition encountered 19/07/2023 35 https://org62.my.salesforce.com/sfc/p/#000000000062/a/3y000000iOrb/li90p54YaGginl_Ajb1a7z5Tg81rJLkaWQQB4Rdy1Bg
  27. Primer - LLM Large Language Model • Neural Network with

    potentially billions of parameters (neurons) 19/07/2023 36 Hidden Layers Multiple=deep Input Layer Output Layer
  28. AI Cloud An assistant for every product • Improve productivity

    • Licence cost • Ethics should be front of mind Human centred • Make request • Review and adjust • Approve response Trust Salesforce! 19/07/2023 38
  29. Einstein GPT Trust Layer Secure Data Retrieval • Access controls,

    encryption Dynamic Grounding • Additional information specific to use case Data Masking • Replace PII in request with similar, inauthentic data. Reverse in response. Toxicity Detection • Filtering harmful/inappropriate content Audit Trail • Log actions 19/07/2023 40
  30. Einstein GPT Trust Layer Trust Salesforce • But not blind

    trust • Your company is still responsible and accountable • Understand how Einstein GPT Trust Layer works in depth General • Verify accuracy of output • Make sure everyone involved is aware they are using AI • All actions taken by humans – AI assisting • Not 'one and done' - models learn, retest regularly 19/07/2023 41
  31. AI Cloud Scenarios and verification • Transcribing a call –

    everyone consents to recording • Chatbot – respecting boundaries, responding with sensitive data, constant vigilance • Generating messaging content – matches values, goals; appropriate tone • Summarising case – excludes irrelevant information (signal vs noise) • Segmenting audience – matches requirements • Creating code – doesn't introduce security vulnerabilities 19/07/2023 42
  32. The Skills Gap • 60% of service agents don't know

    how to get the most out of generative AI • 48% worry they will lose their job if they don't learn • 53% of sales reps don't know how to get the most out of generative AI • 39% worry they will lose their job if they don't learn Currently using Generative AI 19/07/2023 45 https://www.salesforce.com/uk/news/stories/sales-service-research-generative-ai SALES 35% MARKETING 51% SERVICE 24%
  33. Train Staff – New Skills • Concepts of machine learning

    / AI • How specific tool works • What tasks it is safe to use for • Risks associated with use • How to evaluate output • Where to go with concerns • Never use real data to learn 19/07/2023 46
  34. Train Staff – Existing Skills Human Centered • Know how

    to talk to customers • Understand your products • Handle customers that AI can't • Recognise accurate output Keep training juniors, or there will be no seniors 19/07/2023 47
  35. Staff are Worried AI will take all jobs AI will

    take my job Increased efficiency leads to job cuts Is it worth learning skills • Writing code • Building flows • Creating campaigns 19/07/2023 49
  36. Staff are Worried AI is an assistant, not a superior

    • Creates early drafts, not final versions • Can't do the creative work, just the mundane, repetitive tasks • Staff can work at a higher level • Humans need to understand, correct and approve content Not less staff, the same staff doing more Just because AI can replace humans, doesn't mean it will 19/07/2023 50
  37. Start Learning Be an AI-Minded Admin : sforce.co/44LweUm Einstein GPT

    Quest : bit.ly/3NSN7Wt Building Ethical and Inclusive Products : bit.ly/46WqIQI Data Ethics and Trusted Personalization : bit.ly/3pWXfp6 Webinar: Unlock ROI with Generative AI and GPT : bit.ly/43F3sUM DeepLearning.ai Short Courses : https://bit.ly/46UB3N5 19/07/2023 53
  38. Start Exploring OpenAI GPT Chat BOT : chat.openai.com (Free account

    required) OpenAI Playground : platform.openai.com/playground (API credits/paid plan required) Anthropic Claude : claude.ai (Free account required) Hugging Face : huggingface.co/ AI community/machine learning platform. Lots of apps to try. 19/07/2023 54
  39. Full Disclosure The following images used in this presentation were

    generated by Stable Diffusion 2.1 19/07/2023 55