Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Leveraging Generative AI to help the blind and ...

Dunith Dhanushka
October 09, 2024
20

Leveraging Generative AI to help the blind and visually impaired people

Dunith Dhanushka

October 09, 2024
Tweet

Transcript

  1. © 2024 Redpanda Data About the presenter 2 Dunith Dhanushka

    Senior Developer Advocate, Redpanda Data • Event streaming, real-time analytics, and stream processing enthusiast • Frequent blogger, speaker, and an educator
  2. © 2024 Redpanda Data 6 Generative Models with Vision Capabilities

    • Open AI - Open AI • Llama 3 - Meta • Claude 3 - Anthropic • Vertex AI - Google • Mistral 7B, Pixtral 12B - Mistral
  3. © 2024 Redpanda Data 13 Make it affordable Make it

    cost-efficient and economical in long term
  4. © 2024 Redpanda Data 15 1. User presses the push

    button 2. Camera takes a photo 3. Image is sent to the LLM 4. Image description is generated 5. Image is narrated to the user
  5. © 2024 Redpanda Data Implementation Options 16 Building a Smart

    Device with Raspberry Pi Building a smartphone app
  6. © 2024 Redpanda Data Foundation - Raspberry Pi 5 How

    the device does the processing? 18 Affordable compute for reasonable performance
  7. © 2024 Redpanda Data Python Program 1. Takes the image

    captured by the camera. 2. Sends the base64 encoded image to LLM over the Internet. 3. Receives the response and converts it to audio via a text-to-speech converter. 20
  8. © 2024 Redpanda Data Why a Smartphone? 24 Camera Storage

    Connectivity Audio Power Performance
  9. © 2024 Redpanda Data Raspberry Pi FTW! • Affordable •

    Flexible • Offers reasonable performance 26
  10. © 2024 Redpanda Data Working with Local LLMs Pros •

    Reduced latency • Itʼs free! • Data privacy Cons • Performance Models with large parameters requires more storage and processing power. GPUs are preferred over CPUs) 29
  11. 32

  12. © 2024 Redpanda Data Potential use cases… • Remote patient

    monitoring (ideal for care homes and hospitals) • Monitoring the workers working in low visibility environments E.g Monitoring the vitals of fire fighters and assist them with their vision) • Personal assistant to the elderly • AI-assisted bodycam for rescue workers 33 What else we can do with this?
  13. © 2024 Redpanda Data Wrap up • Blind people need

    a practical and affordable solution to make their lives easier. Especially, increase their confidence in mobility and social inclusion. • Image recognition and classification is an already solved problem. But using Generative AI to describe images makes it more accessible, affordable and scalable. • Redpanda and its ecosystem can take individual devices to the next level by implementing a monitoring and assistive technology for the visually impaired people. 34
  14. © 2023 REDPANDA DATA Redpanda University Free, self-paced online learning

    https://university.redpanda.com • Learn the fundamentals of data streaming and Redpanda • Install Redpanda and use the rpk CLI to configure it • Create producers and consumers in Java, Python and NodeJS • Sign up today for free! 35
  15. © 2024 Redpanda Data Thanks for joining! Letʼs keep in

    touch @redpandadata redpanda-data redpanda-data [email protected]