INTERFACE by apidays 2023 - Unlocking the Power of LLM, Valliappan Narayanan, AT&T

INTERFACE by apidays 2023
APIs for a “Smart” economy. Embedding AI to deliver Smart APIs and turn into an exponential organization
June 28 & 29, 2023

Unlocking the Power of LLM: Harnessing the Potential of an API Gateway for Large Language Models
Valliappan Narayanan, Machine Learning Engineer at AT&T

apidays

July 11, 2023

Transcript

  1. Unlocking the Power of LLM: Harnessing the Potential of an API Gateway for Large Language Models - Valli Narayanan, Engineering Lead, AT&T Inc
  2. Why API Gateway & Benefits
       1. Security Considerations
       2. Performance Optimization
       3. Cost Optimization
       4. Error Handling and Resilience
       5. Monitoring
       6. Integration with Existing Systems
       7. Ethical and Responsible AI
  3. Security Considerations
       1. Securing data privacy
       2. Authentication and authorization mechanisms to prevent unauthorized access
       3. Privacy regulations, intellectual property rights, and compliance frameworks
       4. Centralized Access Control
  4. Cost Optimization
       1. Request optimization, resource allocation, and usage monitoring to ensure efficient resource utilization and cost-effective deployments
       2. Using Multiple Models
  5. Error Handling and Resilience
       1. Error handling mechanisms and resilience strategies
       2. Error codes, retries, circuit breakers, and fault tolerance
  6. Monitoring
       1. Usage patterns, performance metrics, and error rates
       2. Track usage, identify bottlenecks, troubleshoot issues, and optimize resource allocation
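
The Security Considerations slide lists authentication, authorization, and centralized access control without showing a mechanism. One way a gateway could enforce them is sketched below. This is an illustration only, not the speaker's implementation: the key store, client names, and model names are hypothetical, and a real deployment would load hashed keys from a secrets store instead of a module-level dict.

```python
import hashlib
import hmac


def _digest(api_key: str) -> str:
    """Store and compare SHA-256 digests so raw keys never sit in memory tables."""
    return hashlib.sha256(api_key.encode("utf-8")).hexdigest()


# Hypothetical central key store: key digest -> client name and allowed models.
KEY_STORE = {
    _digest("demo-key-analytics"): {
        "client": "analytics-team",
        "models": {"small-model"},
    },
    _digest("demo-key-research"): {
        "client": "research-team",
        "models": {"small-model", "large-model"},
    },
}


def authorize(api_key: str, model: str) -> str:
    """Return the client name if the key is known and may use `model`.

    Raises PermissionError otherwise. Authentication (who is calling) and
    authorization (what they may call) are both decided in this one place,
    which is the point of centralizing access control at the gateway.
    """
    presented = _digest(api_key)
    for stored, entry in KEY_STORE.items():
        # compare_digest avoids leaking match position through timing.
        if hmac.compare_digest(stored, presented):
            if model not in entry["models"]:
                raise PermissionError(
                    f"{entry['client']} is not allowed to use {model}"
                )
            return entry["client"]
    raise PermissionError("unknown API key")
```

Calling `authorize("demo-key-research", "large-model")` returns the client name, while the analytics key is rejected for the same model.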
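
The Cost Optimization slide mentions "Using Multiple Models" but does not say how requests are split between them. A plausible reading is that the gateway sends each request to the cheapest model able to handle it. The sketch below assumes that reading. The tier names, token limits, prices, and the four-characters-per-token estimate are all made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str                  # hypothetical model identifier
    max_prompt_tokens: int     # largest prompt this tier should receive
    usd_per_1k_tokens: float   # illustrative price, not a real quote


# Ordered cheapest first. Every value here is an assumption.
TIERS = [
    ModelTier("small-model", 500, 0.0005),
    ModelTier("medium-model", 4000, 0.002),
    ModelTier("large-model", 32000, 0.03),
]


def estimate_tokens(prompt: str) -> int:
    """Crude heuristic (an assumption): roughly 4 characters per token."""
    return max(1, len(prompt) // 4)


def pick_model(prompt: str) -> ModelTier:
    """Choose the cheapest tier whose limit fits the estimated prompt size."""
    tokens = estimate_tokens(prompt)
    for tier in TIERS:
        if tokens <= tier.max_prompt_tokens:
            return tier
    raise ValueError(f"prompt of about {tokens} tokens exceeds every tier")


def estimated_cost_usd(prompt: str) -> float:
    """Estimated prompt cost on the tier the router would choose."""
    tier = pick_model(prompt)
    return estimate_tokens(prompt) / 1000 * tier.usd_per_1k_tokens
```

Prompt size is only one possible routing signal. A gateway could equally route on task type or on the caller's budget.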
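
The Error Handling and Resilience slide names retries and circuit breakers. A minimal sketch of how the two might be combined in front of an LLM backend follows. The class and function names are hypothetical, the thresholds are arbitrary, and the clock and sleep function are injectable only so the behaviour can be tested without waiting.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when the breaker is refusing calls to a failing backend."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_after_s=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_after_s = reset_after_s
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after_s:
                raise CircuitOpenError("backend temporarily disabled")
            # Cool-down elapsed: allow one trial call (half-open state).
            # A single further failure will reopen the circuit.
            self.opened_at = None
            self.failures = self.failure_threshold - 1
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = self.clock()
            raise
        self.failures = 0  # any success resets the count
        return result


def call_with_retries(fn, attempts=3, base_delay_s=0.5, sleep=time.sleep):
    """Retry `fn` with exponential backoff: base, 2x base, 4x base, ..."""
    for attempt in range(attempts):
        try:
            return fn()
        except CircuitOpenError:
            raise  # do not keep hitting a backend the breaker has disabled
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay_s * 2 ** attempt)
```

A gateway would typically wrap the backend call as `call_with_retries(lambda: breaker.call(send_request))`, where `send_request` stands in for whatever function performs the upstream request.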
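
The Monitoring slide lists usage patterns, performance metrics, and error rates. A small in-memory sketch of how a gateway might collect them per client is shown below. The names are hypothetical, and a production gateway would more likely export these figures to a metrics system than keep them in process memory.

```python
import math
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class ClientStats:
    requests: int = 0
    errors: int = 0
    tokens: int = 0
    latencies_ms: list = field(default_factory=list)


class UsageMonitor:
    """Aggregates per-client request counts, token usage, errors, and latency."""

    def __init__(self):
        self._stats = defaultdict(ClientStats)

    def record(self, client_id: str, latency_ms: float, tokens: int, ok: bool) -> None:
        stats = self._stats[client_id]
        stats.requests += 1
        stats.tokens += tokens
        stats.latencies_ms.append(latency_ms)
        if not ok:
            stats.errors += 1

    def summary(self, client_id: str):
        """Return a dict of aggregate metrics, or None for an unseen client."""
        stats = self._stats.get(client_id)
        if stats is None or stats.requests == 0:
            return None
        ordered = sorted(stats.latencies_ms)
        # Nearest-rank 95th percentile: smallest value covering 95% of samples.
        p95_index = math.ceil(0.95 * len(ordered)) - 1
        return {
            "requests": stats.requests,
            "tokens": stats.tokens,
            "error_rate": stats.errors / stats.requests,
            "avg_latency_ms": sum(ordered) / len(ordered),
            "p95_latency_ms": ordered[p95_index],
        }
```

A high `p95_latency_ms` or a rising `error_rate` for one client is the kind of signal the slide refers to when it mentions identifying bottlenecks and troubleshooting issues.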