Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Introduction to NLP : How to improve accessibil...
Search
Victoria Ubaldo
June 11, 2021
Technology
1
180
Introduction to NLP : How to improve accessibility with Machine Learning
WWCode CONNECT REIMAGINE 2021
Victoria Ubaldo
June 11, 2021
Tweet
Share
More Decks by Victoria Ubaldo
See All by Victoria Ubaldo
Analizar Datos con Python y Google Colab
vickyale
1
110
¿Cómo estar al día en Tecnología, como profesional tech?
vickyale
1
390
Primeros pasos en AI/ML
vickyale
0
97
ML Zero to Hero in GCP
vickyale
0
66
Agile & Big Data
vickyale
1
35
Clean Data with Python
vickyale
0
220
AoG Workshop for IWD19
vickyale
0
120
Introducción a Actions for Google Assistant
vickyale
0
56
Kit para crear un sitio web móvil asombroso!
vickyale
0
35
Other Decks in Technology
See All in Technology
利用終了したドメイン名の最強終活〜観測環境を育てて、分析・供養している件〜 / The Ultimate End-of-Life Preparation for Discontinued Domain Names
nttcom
2
200
次世代KYC活動報告 / 20250219-BizDay17-KYC-nextgen
oidfj
0
260
30分でわかる『アジャイルデータモデリング』
hanon52_
9
2.7k
Swiftの “private” を テストする / Testing Swift "private"
yutailang0119
0
130
Helm , Kustomize に代わる !? 次世代 k8s パッケージマネージャー Glasskube 入門 / glasskube-entry
parupappa2929
0
250
データの品質が低いと何が困るのか
kzykmyzw
6
1.1k
あれは良かった、あれは苦労したB2B2C型SaaSの新規開発におけるCloud Spanner
hirohito1108
2
610
急成長する企業で作った、エンジニアが輝ける制度/ 20250214 Rinto Ikenoue
shift_evolve
3
1.3k
CZII - CryoET Object Identification 参加振り返り・解法共有
tattaka
0
370
TAMとre:Capセキュリティ編 〜拡張脅威検出デモを添えて〜
fujiihda
2
250
白金鉱業Meetup Vol.17_あるデータサイエンティストのデータマネジメントとの向き合い方
brainpadpr
6
760
株式会社EventHub・エンジニア採用資料
eventhub
0
4.3k
Featured
See All Featured
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
160
15k
Faster Mobile Websites
deanohume
306
31k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.1k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
12
960
StorybookのUI Testing Handbookを読んだ
zakiyama
28
5.5k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
47
5.2k
Side Projects
sachag
452
42k
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
6
550
Build The Right Thing And Hit Your Dates
maggiecrowley
34
2.5k
Product Roadmaps are Hard
iamctodd
PRO
50
11k
The Cost Of JavaScript in 2023
addyosmani
47
7.3k
The Power of CSS Pseudo Elements
geoffreycrofte
75
5.5k
Transcript
None
Victoria Ubaldo @Vikyale Introduction to NLP : How to improve
accessibility with Machine Learning CONNECT REIMAGINE 2021
Hello! Computer and System Engineer. Peruvian Data Analyst @Interbank Msc
Candidate Computer Science, PUCP. Enjoy mentoring, illustration and dancing!
Agenda • Machine Learning and Accessibility • Natural Language Processing
• Tensorflow for NLP • Next steps
None
None
None
Machine Learning is programming with data
None
None
Natural Language Process (NLP)
None
Where find NLP ? Chatbots Predictive text (autocomplete) Transcription Language
translation Others: Text Analysis Email filters Search Result Sentiment Analysis
Challenges in NLP
None
Challenges in NLP : Textual Accessibility Visual Auditory Speech
Accessible = Comprehensible
Challenges in NLP : Textual Accessibility Transcription Speech recognition Language
Translation
Case : Recognize sentiment in Text
None
Amazon and Yelp reviews 1 : Good 0: Bad
Pre-processing data
Text Processing
Tokenization
Out Of Vocabulary (OOV) EXTRA WORDS unknown tokens (UNK)
Stop Words
Stemming
Lemmatization
Padding
Tools! • Python 3.+ • Jupyter Notebook or Google Colab
• and using a framework for ML
¿Why Tensorflow? Whether you’re an expert or a beginner, TensorFlow
is an end-to-end platform that makes it easy for you to build and deploy ML models.
Trying in Google Colab bit.ly/2TU9MJz
Challenges in NLP : Textual Accessibility Transcription Speech recognition
None
Challenges in NLP : Textual Accessibility Language Translation
None
Next steps Try new tools : - Recurrent Neural Network
(RNN) - Long short-term memory (LSTM)
Next steps Increase your corpus with diverse data. - PDFMiner
- Tweepy API
Next steps Datasets with native or indigenous language or code
mixing
“We can build a much brighter future where humans are
relieved of menial work using AI capabilities” - Andrew Ng
Thanks! @womenwhocode #WWCode Keep in touch! linkedin.com/in/victoriaubaldo @vikyale
None