Classifying text by sentiment, spam, and topic
Text Classification
• Topic Classification
• Spam/Not Spam
• Sentiment Analysis
[Figure: sample text passages assigned to Label 1, Label 2, and Label 3]
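The classification tasks above can be sketched with the Natural Language framework's NLModel, assuming a custom text classifier has already been trained and compiled (for example with Create ML). The file name SpamClassifier.mlmodelc and the helper classify(_:modelURL:) are hypothetical names used only for illustration.

```swift
import Foundation
import NaturalLanguage

/// Returns the label predicted by a compiled text classifier, or nil if the
/// model cannot be loaded or yields no prediction.
/// `modelURL` is a hypothetical path to a model you trained yourself.
func classify(_ text: String, modelURL: URL) -> String? {
    guard let model = try? NLModel(contentsOf: modelURL) else { return nil }
    return model.predictedLabel(for: text)
}

// Assumed file name; replace with the location of your own compiled model.
let modelURL = URL(fileURLWithPath: "SpamClassifier.mlmodelc")
if let label = classify("You have won a free cruise!", modelURL: modelURL) {
    print("Predicted label: \(label)")
} else {
    print("No usable model at \(modelURL.path)")
}
```

The labels returned are whatever the model was trained with (topics, spam/not spam, sentiment classes), so the same helper covers all three cases listed on the slide.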
Face capture quality; Sound analysis; Speech; Saliency; Speech on Mac; Sentiment classification; Animal Detection; Text Recognition; Object Tracking; Attention Saliency; Document Camera; On device speech; Word Tagging; Image Similarity; Face Landmark; Landmark Detection; Image saliency; Rectangle Detection; Text catalog; Word Embeddings; Image Classification; NL Transfer learning
Speed vs. accuracy
Fast Versus Accurate

                                 Fast                      Accurate
Processing time                  Optimized for real-time   Asynchronous processing
Memory footprint                 Smallest                  Larger
Support for rotated text         Limited                   Broad
Support for variety of fonts     Limited                   Diverse font styles
Accuracy for natural language    Good                      Best
Choose appropriate parameter settings
Use case
• Read codes/serial numbers just like a barcode reader
• Constrained camera usage
• Interactivity is key

    request = VNRecognizeTextRequest(completionHandler: recognizeTextHandler)
    request.recognitionLevel = .fast
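A fuller sketch of the request setup, assuming a CGImage is already available (for instance from a camera frame). The helper names makeTextRequest and recognizeText are hypothetical. Turning off language correction in fast mode is an assumption that suits codes and serial numbers, not something the slide prescribes.

```swift
import Vision
import CoreGraphics

/// Builds a text recognition request. `fast: true` favors latency,
/// `fast: false` favors accuracy.
func makeTextRequest(fast: Bool,
                     onLines: @escaping ([String]) -> Void) -> VNRecognizeTextRequest {
    let request = VNRecognizeTextRequest { request, error in
        guard error == nil,
              let observations = request.results as? [VNRecognizedTextObservation] else {
            onLines([])
            return
        }
        // Keep only the best candidate string for each detected text region.
        onLines(observations.compactMap { $0.topCandidates(1).first?.string })
    }
    request.recognitionLevel = fast ? .fast : .accurate
    // Assumption: serial numbers are not dictionary words, so skip correction in fast mode.
    request.usesLanguageCorrection = !fast
    return request
}

/// Runs recognition synchronously on one image; call off the main thread.
func recognizeText(in image: CGImage,
                   fast: Bool,
                   onLines: @escaping ([String]) -> Void) throws {
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([makeTextRequest(fast: fast, onLines: onLines)])
}
```

For a live camera preview, one request built with fast: true can be reused across frames. A scanned document would more likely use fast: false and run in the background.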
Sentiment analysis of text
Natural Language: Sentiment Analysis
“I was so excited for the season finale,” reads as positive on its own.
“I was so excited for the season finale, but it was a bit disappointing.” reads as negative (☹).
Quantifying negative or positive sentiment
Sentiment Analysis (Text Classification, Natural Language)
Input: text. Output: a score from -1.0 (negative) through 0 (neutral) to 1.0 (positive).
Example: “We had a not so fun time in Hawaii cause mom twisted her ankle.” scores -0.8.
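A minimal sketch of obtaining such a score with NLTagger and its sentimentScore tag scheme. The helper name sentimentScore(for:) is hypothetical, and the value returned for a given sentence depends on the OS version, so no particular number is guaranteed.

```swift
import NaturalLanguage

/// Returns a sentiment score in -1.0...1.0 for the first paragraph of `text`,
/// or nil if the text is empty or no score is produced.
func sentimentScore(for text: String) -> Double? {
    guard !text.isEmpty else { return nil }
    let tagger = NLTagger(tagSchemes: [.sentimentScore])
    tagger.string = text
    // The score is delivered as a tag whose raw value is a numeric string.
    let (tag, _) = tagger.tag(at: text.startIndex,
                              unit: .paragraph,
                              scheme: .sentimentScore)
    return tag.flatMap { Double($0.rawValue) }
}

let review = "We had a not so fun time in Hawaii cause mom twisted her ankle."
if let score = sentimentScore(for: review) {
    print("Sentiment score: \(score)")
}
```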
No Japanese support
On-Device Language Support
• English: United States, Canada, Great Britain, India
• Spanish: United States, Mexico, Spain
• Italian
• Brazilian Portuguese
• Russian
• Turkish
• Chinese: Mandarin and Cantonese
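Because on-device support varies by language, an app can check it at runtime instead of hard-coding the list. The sketch below assumes speech recognition authorization has already been granted. The helper name makeRequest(for:localeID:) is hypothetical.

```swift
import Foundation
import Speech

/// Builds a recognition request for an audio file. Recognition is kept on
/// device only when the recognizer for that locale reports support for it.
/// Returns nil if the locale is not supported by speech recognition at all.
func makeRequest(for audioURL: URL, localeID: String) -> SFSpeechURLRecognitionRequest? {
    guard let recognizer = SFSpeechRecognizer(locale: Locale(identifier: localeID)) else {
        return nil
    }
    let request = SFSpeechURLRecognitionRequest(url: audioURL)
    request.requiresOnDeviceRecognition = recognizer.supportsOnDeviceRecognition
    return request
}
```

With a locale such as "ja-JP", the expectation from the slide is that supportsOnDeviceRecognition is false, so the request would fall back to server-based recognition.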
Voice features
■ Jitter: a hesitant voice?
■ Shimmer: …?

    // Printing new results when recognizing pre-recorded audio
    if result.isFinal {
        let formattedString = result.bestTranscription.formattedString
        let speakingRate = result.bestTranscription.speakingRate
        let averagePauseDuration = result.bestTranscription.averagePauseDuration
        // Per-frame acoustic features for each transcribed segment
        for segment in result.bestTranscription.segments {
            let jitter = segment.voiceAnalytics?.jitter.acousticFeatureValuePerFrame
            let shimmer = segment.voiceAnalytics?.shimmer.acousticFeatureValuePerFrame
            let pitch = segment.voiceAnalytics?.pitch.acousticFeatureValuePerFrame
            let voicing = segment.voiceAnalytics?.voicing.acousticFeatureValuePerFrame
        }
    }
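Each acousticFeatureValuePerFrame is an array of Double values, one per audio frame, so it is usually reduced to a few summary numbers before use. The sketch below is one possible way to do that. FeatureSummary and summarize(_:) are hypothetical names, and the sample values are made up.

```swift
import Foundation

/// Summary statistics for one per-frame acoustic feature.
struct FeatureSummary: Equatable {
    let mean: Double
    let min: Double
    let max: Double
}

/// Reduces per-frame values to mean, minimum, and maximum.
/// Returns nil for an empty array.
func summarize(_ frames: [Double]) -> FeatureSummary? {
    guard let lowest = frames.min(), let highest = frames.max() else { return nil }
    let mean = frames.reduce(0, +) / Double(frames.count)
    return FeatureSummary(mean: mean, min: lowest, max: highest)
}

// Made-up values standing in for
// segment.voiceAnalytics?.jitter.acousticFeatureValuePerFrame
let sampleFrames = [0.5, 1.0, 1.5]
if let summary = summarize(sampleFrames) {
    print("mean \(summary.mean), min \(summary.min), max \(summary.max)")
}
```

The same helper works for jitter, shimmer, pitch, and voicing, since all four expose their values in the same form.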