Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Image recognition of handwritten digits in MNIS...
Search
Dimitris Spathis
January 17, 2016
Research
0
410
Image recognition of handwritten digits in MNIST and flower classification in IRIS dataset
Code available here
https://github.com/sdimi/handwritten-digits-recognition
Dimitris Spathis
January 17, 2016
Tweet
Share
More Decks by Dimitris Spathis
See All by Dimitris Spathis
Song analysis "Jueves - La Oreja de Van Gogh "
sdimi
0
730
Wikipedia graph network analysis
sdimi
0
160
Character level LSTM Recurrent Neural Networks for language and music modeling
sdimi
0
310
Topic modeling and summarization of live TV shows via Twitter
sdimi
0
150
Diagnosing respiratory diseases with machine learning
sdimi
0
170
Detecting Irony on Greek Political Tweets: A Text Mining Approach
sdimi
0
120
Greece Unknown Facts
sdimi
1
190
Glocal News - Discover Top Local News, Globally.
sdimi
0
200
Responsive Web - Mobile First. An overview.
sdimi
1
190
Other Decks in Research
See All in Research
Grounding Text Complexity Control in Defined Linguistic Difficulty [Keynote@*SEM2025]
yukiar
0
120
2026年3月1日(日)福島「除染土」の公共利用をかんがえる
atsukomasano2026
0
390
ブレグマン距離最小化に基づくリース表現量推定:バイアス除去学習の統一理論
masakat0
0
170
都市交通マスタープランとその後への期待@熊本商工会議所・熊本経済同友会
trafficbrain
0
160
Dwangoでの漫画データ活用〜漫画理解と動画作成〜@コミック工学シンポジウム2025
kzmssk
0
140
Akamaiのキャッシュ効率を支えるAdaptSizeについての論文を読んでみた
bootjp
1
480
一般道の交通量減少と速度低下についての全国分析と熊本市におけるケーススタディ(20251122 土木計画学研究発表会)
trafficbrain
0
170
Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities
satai
3
600
それ、チームの改善になってますか?ー「チームとは?」から始めた組織の実験ー
hirakawa51
0
850
第66回コンピュータビジョン勉強会@関東 Epona: Autoregressive Diffusion World Model for Autonomous Driving
kentosasaki
0
440
AIスーパーコンピュータにおけるLLM学習処理性能の計測と可観測性 / AI Supercomputer LLM Benchmarking and Observability
yuukit
1
710
AI Agentの精度改善に見るML開発との共通点 / commonalities in accuracy improvements in agentic era
shimacos
5
1.4k
Featured
See All Featured
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
850
SEO Brein meetup: CTRL+C is not how to scale international SEO
lindahogenes
0
2.4k
sira's awesome portfolio website redesign presentation
elsirapls
0
180
My Coaching Mixtape
mlcsv
0
64
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
400
How to Talk to Developers About Accessibility
jct
2
140
Design in an AI World
tapps
0
160
Measuring & Analyzing Core Web Vitals
bluesmoon
9
770
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
55
3.3k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
Ecommerce SEO: The Keys for Success Now & Beyond - #SERPConf2024
aleyda
1
1.8k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
122
21k
Transcript
Αναγνώριση χειρόγραφων χαρακτήρων στο MNIST dataset & φυτών στο IRIS
dataset με μεθόδους μηχανικής μάθησης Δημήτρης Σπαθής Εξαμηνιαία εργασία – Ιαν 2016 Υπολογιστική Νοημοσύνη – Στατιστική Μάθηση Καθ.: Αναστάσιος Τέφας
MNIST dataset 70.000 εικόνες 28 x 28 pixel 784 διαστάσεις
60.000 train 10.000 test {0,1,2,..9} multi-class
Εργαλεία Python python.org Scikit – learn scikit-learn.org Matplotlib matplotlib.org Numpy
numpy.org
Προεπεξεργασία δεδομένων Ανακάτεμα δειγμάτων X, y = shuffle(mnist.data, mnist.target) Κανονικοποίηση
pixels [0,1] X_train, y_train = np.float32(X[:60000])/ 255., np.float32(y[:60000])
Κρατάμε 90 components 90,3% της αρχικής πληροφορίας PCA – Μείωση
Διάστασης (784 → 90)
Εκπαίδευση SVM fitting classifier = svm.SVC(gamma=0.01, C=3, kernel='rbf') 5 Cross
validation cross_validation.cross_val_score(classifier, X_train, y_train, cv=5)
Αποτελέσματα εκπαίδευσης
Παραδείγματα ταξινόμησης
Μείωση Διάστασης Kernel PCA (784 → 300) kpca = KernelPCA(kernel="rbf",n_components=300
, gamma=1) LDA (300 → 9) lda = LDA() #should keep [classes – 1] components
Nearest Classifier K Nearest Neighbor clf = neighbors.KNeighborsClassifier(n_neighbors=5) Nearest Centroid
classifier = NearestCentroid(metric='euclidean', shrink_threshold=None)
Αποτελέσματα εκπαίδευσης
Embedding για Μείωση Διάστασης (784 → 2) Spectral Embedding manifold.SpectralEmbedding
(n_components=2, affinity='nearest_neighbors', gamma=None, random_state=None, eigen_solver=None, n_neighbors=5) Isomap Embedding manifold.Isomap(n_neighbors=5, n_components=2)
Spectral Clustering Kρατάμε 5000 δείγματα για οπτικοποίηση Spectral Clustering cluster.SpectralClustering(n_clusters=10,
eigen_solver='arpack', affinity="nearest_neighbors")
None
None
Αποτελέσματα clustering
IRIS dataset 150 λουλούδια 4 διαστάσεις sepal length sepal width
petal length petal width 3 κλάσεις Iris Setosa Iris Versicolour Iris Virginica
Αποτελέσματα SVM εκπαίδευσης
SVM fine-tuning C
SVM fine-tuning Degree
SVM fine-tuning Gamma
Μείωση Διάστασης
None
Αποτελέσματα clustering & embedding
None
None
Further work Kernel PCA – Memory Errors Incremental PCA Grid
Search Deep Architectures Distributed / Parallel MapReduce / Spark Κώδικας σύντομα στο github.com/sdimi