Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Image recognition of handwritten digits in MNIS...
Search
Dimitris Spathis
January 17, 2016
Research
0
390
Image recognition of handwritten digits in MNIST and flower classification in IRIS dataset
Code available here
https://github.com/sdimi/handwritten-digits-recognition
Dimitris Spathis
January 17, 2016
Tweet
Share
More Decks by Dimitris Spathis
See All by Dimitris Spathis
Song analysis "Jueves - La Oreja de Van Gogh "
sdimi
0
700
Wikipedia graph network analysis
sdimi
0
150
Character level LSTM Recurrent Neural Networks for language and music modeling
sdimi
0
310
Topic modeling and summarization of live TV shows via Twitter
sdimi
0
140
Diagnosing respiratory diseases with machine learning
sdimi
0
160
Detecting Irony on Greek Political Tweets: A Text Mining Approach
sdimi
0
120
Greece Unknown Facts
sdimi
1
190
Glocal News - Discover Top Local News, Globally.
sdimi
0
200
Responsive Web - Mobile First. An overview.
sdimi
1
190
Other Decks in Research
See All in Research
20250624_熊本経済同友会6月例会講演
trafficbrain
1
660
IMC の細かすぎる話 2025
smly
2
670
スキマバイトサービスにおける現場起点でのデザインアプローチ
yoshioshingyouji
0
240
最適化と機械学習による問題解決
mickey_kubo
0
180
Large Language Model Agent: A Survey on Methodology, Applications and Challenges
shunk031
17
10k
超高速データサイエンス
matsui_528
1
150
Sat2City:3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion
satai
3
110
cvpaper.challenge 10年の軌跡 / cvpaper.challenge a decade-long journey
gatheluck
3
350
RHO-1: Not All Tokens Are What You Need
sansan_randd
1
190
なめらかなシステムと運用維持の終わらぬ未来 / dicomo2025_coherently_fittable_system
monochromegane
0
3.6k
Time to Cash: The Full Stack Breakdown of Modern ATM Attacks
ratatata
0
160
Unsupervised Domain Adaptation Architecture Search with Self-Training for Land Cover Mapping
satai
3
190
Featured
See All Featured
A Tale of Four Properties
chriscoyier
160
23k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
189
55k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
132
19k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.5k
Large-scale JavaScript Application Architecture
addyosmani
514
110k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
657
61k
GraphQLとの向き合い方2022年版
quramy
49
14k
The Language of Interfaces
destraynor
162
25k
How To Stay Up To Date on Web Technology
chriscoyier
791
250k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
15
1.7k
Rails Girls Zürich Keynote
gr2m
95
14k
Transcript
Αναγνώριση χειρόγραφων χαρακτήρων στο MNIST dataset & φυτών στο IRIS
dataset με μεθόδους μηχανικής μάθησης Δημήτρης Σπαθής Εξαμηνιαία εργασία – Ιαν 2016 Υπολογιστική Νοημοσύνη – Στατιστική Μάθηση Καθ.: Αναστάσιος Τέφας
MNIST dataset 70.000 εικόνες 28 x 28 pixel 784 διαστάσεις
60.000 train 10.000 test {0,1,2,..9} multi-class
Εργαλεία Python python.org Scikit – learn scikit-learn.org Matplotlib matplotlib.org Numpy
numpy.org
Προεπεξεργασία δεδομένων Ανακάτεμα δειγμάτων X, y = shuffle(mnist.data, mnist.target) Κανονικοποίηση
pixels [0,1] X_train, y_train = np.float32(X[:60000])/ 255., np.float32(y[:60000])
Κρατάμε 90 components 90,3% της αρχικής πληροφορίας PCA – Μείωση
Διάστασης (784 → 90)
Εκπαίδευση SVM fitting classifier = svm.SVC(gamma=0.01, C=3, kernel='rbf') 5 Cross
validation cross_validation.cross_val_score(classifier, X_train, y_train, cv=5)
Αποτελέσματα εκπαίδευσης
Παραδείγματα ταξινόμησης
Μείωση Διάστασης Kernel PCA (784 → 300) kpca = KernelPCA(kernel="rbf",n_components=300
, gamma=1) LDA (300 → 9) lda = LDA() #should keep [classes – 1] components
Nearest Classifier K Nearest Neighbor clf = neighbors.KNeighborsClassifier(n_neighbors=5) Nearest Centroid
classifier = NearestCentroid(metric='euclidean', shrink_threshold=None)
Αποτελέσματα εκπαίδευσης
Embedding για Μείωση Διάστασης (784 → 2) Spectral Embedding manifold.SpectralEmbedding
(n_components=2, affinity='nearest_neighbors', gamma=None, random_state=None, eigen_solver=None, n_neighbors=5) Isomap Embedding manifold.Isomap(n_neighbors=5, n_components=2)
Spectral Clustering Kρατάμε 5000 δείγματα για οπτικοποίηση Spectral Clustering cluster.SpectralClustering(n_clusters=10,
eigen_solver='arpack', affinity="nearest_neighbors")
None
None
Αποτελέσματα clustering
IRIS dataset 150 λουλούδια 4 διαστάσεις sepal length sepal width
petal length petal width 3 κλάσεις Iris Setosa Iris Versicolour Iris Virginica
Αποτελέσματα SVM εκπαίδευσης
SVM fine-tuning C
SVM fine-tuning Degree
SVM fine-tuning Gamma
Μείωση Διάστασης
None
Αποτελέσματα clustering & embedding
None
None
Further work Kernel PCA – Memory Errors Incremental PCA Grid
Search Deep Architectures Distributed / Parallel MapReduce / Spark Κώδικας σύντομα στο github.com/sdimi