Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Image recognition of handwritten digits in MNIS...
Search
Dimitris Spathis
January 17, 2016
Research
0
380
Image recognition of handwritten digits in MNIST and flower classification in IRIS dataset
Code available here
https://github.com/sdimi/handwritten-digits-recognition
Dimitris Spathis
January 17, 2016
Tweet
Share
More Decks by Dimitris Spathis
See All by Dimitris Spathis
Song analysis "Jueves - La Oreja de Van Gogh "
sdimi
0
690
Wikipedia graph network analysis
sdimi
0
150
Character level LSTM Recurrent Neural Networks for language and music modeling
sdimi
0
300
Topic modeling and summarization of live TV shows via Twitter
sdimi
0
140
Diagnosing respiratory diseases with machine learning
sdimi
0
160
Detecting Irony on Greek Political Tweets: A Text Mining Approach
sdimi
0
120
Greece Unknown Facts
sdimi
1
190
Glocal News - Discover Top Local News, Globally.
sdimi
0
200
Responsive Web - Mobile First. An overview.
sdimi
1
190
Other Decks in Research
See All in Research
Adaptive fusion of multi-modal remote sensing data for optimal sub-field crop yield prediction
satai
3
220
Generative Models 2025
takahashihiroshi
21
12k
rtrec@dbem6
myui
6
870
EarthMarker: A Visual Prompting Multimodal Large Language Model for Remote Sensing
satai
3
340
最適化と機械学習による問題解決
mickey_kubo
0
140
[輪講] SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
nk35jk
2
520
SkySense : A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
satai
3
250
学生向けアンケート<データサイエンティストについて>
datascientistsociety
PRO
0
3.1k
RHO-1: Not All Tokens Are What You Need
sansan_randd
1
110
SSII2025 [TS3] 医工連携における画像情報学研究
ssii
PRO
2
1.2k
RapidPen: AIエージェントによるペネトレーションテスト 初期侵入全自動化の研究
laysakura
0
1.5k
MGDSS:慣性式モーションキャプチャを用いたジェスチャによるドローンの操作 / ec75-yamauchi
yumulab
0
240
Featured
See All Featured
The MySQL Ecosystem @ GitHub 2015
samlambert
251
13k
Building a Modern Day E-commerce SEO Strategy
aleyda
42
7.4k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.3k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
233
17k
Site-Speed That Sticks
csswizardry
10
670
Rebuilding a faster, lazier Slack
samanthasiow
82
9.1k
Code Review Best Practice
trishagee
69
18k
Git: the NoSQL Database
bkeepers
PRO
430
65k
Writing Fast Ruby
sferik
628
62k
Practical Orchestrator
shlominoach
188
11k
Side Projects
sachag
455
42k
Unsuck your backbone
ammeep
671
58k
Transcript
Αναγνώριση χειρόγραφων χαρακτήρων στο MNIST dataset & φυτών στο IRIS
dataset με μεθόδους μηχανικής μάθησης Δημήτρης Σπαθής Εξαμηνιαία εργασία – Ιαν 2016 Υπολογιστική Νοημοσύνη – Στατιστική Μάθηση Καθ.: Αναστάσιος Τέφας
MNIST dataset 70.000 εικόνες 28 x 28 pixel 784 διαστάσεις
60.000 train 10.000 test {0,1,2,..9} multi-class
Εργαλεία Python python.org Scikit – learn scikit-learn.org Matplotlib matplotlib.org Numpy
numpy.org
Προεπεξεργασία δεδομένων Ανακάτεμα δειγμάτων X, y = shuffle(mnist.data, mnist.target) Κανονικοποίηση
pixels [0,1] X_train, y_train = np.float32(X[:60000])/ 255., np.float32(y[:60000])
Κρατάμε 90 components 90,3% της αρχικής πληροφορίας PCA – Μείωση
Διάστασης (784 → 90)
Εκπαίδευση SVM fitting classifier = svm.SVC(gamma=0.01, C=3, kernel='rbf') 5 Cross
validation cross_validation.cross_val_score(classifier, X_train, y_train, cv=5)
Αποτελέσματα εκπαίδευσης
Παραδείγματα ταξινόμησης
Μείωση Διάστασης Kernel PCA (784 → 300) kpca = KernelPCA(kernel="rbf",n_components=300
, gamma=1) LDA (300 → 9) lda = LDA() #should keep [classes – 1] components
Nearest Classifier K Nearest Neighbor clf = neighbors.KNeighborsClassifier(n_neighbors=5) Nearest Centroid
classifier = NearestCentroid(metric='euclidean', shrink_threshold=None)
Αποτελέσματα εκπαίδευσης
Embedding για Μείωση Διάστασης (784 → 2) Spectral Embedding manifold.SpectralEmbedding
(n_components=2, affinity='nearest_neighbors', gamma=None, random_state=None, eigen_solver=None, n_neighbors=5) Isomap Embedding manifold.Isomap(n_neighbors=5, n_components=2)
Spectral Clustering Kρατάμε 5000 δείγματα για οπτικοποίηση Spectral Clustering cluster.SpectralClustering(n_clusters=10,
eigen_solver='arpack', affinity="nearest_neighbors")
None
None
Αποτελέσματα clustering
IRIS dataset 150 λουλούδια 4 διαστάσεις sepal length sepal width
petal length petal width 3 κλάσεις Iris Setosa Iris Versicolour Iris Virginica
Αποτελέσματα SVM εκπαίδευσης
SVM fine-tuning C
SVM fine-tuning Degree
SVM fine-tuning Gamma
Μείωση Διάστασης
None
Αποτελέσματα clustering & embedding
None
None
Further work Kernel PCA – Memory Errors Incremental PCA Grid
Search Deep Architectures Distributed / Parallel MapReduce / Spark Κώδικας σύντομα στο github.com/sdimi