Slide 24
Slide 24 text
Summary
This project illustrated how an image of Manchu script could be cut into each
recognizable unit through a pre-processing method. The pre-processing method first
standardizes the image data it reads in, then, in a step-by-step manner, divides the image
into (1) Lines of the script, then (2) Words of the script, and finally (3) Letters of the script.
Future research needs to be conducted in confirming whether or not (and how
accurately) each unit (letter) that was cut from the image matches the actual Manchu
alphabet. It is expected that there will be some margin of error due to the script cutting
perfectly horizontal lines at the narrowest point of each word.
Thus, a er performing an accuracy check, the algorithm may be improved upon by
adjusting the cutting threshold values for each word, or by rotation of the cutting line (or
the image itself) at certain locations.
In conclusion, we hope to illustrate that this method of extracting Manchu characters
from an image is a (relatively) simple and viable solution, even though some
improvements may need to be made.
24