We developed a segmentation free word spotter, which means that each word do not have to be extracted from the text. Instead a sliding window (cyan box) is used to traverse the document. Key point matching was performed on four sets of different key points, using a descriptor based on the Fourier transform followed by … Continue reading Segmentation-free Word Spotting
User Annotation and Bounding Text Boxes
It is quite a challenging task to make a prefect bounding box for a word by hand. This paper deals with this problem, but the algorithm can also be used in the word spotter for finding a perfectly fitting box of the found word. The image shows how the user has marked the red box, … Continue reading User Annotation and Bounding Text Boxes
The Log-Polar Magnitude Feature Descriptor
The idea proposed in this paper is to use a log polar sampling scheme followed by a FFT in order to obtain a rotation and to some degree scale invariant feature descriptor. The feature vector will be rather short since just the most discriminative part of the FFT is used. A short feature vector for … Continue reading The Log-Polar Magnitude Feature Descriptor
Image Derivatives
This paper propose an efficient way to combine different spline types for computing first and second order derivatives (please see my video presentation). The following formula is derived, which shows that a spline surface can be rewritten as an outer product and a convolution Note that this equation is differentiable and this fact is used … Continue reading Image Derivatives
From quill 2 bytes
This cross disciplinary initiative takes its point of departure in the analysis of handwritten text manuscripts using computational methods from image analysis and linguistics. It sets out to develop a manuscript analysis technology providing automatic tools for large-scale transcription, linguistic analysis, digital paleography and generic data mining of historical manuscripts. Our mission is to develop … Continue reading From quill 2 bytes
