This paper propose a very simple feature detector which computes the mean, max and min of pixels over columns and rows in a square mask. Then the squared difference of the mean and max, and the mean and min, respectively are computed (for each row and each column) and stored in a feature vector. This … Continue reading Mean-Max-Mean Features
Interactive Word Spotting
In this paper we propose to use visualisation of word spotting results as a powerful tool to be able to manually determine in an efficient manner which found words are correct and which are incorrect. This will make it possible to quickly make a reliable transcription of each word. An Intelligent User Interface for Efficient … Continue reading Interactive Word Spotting
Semi-Automatic Transcription
Word spotting can be used to make a semi-automatic transcription of the text. The idea is that the user should mark up the word that needs to be transcribed and then the word spotter finds all occurrences of that word. Hence, the word needs to be transcribed only once, and the process can be performed in … Continue reading Semi-Automatic Transcription
The Radial Line Fourier Descriptor
One of several improvements made to the word spotter was to use faster descriptor based on a few elements of the Fourier transform of radial lines with logarithmic sampling. This makes the descriptor much faster, but still robust enough. Actually, it must not be too precise as the shape of the words varies in the text. … Continue reading The Radial Line Fourier Descriptor
Background Removal and Binarisation
Since documents often are somewhat degraded it is important to be able to efficiently remove the disturbing background from the text. The next step would be to binarise the segmented text, but in our wordspotter we prefer to work on the background removed text. We have published two papers dealing with these problems. Automatic Document … Continue reading Background Removal and Binarisation
