In this paper we propose to use visualisation of word spotting results as a powerful tool to be able to manually determine in an efficient manner which found words are correct and which are incorrect. This will make it possible to quickly make a reliable transcription of each word. An Intelligent User Interface for Efficient … Continue reading Interactive Word Spotting
Blog
Semi-Automatic Transcription
Word spotting can be used to make a semi-automatic transcription of the text. The idea is that the user should mark up the word that needs to be transcribed and then the word spotter finds all occurrences of that word. Hence, the word needs to be transcribed only once, and the process can be performed in … Continue reading Semi-Automatic Transcription
The Radial Line Fourier Descriptor
One of several improvements made to the word spotter was to use faster descriptor based on a few elements of the Fourier transform of radial lines with logarithmic sampling. This makes the descriptor much faster, but still robust enough. Actually, it must not be too precise as the shape of the words varies in the text. … Continue reading The Radial Line Fourier Descriptor
Background Removal and Binarisation
Since documents often are somewhat degraded it is important to be able to efficiently remove the disturbing background from the text. The next step would be to binarise the segmented text, but in our wordspotter we prefer to work on the background removed text. We have published two papers dealing with these problems. Automatic Document … Continue reading Background Removal and Binarisation
Segmentation-free Word Spotting
We developed a segmentation free word spotter, which means that each word do not have to be extracted from the text. Instead a sliding window (cyan box) is used to traverse the document. Key point matching was performed on four sets of different key points, using a descriptor based on the Fourier transform followed by … Continue reading Segmentation-free Word Spotting
