Online ISSN:1349-8606
Progress in Informatics  
No. 6, March 2009  
Pages 57-62  
 
Statistical string similarity model for information linkage
Atsuhiro TAKASU

[1] M. Bilenko and R. J. Mooney, “Adaptive Duplicate Detection Using Learnable String Similarity Measures”. In Proc. of 9th Intl. Conf. on Knowledge Discovery and Data Mining (KDD03), pp.39-48, 2003.

[2] S. Deligne and F. Bimbot, “Language Modeling by Variable Length Sequences: Theoretical Formulation and Evaluation of Multigrams”. In Proc. of Intl. Conf. on Acoustics, Speech, and Signal Processing, pp.169-172, 1995.

[3] S. Kahan, T. Pavlidis, and H. S. Baird, “On the recognition of printed characters of any font and size”. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.9, no.2, pp.274-288, March 1987.

[4] T. Kuhn, H. Niemann, and E. G. Schukat-Talamazzini, “Ergodic Hidden Markov Models and Polygrams for Language Modeling”. In Proc. of Intl. Conf. on Acoustics, Speech, and Signal Processing, pp.357-360, 1994.

[5] Y. Li, D. Lopresti, and A. Tomkins, “Validation of Document Image Defect Models for Optical Character Recognition”. In Proc. of 3rd Annual Symposium on Document Analysis and Information Retrieval, pp.137-150, 1994.

[6] A. Myka and U. Guntzer, “Fuzzy Full-Text Searches in OCR Database”. In Proc. of Forum on Research & Technology Advances in Digital Libraries, pp.87-100, 1995.

[7] G. Navarro, “A guided tour to approximate string matching”. ACM Computing Surveys, vol.33, no.1, pp.31-88, 2001.

[8] M. Ohta, A. Takasu, and J. Adachi, “Probabilistic Automaton Model for Fuzzy English-text Retrieval”. In Lecture Notes in Computer Science 1923, pp.35-44, 2000.

[9] L. R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition”. Proceedings of the IEEE, vol.77, no.2, pp.257-286, 1989.

[10] E. S. Ristad and P. N. Yianilos, “Learning string-edit distance”. IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.20, no.5, pp.522-532, 1998.

[11] A. Takasu, “Bibliographic Attribute Extraction from Erroneous References Based on a Statistical Model”. In Proc. of 3rd ACM & IEEE Joint Conf. on Digital Libraries, pp.49-60, 2003.

[12] A. Takasu and K. Aihara, “DVHMM: Variable Length Text Recognition Error Model”. In Proc. of 15th Intl. Conf. on Pattern Recognition, pp.110-114, 2002.
