Ari Pirkola
Academy of Finland

University of Tampere, Department of Information Studies and Interactive Media

Publications

Recent Publications (2005-2011)

Pirkola, A. (2011). Constructing Topic-Specific Search Keyphrase Suggestion Tools for Web Information Retrieval [long paper]. Accepted for ISI 2011 - 12th International Symposium on Information Science, Hildesheim, Germany, March 9 - 11, 2011.

Pirkola, A. (2011). A Web Search System Focused on Climate Change. Accepted for EOGC 2011 - The Third Conference on Earth Observation of Global Changes, Munich, Germany, April 13 - 15, 2011.

Pirkola, A. and Talvensaari, T. (2010). A Topic-Specific Web Search System Focusing on Quality Pages. ECDL - The European Conference on Research and Advanced Technology for Digital Libraries, Glasgow, September 6 - 10, 2010.

Pirkola, A. and Talvensaari, T. (2010). Addressing the limited scope problem of focused crawling using a result merging approach. Proceedings of the 25th Annual ACM Symposium on Applied Computing (ACM SAC), Sierre, Switzerland, March 22 - 26, 2010, pp. 1735 – 1740.

Talvensaari, T. and Pirkola, A. (2010). Multilingual focused crawling: Fetching topic-specific pages in different languages. Accepted for ATINER 2010.

Pirkola, A. (2009). The effectiveness of Web search engines to index new sites from different countries. Information Research, 14(2).

Pirkola, A. and Talvensaari, T. (2009). Implementing a collaborative encyclopedia related Web search engine. IADIS, WWW/Internet 2009, Rome, Italy, November 19-22.

Talvensaari, T. and Pirkola, A. (2009). Developing a focused crawling system capable of identifying link word variants. IADIS, WWW/Internet 2009, Rome, Italy, November 19-22.

Pirkola, A. and Talvensaari, T. (2009). Effects of crawling strategies on the performance of focused Web crawling. WEBIST – 5th International Conference on Web Information Systems and Technologies. Lisbon, Portugal, March 23-26.

Pirkola, A. and Talvensaari, T. (2009). Effects of start URLs in focused Web crawling. INFORUM, 15th Conference on Professional Information Resources, Prague, May 27-29.

Pirkola, A. and Talvensaari, T. (2009). Evaluating global link structure of the Web for focused crawling in the genomics and genetics domains. Biostec 2009, International Joint Conference on Biomedical Engineering Systems and Technologies, Porto, Portugal, January 14-17, 2009, pp. 499-502.

Keskustalo, H., Järvelin, K., Pirkola, A., Sharma, T. and Lykke Nielsen, M. (2009). Test collection-based IR evaluation needs extension toward sessions - A case of extremely short queries. The Fifth Asia Information Retrieval Symposium (AIRS 2009), Sapporo, Hokkaido, Japan, October 21-23, 2009.

Loponen, A., Pirkola, A., Järvelin, K. & Keskustalo, H. (2008). An effective implementation of the FITE-TRT method for OOV word translation. Proceedings of the 30th European Conference on Information Retrieval (ECIR 2008), Glasgow, Scotland.

Pirkola, A. (2008). Extracting variant forms of chemical names for information retrieval. Information Research, 13(3).

Talvensaari, T., Pirkola, A., Järvelin, K., Juhola, M. and Laurikkala, J. (2008). Focused Web crawling in the acquisition of comparable corpora. Information Retrieval, 11(5): 427-445.

Keskustalo, H., Järvelin, K., Pirkola, A. and Kekäläinen, J. (2008). Intuition-supporting visualization of user’s performance based on explicit negative higher-order relevance. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '08), Singapore, July 20 - 24, 2008.

Keskustalo, H., Järvelin, K. and Pirkola, A. (2008). Evaluating the effectiveness of relevance feedback based on a user simulation model: Effects of a user scenario on cumulated gain value. Information Retrieval, 11(3): 209-228.

Pirkola, A., Toivonen, J., Keskustalo, H. & Järvelin, K. (2007). Frequency-based identification of correct translation equivalents (FITE) obtained through transformation rules. ACM Transactions on Information Systems (TOIS), 26(1).

Pirkola, A. (2007). Focused crawling: a means to acquire biological data from the Web. VLDB Workshop in Data Mining in Bioinformatics, Vienna, September 22, 2007.

Pirkola, A., Toivonen, J., Keskustalo, H. & Järvelin, K. (2006). FITE-TRT: A high quality translation technique for OOV words. Proceedings of the 21st Annual ACM Symposium on Applied Computing (ACM SAC), Dijon, France, April 23 - 27, 2006, pp. 1043 – 1049.

Cosijn, E., Pirkola, A., Bothma, T. & Järvelin, K. (2006). Cross-cultural information retrieval: searching for English documents using Zulu queries. Proceedings of SCECSAL: Librarianship as a bridge to an information society in eastern, central and southern Africa. Dar es Salaam, Tanzania, July 2006, pp. 648 – 663.

Keskustalo, H., Järvelin, K. & Pirkola, A. (2006). The effects of relevance feedback quality and quantity in interactive relevance feedback - a simulation based on user modeling. Proceedings of the 28th European Conference on Information Retrieval (ECIR 2006), London, UK.

Järvelin, A., Kumpulainen, S., Pirkola, A., & Sormunen, E. (2006). Dictionary-independent translation in CLIR between closely related languages. 6th Dutch-Belgian Information Retrieval Workshop (DIR 2006), Delft, The Netherlands.

Toivonen, J., Pirkola, A., Keskustalo, H., Visala, K. & Järvelin, K. (2005). Translating cross-lingual spelling variants using transformation rules. Information Processing & Management, 41(4): 859-872.

Cosijn, E., Bothma, T., Pirkola, A., Järvelin, K., Keskustalo, H. & De Wet, K. (2005). Cross language information retrieval in South African indigenous languages. Southern African Online Information Conference, Pretoria, South Africa, June 21-23, 2005.


Selected Publications (1998-2004)

Pirkola, A., Toivonen, J., Keskustalo, H., Visala, K. & Järvelin, K. (2003). Fuzzy translation of cross-lingual spelling variants. The 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '03), Toronto, Canada, Jul 28 - Aug 1, 2003, pp. 345 - 352.
 
Pirkola, A., Puolamäki, D. & Järvelin, K. (2003). Applying query structuring in cross-language retrieval. Information Processing & Management, 39(3): 391 - 402.

Keskustalo, H., Pirkola, A., Visala, K., Leppänen, E. & Järvelin, K. (2003). Non-adjacent digrams improve matching of cross-lingual spelling variants. String Processing and Information Retrieval (SPIRE '03) Conference, Manaus, Brazil, Oct 8-10, 2003, pp. 252 - 265.

Pirkola, A., Cosijn, E., Bothma, T. & Nel, JG. (2002). Cross-lingual information access in indigenous languages: a case study Zulu. Cross-Language Information Retrieval: A Research Roadmap. Workshop held at the SIGIR’2002, Tampere, August 15, pp. 38-42.

Pirkola, A., Keskustalo, H., Leppänen, E., Känsälä, A.-P. & Järvelin, K. (2002). Targeted s-gram matching: A novel n-gram matching technique for cross- and monolingual word form variants. Information Research, 7(2).

Pirkola, A., Leppänen, E. & Järvelin, K. (2002). The RATF Formula (Kwok’s Formula): Exploiting average term frequency in cross-language retrieval. Information Research, 7(2).

Hedlund, T., Pirkola, A. & Järvelin, K. (2001). Aspects of Swedish morphology and semantics from the perspective of mono- and cross-language retrieval. Information Processing & Management, 37(1): 147-161.

Pirkola, A. (2001). Morphological typology of languages for IR. Journal of Documentation, 57(3): 330-348.

Pirkola, A. & Järvelin, K. (2001). Employing the resolution power of search keys. Journal of the American Society for Information Science and Technology, 52(7): 575-583.

Pirkola, A., Hedlund, T., Keskustalo, H. & Järvelin, K. (2001). Dictionary-based cross-language information retrieval: problems, methods, and research findings. Information Retrieval, 4(3/4): 209-230.

Pirkola, A. (1998). The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval. The 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, August 24-28. New York: ACM, pp. 55-63.

Dissertation

Pirkola, A. (1999). Studies on linguistic problems and methods in text retrieval: The effects of anaphor and ellipsis resolution in proximity searching, and translation and query structuring methods in cross-language retrieval. PhD Dissertation. University of Tampere, Department of Information Studies. Acta Universitatis Tamperensis 672.