Recent Publications (2005-2011)
Pirkola, A. (2011). Constructing Topic-Specific Search Keyphrase Suggestion Tools for Web Information Retrieval [long paper]. Accepted for ISI 2011 - 12th International Symposium on Information Science, Hildesheim, Germany, March 9 - 11, 2011.
Pirkola, A. (2011). A Web Search System Focused on Climate Change. Accepted for EOGC 2011 - The Third Conference on Earth Observation of Global Changes, Munich, Germany, April 13 - 15, 2011.
Pirkola, A. and Talvensaari, T. (2010). A Topic-Specific Web Search System Focusing on Quality Pages. ECDL - The European Conference on Research and Advanced Technology for Digital Libraries, Glasgow, September 6 - 10, 2010.
Pirkola, A. and Talvensaari, T. (2010). Addressing the limited scope problem of focused crawling using a result merging approach. Proceedings of the 25th Annual ACM Symposium on Applied Computing (ACM SAC), Sierre, Switzerland, March 22 - 26, 2010, pp. 1735-1740.
Talvensaari, T. and Pirkola, A. (2010). Multilingual focused crawling: Fetching topic-specific pages in different languages. Accepted for ATINER 2010.
Pirkola, A. (2009). The effectiveness of Web search engines to index new sites from different countries. Information Research, 14(2).
Pirkola, A. and Talvensaari, T. (2009). Implementing a collaborative encyclopedia related Web search engine. IADIS, WWW/Internet 2009, Rome, Italy, November 19-22.
Talvensaari, T. and Pirkola, A. (2009). Developing a focused crawling system capable of identifying link word variants. IADIS, WWW/Internet 2009, Rome, Italy, November 19-22.
Pirkola, A. and Talvensaari, T. (2009). Effects of crawling strategies on the performance of focused Web crawling. WEBIST - 5th International Conference on Web Information Systems and Technologies, Lisbon, Portugal, March 23-26.
Pirkola, A. and Talvensaari, T. (2009). Effects of start URLs in focused Web crawling. INFORUM, 15th Conference on Professional Information Resources, Prague, May 27-29.
Pirkola, A. and Talvensaari, T. (2009). Evaluating global link structure of the Web for focused crawling in the genomics and genetics domains. Biostec 2009, International Joint Conference on Biomedical Engineering Systems and Technologies, Porto, Portugal, January 14-17, 2009, pp. 499-502.
Keskustalo, H., Järvelin, K., Pirkola, A., Sharma, T. and Lykke Nielsen, M. (2009). Test collection-based IR evaluation needs extension toward sessions - A case of extremely short queries. The Fifth Asia Information Retrieval Symposium (AIRS 2009), Sapporo, Hokkaido, Japan, October 21-23, 2009.
Loponen, A., Pirkola, A., Järvelin, K. & Keskustalo, H. (2008). An effective implementation of the FITE-TRT method for OOV word translation. Proceedings of the 30th European Conference on Information Retrieval (ECIR 2008), Glasgow, Scotland.
Pirkola, A. (2008). Extracting variant forms of chemical names for information retrieval. Information Research, 13(3).
Talvensaari, T., Pirkola, A., Järvelin, K., Juhola, M. and Laurikkala, J. (2008). Focused Web crawling in the acquisition of comparable corpora. Information Retrieval, 11(5): 427-445.
Keskustalo, H., Järvelin, K., Pirkola, A. and Kekäläinen, J. (2008). Intuition-supporting visualization of user’s performance based on explicit negative higher-order relevance. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '08), Singapore, July 20 - 24, 2008.
Keskustalo, H., Järvelin, K. and Pirkola, A. (2008). Evaluating the effectiveness of relevance feedback based on a user simulation model: Effects of a user scenario on cumulated gain value. Information Retrieval, 11(3): 209-228.
Pirkola, A., Toivonen, J., Keskustalo, H. & Järvelin, K. (2007). Frequency-based identification of correct translation equivalents (FITE) obtained through transformation rules. ACM Transactions on Information Systems (TOIS), 26(1).
Pirkola, A. (2007). Focused crawling: a means to acquire biological data from the Web. VLDB Workshop in Data Mining in Bioinformatics, Vienna, September 22, 2007.
Pirkola, A., Toivonen, J., Keskustalo, H. & Järvelin, K. (2006). FITE-TRT: A high quality translation technique for OOV words. Proceedings of the 21st Annual ACM Symposium on Applied Computing (ACM SAC), Dijon, France, April 23 - 27, 2006, pp. 1043-1049.
Cosijn, E., Pirkola, A., Bothma, T. & Järvelin, K. (2006). Cross-cultural information retrieval: searching for English documents using Zulu queries. Proceedings of SCECSAL: Librarianship as a bridge to an information society in eastern, central and southern Africa. Dar es Salaam, Tanzania, July 2006, pp. 648-663.
Keskustalo, H., Järvelin, K. & Pirkola, A. (2006). The effects of relevance feedback quality and quantity in interactive relevance feedback - a simulation based on user modeling. Proceedings of the 28th European Conference on Information Retrieval (ECIR 2006), London, UK.
Järvelin, A., Kumpulainen, S., Pirkola, A. & Sormunen, E. (2006). Dictionary-independent translation in CLIR between closely related languages. 6th Dutch-Belgian Information Retrieval Workshop (DIR 2006), Delft, The Netherlands.
Toivonen, J., Pirkola, A., Keskustalo, H., Visala, K. & Järvelin, K. (2005). Translating cross-lingual spelling variants using transformation rules. Information Processing & Management, 41(4): 859-872.
Cosijn, E., Bothma, T., Pirkola, A., Järvelin, K., Keskustalo, H. & De Wet, K. (2005). Cross language information retrieval in South African indigenous languages. Southern African Online Information Conference, Pretoria, South Africa, June 21-23, 2005.
Selected Publications (1998-2004)
Pirkola, A., Toivonen, J., Keskustalo, H., Visala, K. & Järvelin, K. (2003). Fuzzy translation of cross-lingual spelling variants. The 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '03), Toronto, Canada, Jul 28 - Aug 1, 2003, pp. 345-352.
Pirkola, A., Puolamäki, D. & Järvelin, K. (2003). Applying query structuring in cross-language retrieval. Information Processing & Management, 39(3): 391-402.
Keskustalo, H., Pirkola, A., Visala, K., Leppänen, E. & Järvelin, K. (2003). Non-adjacent digrams improve matching of cross-lingual spelling variants. String Processing and Information Retrieval (SPIRE '03) Conference, Manaus, Brazil, Oct 8-10, 2003, pp. 252-265.
Pirkola, A., Cosijn, E., Bothma, T. & Nel, JG. (2002). Cross-lingual information access in indigenous languages: a case study Zulu. Cross-Language Information Retrieval: A Research Roadmap. Workshop held at the SIGIR’2002, Tampere, August 15, pp. 38-42.
Pirkola, A., Keskustalo, H., Leppänen, E., Känsälä, A.-P. & Järvelin, K. (2002). Targeted s-gram matching: A novel n-gram matching technique for cross- and monolingual word form variants. Information Research, 7(2).
Pirkola, A., Leppänen, E. & Järvelin, K. (2002). The RATF Formula (Kwok’s Formula): Exploiting average term frequency in cross-language retrieval. Information Research, 7(2).
Hedlund, T., Pirkola, A. & Järvelin, K. (2001). Aspects of Swedish morphology and semantics from the perspective of mono- and cross-language retrieval. Information Processing & Management, 37(1): 147-161.
Pirkola, A. (2001). Morphological typology of languages for IR. Journal of Documentation, 57(3): 330-348.
Pirkola, A. & Järvelin, K. (2001). Employing the resolution power of search keys. Journal of the American Society for Information Science and Technology, 52(7): 575-583.
Pirkola, A., Hedlund, T., Keskustalo, H. & Järvelin, K. (2001). Dictionary-based cross-language information retrieval: problems, methods, and research findings. Information Retrieval, 4(3/4): 209-230.
Pirkola, A. (1998). The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval. The 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, August 24-28. New York: ACM, pp. 55-63.
Dissertation
Pirkola, A. (1999). Studies on linguistic problems and methods in text retrieval: The effects of anaphor and ellipsis resolution in proximity searching, and translation and query structuring methods in cross-language retrieval. PhD Dissertation. University of Tampere, Department of Information Studies. Acta Universitatis Tamperensis 672.