CIS - The Tampere Research Center for Information and Systems


The following two projects operate within the NGIS research group:


Full name: Integration, Harmonization, and Profiling of Unfamiliar and Heterogeneous XML Data Sources

Description: In our digitalized society, it is essential to build systems that provide users with the right information at the right time. Usually, this means that data must be collected from multiple autonomous data sources in order to satisfy the users' information needs. These data sources may be related to one or more conceptual domains. Within a single conceptual domain, information needs may arise unexpectedly and require comparing and analyzing data on organizations, persons, results, etc. On the other hand, in several applications, such as criminal investigation and investigative journalism, there are also ad hoc needs to discover semantic relationships among data sources that belong to disparate conceptual domains. The common feature of these cases is that they require combining data from autonomous and heterogeneous data sources that were not originally intended to co-operate. In these situations, the users typically have little familiarity with the contents, structure, and/or semantics of the underlying data sources.

In our work, we aim to develop tools through which the user can incrementally obtain increasingly detailed knowledge of the available data sources. Based on this knowledge, she can both assess the usefulness of a specific data source with respect to her information needs and, if necessary, customize it to better suit her purpose. Further, we design and implement tools that are capable of satisfying sophisticated ad hoc information needs based on these customized data sources. In our approach, we assume that all available data sources are represented in XML (eXtensible Markup Language). By relying on a single data format, we avoid several problems caused by the heterogeneity of data formats. We selected XML because XML data sources are very common and most other data formats can easily be converted into XML.
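As an illustration of the last point, the following minimal Python sketch converts a small CSV table into XML using only the standard library. It is not part of the project's tools; the function name `csv_to_xml`, the tag names `records` and `record`, and the sample data are hypothetical and chosen only for this example. The sketch assumes that the CSV column names are valid XML element names.

```python
import csv
import io
import xml.etree.ElementTree as ET


def csv_to_xml(csv_text, root_tag="records", row_tag="record"):
    """Convert CSV text with a header row into an XML string.

    Hypothetical helper for illustration; assumes every column
    name is a valid XML element name.
    """
    root = ET.Element(root_tag)
    for row in csv.DictReader(io.StringIO(csv_text)):
        record = ET.SubElement(root, row_tag)
        for column, value in row.items():
            # Each CSV column becomes a child element of the record.
            ET.SubElement(record, column).text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data, not taken from any real source.
sample = "name,organization\nAlice,Example Org\n"
print(csv_to_xml(sample))
```

Real sources would additionally need handling for column names that are not valid element names, for missing values, and for nested structure, none of which this sketch attempts.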

People: Kalervo Järvelin, Mikko Kuru, Katja Moilanen, Timo Niemi, and Turkka Näppilä


Full name: Tampere Information Retrieval and Indexing of XML

Description: [...]

People: Paavo Arvola, Marko Junkkari, Jaana Kekäläinen, Sami Kiviharju, and Johanna Vainio


Last update: 16.1.2012 8.57

