The Institute of Computer Science of the Polish Academy of Sciences is a leading national center of research in Computer Science, with some focus on the fundamental and applied research in the areas of Artificial Intelligence and Information Systems. The research activities are organized in two departments: Department of Theoretical Foundations of Computer Science, and department of Artificial Intelligence. There are 6 groups within the AI department, including the Linguistic Engineering Group (LEG), the largest Polish team specializing in Natural Language Processing, Linguistic Tools and Resources and in Corpus Linguistics. Within the last 5 years LEG has been involved in 4 European projects and 7 national projects, as well as a number of bilateral co-operation projects. Among the tools developed within LEG are taggers, shallow and deep parsers, as well as various machine learning and rule-based information extraction tools. LEG ICS PAS also developed the first large linguistically annotated corpus of Polish and currently coordinates the National Corpus of Polish project, which involves all previous developers of Polish corpora. Within the European CLARIN project, LEG is responsible for a working group dealing with the integration of linguistic tools and resources.
As a leader of WP4, ICS PAS supervises the formation of the core part of the project – the creation of language processing chains used for text annotation for each of the target languages. With their expertise in Natural Language Processing, ICS PAS brings an essential fragment to the integral knowledge and diversity of the ATLAS consortium. Furthermore, their experience with European and national projects allows them to give to other partners valuable feedback related to both the technical and the formal implementation of the project. Apart from their responsibilities as a WP leader, ICS PAS carries out the following major tasks:
- T2.1 – Specification of the linguistic framework used during the project
- T3.2 – Fine-tuning of the categorization tool to Polish language
- T4.1 – Implementation of a language processing chain for Polish language
- T5.1 – Implementation of a text summarization tool for Polish language