Information portal on various topics of management of public resources of the Portuguese State

Project sheet

Name

Full Professor in “Multimedia Information Systems”

Total project amount

180,34 thousand €

Amount paid

0 €

Non-refundable funding

180,34 thousand €

Loan funding

0 €

Start date

01.02.2025

Expected end date

31.03.2026

Dimension

Resilience

Component

Qualifications and Skills

Investment

Science Plus Training

Operation code

02/C06-i06/2024.P2023.14864.TENURE.054

Summary

Improving multimedia information systems that close the gap between human information-needs and the information contained in large repositories of multimodal data, while leveraging well-grounded machine learning methods, is a key research challenge nowadays and in the years to come.The prospective Professor will develop research work in Multimedia Information Systems. In the short-term, it will concentrate on (i) multimodal conversational systems, (ii) planning in large vision and language models, and in the long-term it will address (iii) generative AI methods for multimedia content. A brief description of the expected research activities is given next:Multimodal Conversational Systems – Conversational systems are in its infancy with major breakthroughs that are not yet fully understood. The field is moving quickly, and it will be the root of great changes in the next decade. This natural way of accessing information will turn Information Systems into a completely pervasive commodity. Some of the most urgent research problems that need to be addressed in this area include its visual and linguistic reasoning capabilities, its lack of a real-world model, and its limited capability to keep track of state and sequence. All these capabilities are a corner stone for conversational multimodal systems that aim to guide humans in the physical world.Planning in large vision and language models – Despite the groundbreaking success of large vision and language models, their lack of control, noticeable in its hallucinations, has been a common critique of current LLMs. Hence, being able to control and audit LLMs is critical to many application domains where the safety of people is at risk. For these reasons, one would expect to see LLMs that can be instructed to follow a plan and guide a user through a sequence of actions.Generative AI for Multimedia Content – Automatically generating an image or a video from a text is currently an emerging area that has the potential to disrupt many areas such as video summarization, story illustration, multimodal dialog, and also as a data augmentation tool for other methods. Similarly to LLMs, empirical measures of consistency are still far from being sufficiently reliable with many hallucinations.The prospective Professor will enlarge the educational offer of DEEC/IST in this area. Two possible courses strongly related to the scientific activity to be developed by the prospective Professor are (i) Vision and Natural Language Systems (MSc level) and (ii) Deep Multimodal Information Processing (PhD level).The goal of the first course is to cover the first notions of Vision and Language information processing, addressing (i) learning algorithms, (ii) natural language, and (iii) visual analysis of information. The second course covers (i) multimodal information embeddings, (ii) storytelling and summarization of information streams, and (iii) quality of experience in visual information.This work is strongly related to the activities of DEEC/IST and the Human Language Technology group from INESC-ID. The Human Language Technology group (HLT) includes researchers/faculty from various universities, as well as graduate and post-graduate researchers, totaling around 30 researchers. Their background ranges from Electrical Engineering to Computer Science, and Linguistics. This strongly interdisciplinary group is actively involved in many areas of language research and development, both spoken and written, including, without limitation, speech recognition, speech synthesis, speech coding, speech understanding, audio indexing, multimodal dialogue systems, language and dialect identification, speech-to-speech machine translation, automatic summarization, natural language database interfaces, natural language generation, text simplification, named entity recognition, question answering, among others and in no particular order. The group has a very close cooperation with the spin-off company Voice Interaction, and with Unbabel, launched by a former PhD student.

Beneficiaries

Within the scope of the Recovery and Resilience Plan, two types of beneficiaries are responsible for carrying out the projects and using the funding provided. Due to their similar role, the reference to these two types of beneficiaries has been simplified and unified under the term "Beneficiary".
The two types are::
  • Direct Beneficiaries are those whose funding and projects to implement are part of the Recovery and Resilience Plan that has been negotiated and approved by the European Union;
  • Final Beneficiaries are those whose funding and projects to implement are approved following a selection process through Calls for Applications.

Call for applications

As part of the Call for Applications, submissions are requested to select the projects and final beneficiaries to whom funding will be awarded. Specific selection criteria are defined for each call, which must be reflected in the applications submitted and assessed.

The project is appraised on the basis of its compliance with the selection criteria laid down in the calls for applications, and a final score may be awarded, where applicable.

Final evaluation score

9,3
Important note

The components for calculating the assessment score can be found in the selection criteria document mentioned below.

Selection criteria

The funding selection criteria to which this project and its final beneficiary were subject and its score can be found in detail on the Recuperar Portugal platform.

Beneficiaries

Intermediate beneficiaries

Beneficiaries

Procurement

Beneficiaries representing public entities implement their project by signing one or more contracts with suppliers for goods or services through public procurement procedures.

To ensure and provide the utmost transparency in all these contracts, a list of the contracts that were signed under this project is available here, along with the information available on the Base.Gov platform. Please note that, according to the legislation in force at the time the contract was signed, some exceptions do not require the publication of the contracts signed on this platform, and, therefore, no information is available in such cases.

Geographic distribution

180,34 thousand €

Total amount of the project

Where was the money spent

By county

1 county financed .

  • Lisboa 180,34 thousand € ,
Source EMRP
10.02.2026
All themes
Transparency without leading