PRR Project
Full Professor in “Multimedia Information Systems”
Project sheet
Name
Full Professor in “Multimedia Information Systems”Total project amount
180,34 thousand €Amount paid
0 €Non-refundable funding
180,34 thousand €Loan funding
0 €Start date
01.02.2025Expected end date
31.03.2026Dimension
ResilienceComponent
Qualifications and SkillsInvestment
Science Plus TrainingOperation code
02/C06-i06/2024.P2023.14864.TENURE.054Summary
Improving multimedia information systems that close the gap between human information-needs and the information contained in large repositories of multimodal data, while leveraging well-grounded machine learning methods, is a key research challenge nowadays and in the years to come.The prospective Professor will develop research work in Multimedia Information Systems. In the short-term, it will concentrate on (i) multimodal conversational systems, (ii) planning in large vision and language models, and in the long-term it will address (iii) generative AI methods for multimedia content. A brief description of the expected research activities is given next:Multimodal Conversational Systems – Conversational systems are in its infancy with major breakthroughs that are not yet fully understood. The field is moving quickly, and it will be the root of great changes in the next decade. This natural way of accessing information will turn Information Systems into a completely pervasive commodity. Some of the most urgent research problems that need to be addressed in this area include its visual and linguistic reasoning capabilities, its lack of a real-world model, and its limited capability to keep track of state and sequence. All these capabilities are a corner stone for conversational multimodal systems that aim to guide humans in the physical world.Planning in large vision and language models – Despite the groundbreaking success of large vision and language models, their lack of control, noticeable in its hallucinations, has been a common critique of current LLMs. Hence, being able to control and audit LLMs is critical to many application domains where the safety of people is at risk. For these reasons, one would expect to see LLMs that can be instructed to follow a plan and guide a user through a sequence of actions.Generative AI for Multimedia Content – Automatically generating an image or a video from a text is currently an emerging area that has the potential to disrupt many areas such as video summarization, story illustration, multimodal dialog, and also as a data augmentation tool for other methods. Similarly to LLMs, empirical measures of consistency are still far from being sufficiently reliable with many hallucinations.The prospective Professor will enlarge the educational offer of DEEC/IST in this area. Two possible courses strongly related to the scientific activity to be developed by the prospective Professor are (i) Vision and Natural Language Systems (MSc level) and (ii) Deep Multimodal Information Processing (PhD level).The goal of the first course is to cover the first notions of Vision and Language information processing, addressing (i) learning algorithms, (ii) natural language, and (iii) visual analysis of information. The second course covers (i) multimodal information embeddings, (ii) storytelling and summarization of information streams, and (iii) quality of experience in visual information.This work is strongly related to the activities of DEEC/IST and the Human Language Technology group from INESC-ID. The Human Language Technology group (HLT) includes researchers/faculty from various universities, as well as graduate and post-graduate researchers, totaling around 30 researchers. Their background ranges from Electrical Engineering to Computer Science, and Linguistics. This strongly interdisciplinary group is actively involved in many areas of language research and development, both spoken and written, including, without limitation, speech recognition, speech synthesis, speech coding, speech understanding, audio indexing, multimodal dialogue systems, language and dialect identification, speech-to-speech machine translation, automatic summarization, natural language database interfaces, natural language generation, text simplification, named entity recognition, question answering, among others and in no particular order. The group has a very close cooperation with the spin-off company Voice Interaction, and with Unbabel, launched by a former PhD student.
Beneficiaries
The two types are::
- Direct Beneficiaries are those whose funding and projects to implement are part of the Recovery and Resilience Plan that has been negotiated and approved by the European Union;
- Final Beneficiaries are those whose funding and projects to implement are approved following a selection process through Calls for Applications.
Call for applications
As part of the Call for Applications, submissions are requested to select the projects and final beneficiaries to whom funding will be awarded. Specific selection criteria are defined for each call, which must be reflected in the applications submitted and assessed.
The project is appraised on the basis of its compliance with the selection criteria laid down in the calls for applications, and a final score may be awarded, where applicable.
Final evaluation score
The components for calculating the assessment score can be found in the selection criteria document mentioned below.
Selection criteria
Beneficiaries
Intermediate beneficiaries
Procurement
Beneficiaries representing public entities implement their project by signing one or more contracts with suppliers for goods or services through public procurement procedures.
To ensure and provide the utmost transparency in all these contracts, a list of the contracts that were signed under this project is available here, along with the information available on the Base.Gov platform. Please note that, according to the legislation in force at the time the contract was signed, some exceptions do not require the publication of the contracts signed on this platform, and, therefore, no information is available in such cases.
Geographic distribution
180,34 thousand €
Total amount of the project
Where was the money spent
By county
1 county financed .
-
Lisboa 180,34 thousand € ,