Trustworthy LLMs for Ethically Aligned AI-based Systems: A PhD Research Plan
de Cerqueira, José Antonio Siqueira; Rousi, Rebekah; Xi, Nannan; Hamari, Juho; Kemell, Kai-Kristian; Abrahamsson, Pekka (2025)
de Cerqueira, José Antonio Siqueira
Rousi, Rebekah
Xi, Nannan
Hamari, Juho
Kemell, Kai-Kristian
Abrahamsson, Pekka
2025
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202506137156
https://urn.fi/URN:NBN:fi:tuni-202506137156
Kuvaus
Peer reviewed
Tiivistelmä
In response to growing concerns around trustworthiness and ethical alignment in AI systems, this PhD aims to investigate how Large Language Models (LLMs) can be leveraged to support ethically aligned AI development in software engineering. Despite advancements, integrating ethical principles into AI workflows remains challenging, particularly in real-world applications that require compliance with emerging regulations, such as the EU AI Act. We will develop a Visual Studio Code (VSCode) Generative AI (GenAI) Extension powered by a multi-agent LLM system with Retrieval-Augmented Generation (RAG) capabilities. The extension will be designed to aid developers by evaluating code compliance with ethical standards, providing actionable recommendations to embed trustworthiness from early stages of development. The GenAI Extension will be evaluated through an iterative design science approach, encompassing dataset generation, ethical benchmarking, and practitioner testing. A dataset of over 2000 ethically aligned AI systems, will be created in compliance with leading regulatory frameworks, serving as a foundation for this tool's assessments. With this work, we hope to assist developers, particularly in startups and SMEs, by providing practical resources for building ethically aligned AI within limited resources. Through this approach, we aim to bridge the gap between abstract ethical principles and actionable software development practices, making ethical AI more accessible across industry contexts.
Kokoelmat
- TUNICRIS-julkaisut [22206]
