Hyppää sisältöön
    • Suomeksi
    • In English
Trepo
  • Suomeksi
  • In English
  • Kirjaudu
Näytä viite 
  •   Etusivu
  • Trepo
  • TUNICRIS-julkaisut
  • Näytä viite
  •   Etusivu
  • Trepo
  • TUNICRIS-julkaisut
  • Näytä viite
JavaScript is disabled for your browser. Some features of this site may not work without it.

Monolithic vs. hybrid controller for multi-objective Sim-to-Real learning

Dag, Atakan; Angleraud, Alexandre; Yang, Wenyan; Strokina, Nataliya; Pieters, Roel S.; Lanz, Minna; Kämäräinen, Joni-Kristian (2021)

 
Avaa tiedosto
iros2021_sim2real.pdf (1.687Mt)
Lataukset: 



Dag, Atakan
Angleraud, Alexandre
Yang, Wenyan
Strokina, Nataliya
Pieters, Roel S.
Lanz, Minna
Kämäräinen, Joni-Kristian
IEEE
2021


This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited
doi:10.1109/IROS51168.2021.9636426
Näytä kaikki kuvailutiedot
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202211098271

Kuvaus

Peer reviewed
Tiivistelmä
Simulation to real (Sim-to-Real) is an attractive approach to construct controllers for robotic tasks that are easier to simulate than to analytically solve. Working Sim-to-Real solutions have been demonstrated for tasks with a clear single objective such as "reach the target". Real world applications, however, often consist of multiple simultaneous objectives such as "reach the target" but "avoid obstacles". A straightforward solution in the context of reinforcement learning (RL) is to combine multiple objectives into a multi-term reward function and train a single monolithic controller. Recently, a hybrid solution based on pre-trained single objective controllers and a switching rule between them was proposed. In this work, we compare these two approaches in the multi-objective setting of a robot manipulator to reach a target while avoiding an obstacle. Our findings show that the training of a hybrid controller is easier and obtains a better success-failure trade-off than a monolithic controller. The controllers trained in simulator were verified by a real set-up.
Kokoelmat
  • TUNICRIS-julkaisut [15271]
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste
 

 

Selaa kokoelmaa

TekijätNimekkeetTiedekunta (2019 -)Tiedekunta (- 2018)Tutkinto-ohjelmat ja opintosuunnatAvainsanatJulkaisuajatKokoelmat

Omat tiedot

Kirjaudu sisäänRekisteröidy
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste