Hyppää sisältöön
    • Suomeksi
    • In English
Trepo
  • Suomeksi
  • In English
  • Kirjaudu
Näytä viite 
  •   Etusivu
  • Trepo
  • Opinnäytteet - ylempi korkeakoulututkinto
  • Näytä viite
  •   Etusivu
  • Trepo
  • Opinnäytteet - ylempi korkeakoulututkinto
  • Näytä viite
JavaScript is disabled for your browser. Some features of this site may not work without it.

Robotics Motion Control by Using Deep Reinforcement Learning

Zhou, Yi (2021)

 
Avaa tiedosto
ZhouYi.pdf (2.933Mt)
Lataukset: 



Zhou, Yi
2021

Master's Programme in Computing Sciences
Informaatioteknologian ja viestinnän tiedekunta - Faculty of Information Technology and Communication Sciences
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Hyväksymispäivämäärä
2021-11-16
Näytä kaikki kuvailutiedot
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202110227773
Tiivistelmä
Recently, with the development of Artificial Intelligence and Deep Learning in the field of robotics, especially to control robots for specific tasks. The traditional approach uses a large number of models to calculate the most suitable and stable operating points, however, this makes it necessary to spend a lot of time to collect models and therefore this approach is not generalizable. Other methods use neural networks to predict possible operating points from the model's 3D data, which is faster but also depends on the training of the neural network. However, Deep Reinforcement Learning combines the benefits of Deep Learning in the vision domain with the generalization of Reinforcement Learning, which allows robots to learn to perform specific tasks by interacting with their environment. The purpose of this paper is to implement and improve the traditional deep reinforcement learning approach by applying popular Deep Reinforcement Learning methods to the Franka Emika Panda robot. Implementing different Deep Reinforcement Learning algorithms to control the robot to perform specific tasks such as reach, touch and grasp. Additional, improving feature extraction through secondary processing of observations. Finally, the impact of different feature exactors on the algorithm is compared.
Kokoelmat
  • Opinnäytteet - ylempi korkeakoulututkinto [39852]
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste
 

 

Selaa kokoelmaa

TekijätNimekkeetTiedekunta (2019 -)Tiedekunta (- 2018)Tutkinto-ohjelmat ja opintosuunnatAvainsanatJulkaisuajatKokoelmat

Omat tiedot

Kirjaudu sisäänRekisteröidy
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste