Abstract:
The emergence of deep convolutional neural networks (CNNs) in recent years was an important breakthrough in the field of computer vision, with CNN models topping competition leaderboards for a broad range of computer vision problems, including image classification, object detection and tracking, and semantic segmentation. More recently, the Transformer architecture, originally developed for natural language processing, has been adapted to computer vision tasks and is quickly gaining a reputation within the deep learning research community as a powerful alternative to CNNs. In this talk I will give an overview of how Vision Transformers work and discuss my current research into extending this new deep learning architecture to the task of tracking the pose joint locations of human subjects in videos.
744 Motooka, Nishi-ku, Fukuoka 819-0395, Japan
TEL: 092-802-4402
FAX: 092-802-4405
(Mathematics / IMI Administrative Office)
IMI (Institute of Mathematics for Industry)
Joint Usage / Research Center
Seminar
Automatic human pose tracking with Vision Transformers
Date and time | 2021-11-16 12:00 to 2021-11-16 13:00 |
Venue | Zoom |
Intended audience | |
Speaker | Aiden Nibali (La Trobe University) |