๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
Deep Learning/[๋…ผ๋ฌธ] Paper Review

Taskonomy: Disentangling Task Transfer Learning

by ์ œ๋ฃฝ 2023. 7. 16.
728x90
๋ฐ˜์‘ํ˜•

 

 

๐Ÿ’ก
<๋ฆฌ๋ทฐ>

 

 

<Taskonomy>๊ฐ€ ๋ญ๋ƒ?
Taskonomy๋Š” ์ปดํ“จํ„ฐ ๋น„์ „ ๋ถ„์•ผ์—์„œ ๋‹ค์–‘ํ•œ ์ž‘์—… ๊ฐ„์˜ ์ƒํ˜ธ ์˜์กด์„ฑ์„ ํƒ๊ตฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ฒ”์šฉ ๋น„์ „ ์‹œ์Šคํ…œ์„ ๊ตฌ์ถ•ํ•˜๋Š” ์—ฐ๊ตฌ.
Taskonomy๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•ด ํ•„์š”ํ•œ ์‹œ๊ฐ์  ํŠน์ง•๋“ค์ด ์„œ๋กœ ๊ณต์œ ๋  ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ์กฐ์‚ฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ชจ๋ธ์˜ ํ•™์Šต ํšจ์œจ์„ฑ๊ณผ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์ „์ด ํ•™์Šต ๋ฐฉ๋ฒ•์„ ํƒ๊ตฌ.

Taskonomy์˜ ๋ชฉํ‘œ๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ์‹œ๊ฐ์  ํŠน์ง•์„ ํƒ์ƒ‰ํ•˜์—ฌ, ์ž‘์—… ๊ฐ„์˜ ํ•™์Šต๊ณผ ์ผ๋ฐ˜ํ™”๋ฅผ ๊ฐœ์„ ํ•˜๊ณ  ์ž‘์—… ์ „ํ™˜์— ๋”ฐ๋ฅธ ๋น„์šฉ๊ณผ ๋…ธ๋ ฅ์„ ์ตœ์†Œํ™”ํ•˜๋Š” ๊ฒƒ.

 

 

 
 
๐Ÿ€
๋…ผ๋ฌธ ์š”์•ฝ: ์—ฌ๋Ÿฌ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ํŠน์ง•๋“ค์„ ๋ฐœ๊ฒฌํ•˜๊ณ  ์ด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ•œ ๋ชจ๋ธ์ด ๋‹ค์–‘ํ•œ ์ž‘์—…(๊ฐ์ฒด ๊ฒ€์ถœ, ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜, ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜)์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ. ์ด๋ฅผ ์œ„ํ•ด ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ ์ˆ˜์ง‘ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ด ex) segmentation๊ณผ ๊นŠ์ด ์ถ”์ • ์ž‘์—…(ํ”ฝ์…€์˜ ๊ฑฐ๋ฆฌ ์ •๋ณด)์„ ๊ตฌํ•œ๋‹ค๊ณ  ํ–ˆ์„ ๋•Œ, ๋‘ task๋Š” ์ด๋ฏธ์ง€ ๊ตฌ์กฐ์™€ ๊ณต๊ฐ„ ์ •๋ณด์— ๊ด€๋ จ์ด ์žˆ์Œ. ์ฆ‰, ๋‘ ์ž‘์—…์„ ์›๋ž˜ ๋ชจ๋ธ 2๊ฐœ๋ฅผ ์จ์„œ ๊ฐ๊ฐ ์ˆ˜ํ–‰ํ•ด์•ผ ํ–ˆ์ง€๋งŒ, ํ•œ ๋ชจ๋ธ๋กœ ๋‘ ๊ฐ€์ง€์˜ ์ •๋ณด(segmentation, ๊นŠ์ด ์ถ”์ • ์ž‘์—…)๋ฅผ ํ•™์Šตํ•ด์„œ segmentation๊ณผ ๊นŠ์ด ์ถ”์ • ์ž‘์—…์„ ๋ชจ๋‘ ์ˆ˜ํ–‰ํ•˜๊ฒŒ ๋จ
  • โ€ป Transferability๋ž€?
    • ํ•œ ์ž‘์—…์—์„œ ํ•™์Šตํ•œ ์ง€์‹์ด ๋‹ค๋ฅธ ๊ด€๋ จ ์ž‘์—…์— ์ ์šฉ ๊ฐ€๋Šฅํ•œ ์ •๋„๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๊ฐœ๋…
    • ์ฆ‰, ํ•œ ์ž‘์—…์—์„œ ์–ป์€ ํ•™์Šต ๊ฒฐ๊ณผ๋‚˜ ๋ชจ๋ธ์ด ๋‹ค๋ฅธ ์ž‘์—…์—๋„ ์ ์šฉ๋  ์ˆ˜ ์žˆ๋Š” ๋Šฅ๋ ฅ์„ ์˜๋ฏธํ•จ

0. Abstract

  • ์—ฌ๋Ÿฌ Visual Task(detection, depth estimation, edge detection) ๋“ฑ์€ ์„œ๋กœ ์—ฐ๊ด€์„ฑ, ์˜์กด์„ฑ์ด ์žˆ๋‹ค๋Š” ๊ฐ€์ • ํ•˜์— ๋…ผ๋ฌธ์ด ์‹œ์ž‘๋จ ⇒ transfer learning์˜ ๊ธฐ๋ณธ ๊ฐœ๋…์ž„.
  • ๋…ผ๋ฌธ์—์„œ๋Š” ์˜์กด์„ฑ ์ฐพ๋Š” ๊ฒƒ์„ ํ†ตํ•ด, transfer learning ์— ๋Œ€ํ•œ ๊ณ„์‚ฐ์ ์ธ ๋ถ„๋ฅ˜๋„ ‘Taskonomy’๋ฅผ ์„ค๋ช…ํ•จ
๐Ÿ’ก
โžก๏ธ ์ด ๋…ผ๋ฌธ์—์„œ๋Š” ๊ด€๋ จ๋œ task๋“ค ์‚ฌ์ด์—์„œ ์ง€์‹์„ ํšจ๊ณผ์ ์œผ๋กœ ์žฌ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜ ๋ณต์žก์„ฑ์„ ์ฆ๊ฐ€์‹œํ‚ค์ง€ ์•Š๊ณ , ๋‹ค์–‘ํ•œ ์ž‘์—…์„ ํ•œ ์‹œ์Šคํ…œ์—์„œ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๋Š” ์›์น™์ ์ธ ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•จ

1. Introduction

  • ์˜ค๋Š˜๋‚ ์˜ vision ๋ถ„์•ผ์—์„œ ๋‹ค๋ค„์ง€๋Š” task๋“ค์€ classification, depth estimation(๊นŠ์ด ์ถ”์ •), edge detection(๊ฒฝ๊ณ„์„  ๊ฒ€์ถœ), pose estimation(ํฌ์ฆˆ ์ถ”์ •) ๋“ฑ ๋‹ค์–‘ํ•จ
  • ์ด๋Ÿฌํ•œ task๋“ค์€ ์„œ๋กœ ๊ฐ„์— ๊ฐ€์ง€๊ณ  ์žˆ๋Š” ์ •๋ณด๋“ค์ด ์œ ์‚ฌํ•ด์„œ ๊ณต์œ ๋  task๋„ ๋ถ„๋ช…ํžˆ ์žˆ์„ํ…๋ฐ, ๊ฐ๊ฐ ํ•™์Šต์‹œํ‚ค๋ฉด ๋‚ญ๋น„ ์•„๋‹Œ๊ฐ€์— ๋Œ€ํ•œ ์˜๋ฌธ์ ์ด ์ƒ๊น€โžก๏ธ ์„œ๋กœ ๋‹ค๋ฅธ task๋“ค ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ๊ทธ๋ž˜ํ”„ ํ˜•ํƒœ๋กœ ํ‘œํ˜„ํ•˜๊ณ , ์ด๋ฅผ ์ด์šฉํ•ด ์ƒˆ๋กœ์šด task์— ๋Œ€ํ•œ ๋ชจ๋ธ์˜ ํ•™์Šต์„ ๋ณด๋‹ค ํšจ์œจ์ ์œผ๋กœ ํ•™์Šต์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๋ฐฉ์‹์„ ์ œ์•ˆํ•จ
  • Target Task๋ฅผ ๋‹จ๋…์œผ๋กœ ํ•™์Šตํ–ˆ์„ ๋•Œ ๋Œ€๋น„ ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋œ ์ˆ˜์ค€์„ ‘Transferability’ ์ฒ™๋„๋กœ ์ธก์ •ํ–ˆ๊ณ , Task ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ์œ ์‚ฌ๋„ํ–‰๋ ฌ(affinity matrix)๋กœ ํ‘œํ˜„ํ•œ ํ›„, Target Task์— ๋Œ€ํ•œ ์ตœ์ ์˜ Transfer policy๋ฅผ ์ฐพ์•„๋ƒ„ ( ๋ชจ๋“  ๊ณผ์ •์€ ๊ฐ task์— ๋Œ€ํ•œ prior knowledge๊ฐ€ ๊ฐœ์ž… ์•ˆํ•˜๋„๋ก ๊ตฌ์„ฑ๋จ )โžก๏ธ ๊ฒฐ๋ก ์ ์œผ๋กœ ๋น„์šฉ์ด ์ ๊ฒŒ ๋“ค๊ณ , ์ผ๋ฐ˜์ ์ธ ๋ฐ์ดํ„ฐ์…‹์— ์œ ํšจํ•˜๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์—ฌ์คŒ

2. Related Work

Taskonomy ๋ฐฉ๋ฒ•๊ณผ ๊ด€๋ จ๋œ ๋งค์šฐ ๋‹ค์–‘ํ•œ ๊ด€๋ จ ์—ฐ๊ตฌ ์ฃผ์ œ๋“ค์ด ์กด์žฌ.

  • Self-Supervised Learning
    • ๋ ˆ์ด๋ธ”์ด ์—†๋Š” ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ๋ชจ๋ธ์„ ํ•™์Šต์‹œ์ผœ์„œ ๊ฐ€์งœ ๋ ˆ์ด๋ธ” ์ƒ์„ฑ
    • ๋ ˆ์ด๋ธ”๋ง ๋น„์šฉ์ด ์ƒ๋Œ€์ ์œผ๋กœ ๋‚ฎ์€ task๋กœ๋ถ€ํ„ฐ ํ•™์Šตํ•œ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ, ๊ทธ์™€ ๋‚ด์žฌ์ ์œผ๋กœ ๊ด€๋ จ๋˜์–ด ์žˆ์œผ๋ฉด์„œ ๋ ˆ์ด๋ธ”๋ง ๋น„์šฉ์ด ๋” ๋†’์€ task์— ๋Œ€ํ•œ ํ•™์Šต์„ ์‹œ๋„ํ•˜๋Š” ๋ฐฉ๋ฒ•
    • source task๋ฅผ ์‚ฌ์ „์— ์‚ฌ๋žŒ์ด ์ˆ˜๋™์œผ๋กœ ์ง€์ •ํ•ด์ค˜์•ผ ํ•œ๋‹ค๋Š” ์ธก๋ฉด์—์„œ, Taskonomy ๋ฐฉ๋ฒ•๊ณผ ์ฐจ์ด๊ฐ€ ์กด์žฌ
  • Unsupervised Learning
    • ๋ ˆ์ด๋ธ”์ด ์—†๋Š” ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ, ๋ช…์‹œ์ ์ธ ์ง€๋„๋‚˜ ๋ ˆ์ด๋ธ” ์—†์ด ๋ฐ์ดํ„ฐ์˜ ํŒจํ„ด๊ณผ ๊ตฌ์กฐ๋ฅผ ๋ฐœ๊ฒฌ
    • (๋ ˆ์ด๋ธ”์ด ์—†๋Š” ์ƒํ™ฉ์—์„œ) ๋ฐ์ดํ„ฐ์…‹ ์ž์ฒด์— ๊ณตํ†ต์ ์œผ๋กœ ๋‚ด์žฌ๋˜์–ด ์žˆ๋Š” ์†์„ฑ์„ ํ‘œํ˜„ํ•˜๋Š” feature representation์„ ์ฐพ์•„๋ƒ„
    • Taskonomy ๋ฐฉ๋ฒ•์˜ ๊ฒฝ์šฐ ๊ฐ task ๋ณ„ ๋ ˆ์ด๋ธ”์„ ๋ช…์‹œ์ ์œผ๋กœ ํ•„์š”๋กœ ํ•จ.
  • Meta-Learning
    • ๋ชจ๋ธ์ด ํ•™์Šต ๋ฐฉ๋ฒ•์„ ํ•™์Šตํ•˜๋Š” ํ•™์Šต ์ ‘๊ทผ ๋ฐฉ์‹์œผ๋กœ, ๋‹ค์–‘ํ•œ ๊ณผ์ œ ๋ถ„ํฌ์—์„œ ์ง€์‹์ด๋‚˜ ์‚ฌ์ „ ์ •๋ณด๋ฅผ ์Šต๋“ํ•˜์—ฌ ์ƒˆ๋กœ์šด ๊ณผ์ œ์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™”์™€ ์ ์‘ ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๊ฒƒ
    • ๋ชจ๋ธ ํ•™์Šต์„ ์ƒ์œ„ ๋ ˆ๋ฒจ(meta-level)์—์„œ์˜ ์ข€ ๋” ‘์ถ”์ƒํ™”๋œ’ ๊ด€์ ์œผ๋กœ ์กฐ๋ช…ํ•˜๊ณ  ๋ชจ๋ธ์„ ๋” ํšจ๊ณผ์ ์œผ๋กœ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•œ ์ผ๋ฐ˜์ ์ธ ๋ฐฉ๋ฒ•์„ ์ฐพ์Œ
    • ๋ณต์ˆ˜ ๊ฐœ์˜ task๋“ค ๊ฐ„์˜ transferability๋ฅผ ์ข€ ๋” meta-level์—์„œ ์กฐ๋งํ•˜๋ฉด์„œ ์ด๋“ค์˜ structure๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•œ ์ผ๋ฐ˜์ ์ธ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค๋Š” ์ ์—์„œ, Taskonomy ๋ฐฉ๋ฒ•๊ณผ ์ผ์ข…์˜ ๊ณตํ†ต์ ์ด ์กด์žฌ
  • Multi-Task Learning
    • ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๊ฐ€ ํ•˜๋‚˜๋กœ ์ •ํ•ด์ ธ ์žˆ์„ ๋•Œ ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์—ฌ๋Ÿฌ task๋“ค์— ๋Œ€ํ•œ ์˜ˆ์ธก ๊ฒฐ๊ณผ๋“ค์„ ๋™์‹œ์— ์ถœ๋ ฅํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์—ฐ๊ตฌํ•˜๋Š” ์ฃผ์ œ
    • Taskonomy ๋ฐฉ๋ฒ•์€ ๋‘ task๋“ค ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ๋ช…์‹œ์ ์œผ๋กœ ๋ชจ๋ธ๋งํ•จ
  • Domain Adaptation
    • transfer learning์˜ ํ˜•ํƒœ๋กœ, task๋Š” ๋™์ผํ•˜์ง€๋งŒ, ์ž…๋ ฅ ๋ฐ์ดํ„ฐ์˜ domain์ด ํฌ๊ฒŒ ๋‹ฌ๋ผ์ง€๋Š” ๊ฒฝ์šฐ(source domain -> target domain) ์ตœ์ ์˜ transfer policy๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•œ ์—ฐ๊ตฌ ์ฃผ์ œ
    • Taskonomy์˜ ๊ฒฝ์šฐ, domain์ด ์•„๋‹Œ task๊ฐ€ ๋‹ฌ๋ผ์ง€๋Š” ๊ฒฝ์šฐ๋ฅผ ๊ฐ€์ •
  • Learning Theoretic
    • ๋ชจ๋ธ์˜ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ๋ณด์žฅํ•˜๊ธฐ ์œ„ํ•œ ๋ฐฉ๋ฒ•
    • ์œ„์˜ ์ฃผ์ œ๋“ค๊ณผ ์กฐ๊ธˆ์”ฉ ๊ฒน์นจ
    • ์œ„์˜ ๋ฐฉ์‹๋“ค์€ ๋Œ€๋ถ€๋ถ„ ๊ณ„์‚ฐ์ด ๋ถˆ๊ฐ€๋Šฅํ•œ ๊ณ„์‚ฐ์„ ํฌํ•จํ•จ
    • ํ˜น์€ ๋ถˆ๊ฐ€๋Šฅํ•œ ๊ณ„์‚ฐ์„ ํ”ผํ•˜๊ธฐ ์œ„ํ•ด task์— ๋งŽ์€ ์ œํ•œ์„ ๋‘์—ˆ์Œ
    • Taskonomy๋Š” ์œ„์˜ ๋ฐฉ์‹์œผ๋กœ๋ถ€ํ„ฐ ์˜๊ฐ์„ ์–ป์—ˆ์œผ๋‚˜, ์ด๋ก ์  ์ฆ๋ช…์„ ํ”ผํ•˜๊ณ  ์‹ค์šฉ์ ์ธ ์ ‘๊ทผ์œผ๋กœ ์‹œ๋„ํ•จ

 

3. Method

โ“
Source Task๊ฐ€ ๋ญ๊ณ , Target Task๋Š” ๋ญ์•ผ?

Source Task๊ฐ€ ๊ฐœ์™€ ๊ณ ์–‘์ด์˜ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ์ž‘์—…์ด๋ผ๊ณ  ๊ฐ€์ •ํ•ด๋ณด์ž. ์ด ์ž‘์—…์—์„œ ๋ชจ๋ธ์€ ์ฃผ์–ด์ง„ ์ด๋ฏธ์ง€๊ฐ€ ๊ฐœ์ธ์ง€ ๊ณ ์–‘์ด์ธ์ง€๋ฅผ ํŒ๋ณ„ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ํ•™์Šตํ•  ๊ฒƒ์ž„. ๊ทธ๋ฆฌ๊ณ  Target Task๋กœ๋Š” ๊ฐœ์˜ ์–ผ๊ตด ๊ฒ€์ถœ ์ž‘์—…์„ ํ•œ๋‹ค๊ณ  ๊ฐ€์ •. ์ด ์ž‘์—…์—์„œ๋Š” ์ด๋ฏธ์ง€ ๋‚ด์—์„œ ๊ฐœ์˜ ์–ผ๊ตด ์˜์—ญ์„ ์ฐพ์•„๋‚ด๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•  ๊ฒƒ์ž„.

Source Task์ธ ๊ฐœ์™€ ๊ณ ์–‘์ด์˜ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ์ž‘์—…์—์„œ ํ•™์Šต๋œ ๋ชจ๋ธ์€ ์ด๋ฏธ์ง€์˜ ํŠน์ง•๊ณผ ํŒจํ„ด์„ ํŒŒ์•…ํ•˜๋Š” ๋ฐ ๋Šฅ์ˆ™ํ•ด์ ธ์žˆ์Œ. ์ด๋Ÿฌํ•œ ์ง€์‹์€ Target Task์ธ ๊ฐœ์˜ ์–ผ๊ตด ๊ฒ€์ถœ ์ž‘์—…์—๋„ ์œ ์šฉํ•  ์ˆ˜ ์žˆ์Œ. ์˜ˆ๋ฅผ ๋“ค์–ด, ๊ฐœ์˜ ์–ผ๊ตด์€ ๊ฐœ ์ด๋ฏธ์ง€์—์„œ ํŠน์ •ํ•œ ๋ชจ์–‘๊ณผ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ง€๊ณ , Source Task์—์„œ ํ•™์Šต๋œ ๋ชจ๋ธ์€ ์ด๋ฏธ์ง€์—์„œ ์ด๋Ÿฌํ•œ ํŠน์ง•์„ ์ž˜ ์ดํ•ดํ•˜๊ณ  ์žˆ์„ ๊ฒƒ์ž„.

๋”ฐ๋ผ์„œ, Source Task์—์„œ ํ•™์Šต๋œ ๋ชจ๋ธ์„ ๊ฐ€์ ธ์™€์„œ Target Task์— ์ ์šฉํ•˜๋ฉด, ๊ฐœ์˜ ์–ผ๊ตด์„ ๊ฒ€์ถœํ•˜๋Š” ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋‹ค. Source Task์—์„œ ํ•™์Šตํ•œ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ์ž‘์—…์„ ํ†ตํ•ด ์–ป์€ ์ง€์‹์„ ํ™œ์šฉํ•˜์—ฌ Target Task์ธ ๊ฐœ์˜ ์–ผ๊ตด ๊ฒ€์ถœ ์ž‘์—…์—์„œ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๊ฒƒ์„ ๋งํ•จ. ์ฆ‰, Source Task์™€ Target Task๋Š” ์„œ๋กœ ๋‹ค๋ฅธ ์ž‘์—…์ด์ง€๋งŒ, Source Task์—์„œ ์–ป์€ ์ง€์‹์„ Target Task์— ์ ์šฉํ•˜์—ฌ ์ „์ด(transfer)ํ•˜๋Š” ๊ฒƒ์ด ๋ฐ”๋กœ Taskonomy์˜ ํ•ต์‹ฌ ๊ฐœ๋….

  • โ€ป Taskonomy๋ž€

    : ์ฃผ์–ด์ง„ Task Dictionary์—์„œ Task ๊ฐ„์˜ Transferability๋ฅผ ๋‚˜ํƒ€๋‚ธ, ๊ณ„์‚ฐ์ ์œผ๋กœ ๋„์ถœ์ด ๊ฐ€๋Šฅํ•œ Hypergraph

  • โ€ป Hypergraph๋ž€

    : ํ•˜๋‚˜์˜ edge๊ฐ€ ์—ฌ๋Ÿฌ Node๋ฅผ ์—ฐ๊ฒฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ทธ๋ž˜ํ”„ (๋‹จ์ˆœํ™”)

    โžก๏ธ ํ•˜๋‚˜์˜ target task์— ์—ฌ๋Ÿฌ source task๊ฐ€ ์—ฐ๊ฒฐ๋ผ์„œ, ์„ฑ๋Šฅ ๊ทน๋Œ€ํ™”๋ฅผ ์œ„ํ•ด ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค๊ณ  ํ•จ.

Taskonomy ๋ฐฉ๋ฒ• overview: Transferability ๋ชจ๋ธ๋ง ๋ฐ taxonomy ์ƒ์„ฑ ๊ณผ์ •

 

 

Taskonomy ๋ฐฉ๋ฒ• ( ์ด 4๋‹จ๊ณ„)

๐Ÿค–
1. Source Task Set S ๋‚ด์˜ ๊ฐ task์— ๋Œ€ํ•ด ํŠนํ™”๋œ ๋ชจ๋ธ์ธ task-specific network๋ฅผ ๊ฐ๊ฐ ๋…๋ฆฝ์ ์œผ๋กœ ํ•™์Šต
2. ์ง€์ •๋œ Transfer Order k ํ•˜์—์„œ, ์„œ๋กœ ๊ฐ„์˜ ์กฐํ•ฉ ์—ฐ์‚ฐ์„ ํ†ตํ•ด ๋งŒ๋“ค์–ด์ง€๋Š” source task(s) -> target task ์˜ ๊ฐ ์กฐํ•ฉ ๋ณ„ transferability๊ฐ€ ์ˆ˜์น˜ํ™”๋œ ํ˜•ํƒœ๋กœ ๊ณ„์‚ฐ
3. 2๋ฒˆ์—์„œ ๊ณ„์‚ฐ๋œ transferability์— ๋Œ€ํ•œ ์ •๊ทœํ™”๋ฅผ ํ†ตํ•ด affinity matrix๋ฅผ(์œ ์‚ฌ๋„ ํ–‰๋ ฌ) ์–ป์Œ
4. ๊ฐ transfer policy๋ฅผ ํƒ์ƒ‰ํ•ด์„œ ์ตœ์ ์˜ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” transfer policy๋ฅผ ์ฐพ์Œ
  • โ€ป Transfer Policy๋ž€

    : ๋ชจ๋ธ์ด ํ•œ ์ž‘์—…์—์„œ ํ•™์Šตํ•œ ์ง€์‹์„ ๋‹ค๋ฅธ ์ž‘์—…์— ์ „๋‹ฌํ•˜๊ณ  ํ™œ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๊ฒฐ์ •ํ•˜๋Š” ์ •์ฑ…์ด๋‚˜ ์ „๋žต์„ ์˜๋ฏธ

  • ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” Computer vision์—์„œ ์ผ๋ฐ˜์ ์œผ๋กœ ๋‹ค๋ฃจ๋Š” 26๊ฐ€์ง€์˜ task๋ฅผ ์ œ์‹œ.
  • Datasets: ์‹คํ—˜์— ์“ฐ์ธ ๋ฐ์ดํ„ฐ์…‹์€ ๋ณธ์ธ๋“ค์ด ์ง์ ‘ ์ œ์ž‘. 600๊ฐœ ๊ฑด๋ฌผ์˜ ์‹ค๋‚ด ์žฅ๋ฉด์— ๋Œ€ํ•œ 400๋งŒ ๊ฐœ์˜ ์ด๋ฏธ์ง€ ๋ฐ์ดํ„ฐ์…‹์„ ์ƒ์„ฑ, ์ด ๋ฐ์ดํ„ฐ๋ฅผ ๊ฐ€์ง€๊ณ  26๊ฐ€์ง€ task์— ๋Œ€ํ•œ ๋ ˆ์ด๋ธ”๋ง์„ ๋ชจ๋‘ ์ˆ˜ํ–‰ํ•จ.
26๊ฐ€์ง€ ์ค‘ 24๊ฐ€์ง€ task ์˜ˆ์ธก ๊ฒฐ๊ณผ ์˜ˆ์‹œ
26๊ฐ€์ง€์˜ vision task

3.1. Step I: Task-Specific Modeling (์ž‘์—…๋ณ„ ๋ชจ๋ธ๋ง)

๐Ÿ’ก
Source Task Set ๋‚ด์˜ ๊ฐ source task์— ๋Œ€ํ•˜์—ฌ task-specific networks๋ฅผ ๋…๋ฆฝ์ ์œผ๋กœ ํ•™์Šต
  • ๊ฐ Task-specific network๋Š” ๊ณตํ†ต์ ์œผ๋กœ encoder-decoder ๊ตฌ์กฐ๋ฅผ ์ง€๋‹˜

โ€ป ์ธ์ฝ”๋”๋Š” ๊ฐ•๋ ฅํ•œ ํ‘œํ˜„์„ ์ถ”์ถœํ•˜๊ธฐ์— ์ถฉ๋ถ„ํžˆ ํฌ๊ณ , ๋””์ฝ”๋”๋Š” ์ข‹์€ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜๊ธฐ์— ์ถฉ๋ถ„ํžˆ ํฌ์ง€๋งŒ ์ธ์ฝ”๋”๋ณด๋‹ค ํ›จ์”ฌ ์ž‘๋‹ค๊ณ  ํ•จ

3.2. Step II: Transfer Modeling

๐Ÿ’ก
Source task S์™€ Target task tt์— ๋Œ€ํ•ด์„œ S์˜ task-specific network์˜ Encoder์™€ ์ƒˆ๋กœ์šด Decoder๋ฅผ ํ•ฉ์ณ์ ธ ๊ตฌ์„ฑ๋œ transfer network๋ฅผ ์ƒ์„ฑ

์–ด๋Š ์ž…๋ ฅ ์ด๋ฏธ์ง€ I์— ๋Œ€ํ•œ transfer network์˜ ์˜ˆ์ธก ๋ ˆ์ด๋ธ”์€ ์™ผ์ชฝ์ฒ˜๋Ÿผ ํ‘œํ˜„๋จ

์ตœ์ ์˜ transfer function ์‹์„ ์˜๋ฏธํ•จ ( readout function )
  • ft(I): ์ž…๋ ฅ ์ด๋ฏธ์ง€ I์— ๋Œ€ํ•œ target task t์˜ Ground Truth
  • L(t): ํ•ด๋‹น task์˜ loss function
  • D(s→t)์˜ ์„ฑ๋Šฅ์ด ์ข‹์„์ˆ˜๋ก ๋‘ task s,t ๊ฐ„์˜ transferability๊ฐ€ ๋†’์Œ
  • ๋ชจ๋“  (s,t) ์กฐํ•ฉ์— ๋Œ€ํ•œ readout Function์„ ๋ชจ๋‘ ๊ตฌํ•จ

 

 

  • ์ด๋•Œ, ์—ฌ๋Ÿฌ source task๊ฐ€ target task๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ๋ณด์™„์ ์ธ ์ •๋ณด๋ฅผ ํฌํ•จํ•  ์ˆ˜ ์žˆ์Œ (ํ•˜๋‚˜์˜ ์ •๋ณด๋ณด๋‹ค๋Š” ์—ฌ๋Ÿฌ ์ •๋ณด๋“ค์„ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ์ •๋ณด๋ฅผ ์ œ๊ณตํ•˜๋ฉด ์„ฑ๋Šฅ ์˜ฌ๋ฆด ์ˆ˜ ์žˆ๋‹ค๋Š” ๋ง)
  • k๊ฐ€ ์ปค์งˆ์ˆ˜๋ก ์ „์ฒด ์กฐํ•ฉ ๊ฒฝ์šฐ ์ˆ˜๋„ ๋Š˜์–ด๋‚˜๊ฒ ์ฃ ?
  • TT x sCk_sC_k : ๋ชจ๋“  target task์— ๋Œ€ํ•œ ์ „์ฒด ์กฐํ•ฉ ๊ฒฝ์šฐ์˜ ์ˆ˜โžก๏ธ ํ•˜์ง€๋งŒ, ๊ณ„์‚ฐ๋Ÿ‰์ด ๋ฐฉ๋Œ€ํ•ด์ง
๐Ÿค–
1. k=1๋กœ ํ•ด์„œ target task t์— ๋Œ€ํ•œ ๋ชจ๋“  ์„ฑ๋Šฅ์„ ๊ณ„์‚ฐํ•จ 2. ์ดํ›„, beam search๋ฅผ ์ ์šฉํ•ด์„œ ์„ฑ๋Šฅ ๊ธฐ์ค€์œผ๋กœ max(5,k) ๊ฐœ์˜ source task๋ฅผ ์„ ํƒํ•ด์„œ k์ฐจ ์กฐํ•ฉ๋งŒ์„ ๊ณ ๋ ค 3. ์ด๋ ‡๊ฒŒ ํ•ด์„œ, ๋ชจ๋“  (s1,...,s(k),t)์กฐํ•ฉ์— ๋Œ€ํ•œ readout function๋“ค์„ ๋ชจ๋‘ ๊ตฌํ•จ
  • โ€ป beam search๋ž€

    : ๊ฐ€๋Šฅํ•œ ๋‹ค์–‘ํ•œ ํ›„๋ณด๋“ค ์ค‘์—์„œ ์ตœ์ ์˜ ํ•ด๋ฅผ ์ฐพ๋Š” ๋ฐฉ๋ฒ•

    : ์ œํ•œ๋œ ์ง‘ํ•ฉ์—์„œ ๊ฐ€์žฅ ์œ ๋งํ•œ ๋…ธ๋“œ๋ฅผ ํ™•์žฅํ•˜์—ฌ ๊ทธ๋ž˜ํ”„๋ฅผ ํƒ์ƒ‰ํ•˜๋Š” ๋ฐœ๊ฒฌ์  ํƒ์ƒ‰ ์•Œ๊ณ ๋ฆฌ์ฆ˜

  • ์˜ˆ๋ฅผ ๋“ค์–ด, s=7, t=1, k=2๋กœ ์„ค์ •ํ•œ ๊ฒฝ์šฐ,
7C2=217C2 = 21

๊ฐœ์˜ ์กฐํ•ฉ์ด ๋‚˜์˜ฌ ๊ฒƒ์ด๊ณ , 5๊ฐœ๋งŒ์„ ๊ฐ€์ ธ๊ฐ„๋‹ค๊ณ  ํ•˜๋ฉด, 5C2์ด๋ฏ€๋กœ 10๊ฐœ์˜ ์กฐํ•ฉ์„ ๊ฑธ์น˜๊ฒŒ ๋จ

(s1,s2~s5), (s2,s3~s5), (s3,s4~s5), (s4,s5) → ์ด 10๊ฐ€์ง€

 

 

3.3. Step III: Ordinal Normalization using Analytic Hierarchy Process (AHP)

๐Ÿ’ก
Task๋“ค ๊ฐ„์˜ Transability์„ ๊ธฐ๋ฐ˜์œผ๋กœ Affinity matrix๋ฅผ ๊ตฌํ•˜๋Š” ๋‹จ๊ณ„
  • Target task์— ๋”ฐ๋ผ Loss์˜ ๋ฒ”์œ„๊ฐ€ ๋‹ค๋ฅด๊ธฐ ๋•Œ๋ฌธ์— Ordinal ์ ‘๊ทผ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ์ •๊ทœํ™” ์ˆ˜ํ–‰
  • โ€ป [0,1] ๋ฒ”์œ„๋กœ ์กฐ์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ์ด์œ 
    • Loss ๊ฐ’์˜ ๊ฐ์†Œ์— ๋”ฐ๋ฅธ ์‹ค์ œ ์ฒด๊ฐ๋˜๋Š” ์˜ˆ์ธก ๊ฒฐ๊ณผ ํ’ˆ์งˆ์˜ ์ฆ๊ฐ€ ์†๋„๋„ task ๋ณ„๋กœ ์ฐจ์ด๊ฐ€ ์กด์žฌํ•œ๋‹ค๊ณ  ํ•จโžก๏ธ ๋‹จ์ˆœํžˆ normalizeํ•˜๋Š” ๊ฒƒ๋งŒ์œผ๋กœ๋Š” ๋ชจ๋“  target task๋“ค์„ ๋™์ผ ์„ ์ƒ์—์„œ ์ปค๋ฒ„ํ•˜๊ธฐ์— ์ถฉ๋ถ„ํ•˜์ง€ ์•Š์Œ
  • โ€ป ์™œ Target task๋งˆ๋‹ค Loss ๋ฒ”์œ„๊ฐ€ ๋‹ค๋ฅธ๊ฐ€?
    1. ํ•ด์˜ ๊ณต๊ฐ„์˜ ๋ณต์žก์„ฑ: target task์— ๋”ฐ๋ผ ๊ฐ€๋Šฅํ•œ ํ•ด์˜ ๊ณต๊ฐ„์ด ๋‹ค์–‘ํ•˜๊ณ  ๋ณต์žก์„ฑ์ด ๋‹ฌ๋ผ์ง. ์ผ๋ถ€ ํƒœ์Šคํฌ์—์„œ๋Š” ํ•ด์˜ ๊ณต๊ฐ„์ด ์ƒ๋Œ€์ ์œผ๋กœ ์ž‘๊ณ  ๋‹จ์ˆœํ•  ์ˆ˜ ์žˆ์ง€๋งŒ, ๋‹ค๋ฅธ ํƒœ์Šคํฌ์—์„œ๋Š” ๋งค์šฐ ํฌ๊ณ  ๋ณต์žกํ•œ ๊ณต๊ฐ„์ผ ์ˆ˜ ์žˆ์Œ.
    1. target task์˜ ์š”๊ตฌ ์‚ฌํ•ญ: ๊ฐ target task๋Š” ๋‹ค๋ฅธ ์š”๊ตฌ ์‚ฌํ•ญ์„ ๊ฐ€์ง€๊ณ  ์žˆ์Œ. ์–ด๋–ค task์—์„œ๋Š” ๋งŽ์€ ํ›„๋ณด๋ฅผ ๊ณ ๋ คํ•˜๋Š” ๊ฒƒ์ด ์œ ๋ฆฌํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ด๋Š” ๋” ํฐ ๋น” ํฌ๊ธฐ(K ๊ฐ’)๋ฅผ ํ•„์š”๋กœ ํ•จ. ๋‹ค๋ฅธ ํƒœ์Šคํฌ์—์„œ๋Š” ์ž‘์€ K ๊ฐ’์ด ์ถฉ๋ถ„ํ•  ์ˆ˜๋„ ์žˆ์Œ.
    1. ์„ฑ๋Šฅ๊ณผ ํšจ์œจ์„ฑ์˜ ๊ท ํ˜•: ๋น” ํฌ๊ธฐ(K ๊ฐ’)๋Š” ์„ฑ๋Šฅ๊ณผ ํšจ์œจ์„ฑ ์‚ฌ์ด์˜ ๊ท ํ˜•์„ ๋‚˜ํƒ€๋ƒ„. ๋” ํฐ K ๊ฐ’์€ ๋” ๋งŽ์€ ํ›„๋ณด๋ฅผ ๊ณ ๋ คํ•˜๊ณ , ์ด๋กœ ์ธํ•ด ๋” ์ข‹์€ ์„ฑ๋Šฅ์„ ์–ป์„ ์ˆ˜ ์žˆ์Œ. ๊ทธ๋Ÿฌ๋‚˜ ๋™์‹œ์— ์—ฐ์‚ฐ ๋น„์šฉ๋„ ๋” ๋งŽ์ด ์†Œ๋ชจํ•˜๊ฒŒ ๋จ. ๋”ฐ๋ผ์„œ, ์‹ค์ œ๋กœ ์„ ํƒ๋˜๋Š” K ๊ฐ’์€ ์„ฑ๋Šฅ ๋ชฉํ‘œ์™€ ์—ฐ์‚ฐ ๋น„์šฉ์„ ๊ณ ๋ คํ•˜์—ฌ ์กฐ์ •๋˜์–ด์•ผ ํ•จ.
๐Ÿค–
<๋ฐฉ๋ฒ•>
  1. ๊ฐtt์— ๋Œ€ํ•ด, ๋ชจ๋“  source task๊ฐ„์˜ transability๋ฅผ ๋น„๊ตํ•˜๋Š” ํ–‰๋ ฌWtW_t์„ ๋งŒ๋“ฌ.
  1. ํ–‰๋ ฌ์˜(i,j) (i, j)์˜ ์š”์†Œwi,jw_{i, j}๋Š” test ๋ฐ์ดํ„ฐ์—์„œ si๊ฐ€ sj๋ณด๋‹ค t์— ๋Œ€ํ•ด ๋” ์„ฑ๋Šฅ์ด ์šฐ์ˆ˜ํ–ˆ๋˜ (์‹์œผ๋กœ ํ‘œํ˜„ํ•œ ๊ฒƒ:Dsi→t(I)>Dsj→t(I){D_{s_i→t}}(I) > Ds_{j→t}(I)) ํ…Œ์ŠคํŠธ ์ด๋ฏธ์ง€(I)์˜ ์ˆ˜๋ฅผ ์นด์šดํŒ…ํ•˜๊ณ , test set ๋‚ด์—์„œ์˜ ๋น„์œจ์„ ๊ณ„์‚ฐํ•จ.
  1. Wt๋ฅผ [0.001, 0.999] ๋ฒ”์œ„๋กœ clipping ํ•จ.
  1. Wt์˜ ํ•ฉ์ด 1์ด ๋˜๋„๋ก ์ƒˆ๋กœ์šด matrixWt‘W_t^`==Wt/WtTW_t / W_t^T(element-wise)๋ฅผ ๊ณ„์‚ฐํ•จ.โ€ปsi s_i๊ฐ€sjs_j์— ๋น„ํ•ด ์„ฑ๋Šฅ์ด ์–ผ๋งˆ๋‚˜ ๋” ์šฐ์ˆ˜ํ–ˆ๋Š”์ง€ ๋‚˜ํƒ€๋‚ธ๋‹ค๊ณ  ํ•จ.
  1. Wt‘W_t^`์˜ eigenvector(๊ณ ์œ ๊ฐ’์ด ๊ฐ€์žฅ ํฐ ๊ณ ์œ ๋ฒกํ„ฐ)๋ฅผ ๊ณ„์‚ฐํ•จ
  1. principal eigenvector์˜ i๋ฒˆ์งธ ์„ฑ๋ถ„์€, ์ด์— ๋Œ€์‘๋˜๋Š” i๋ฒˆ์งธ source task๋“ค๋กœ ๊ตฌ์„ฑํ•œ centrality(์ค‘์‹ฌ์„ฑ)๋ฅผ ๋‚˜ํƒ€๋‚ด๊ฒŒ ๋จ (์•„ ๋ชจ๋ฅด๊ฒ ์Œ ๋ญ”๋ง์ด์•ผ. ์–ด๋ ค์›Œ!!!~!~!~)

โžก๏ธ ์ฆ‰, ์‰ฝ๊ฒŒ ๋งํ•ด์„œ ํ•ด๋‹น source task์˜ target task์— ๋Œ€ํ•œ ์ผ์ข…์˜ ‘์˜ํ–ฅ๋ ฅ’์„ ๋‚˜ํƒ€๋ƒ„. ⇒ Analytic Hierarchy Process(AHP) → ๊ฒฝ์˜๊ณผํ•™์—์„œ ๋งŽ์ด ์“ฐ์ด๋Š” ๋ฐฉ์‹์ด๋ผ๊ณ .

  • โ€ป AHP๋ž€

    ์ƒํ˜ธ ๋ฐฐํƒ€์ ์ธ ๋Œ€์•ˆ๋“ค์„ ์ฒด๊ณ„์ ์œผ๋กœ ํ‰๊ฐ€ํ•˜์—ฌ ์šฐ์„  ์ˆœ์œ„๋ฅผ ๋„์ถœํ•˜๋Š” ์˜์‚ฌ๊ฒฐ์ •๋ฐฉ๋ฒ•

  • ๋ชจ๋“  t ∈ T์— ๋Œ€ํ•œ W0t์˜ ์ฃผ์š” ๊ณ ์œ ๋ฒกํ„ฐ๋ฅผ ์Œ“์•„์„œ ํ–‰๋ ฌ P('p'๋Š” ์„ฑ๋Šฅ์„ ๋‚˜ํƒ€๋ƒ„)์„ ์–ป์Œ.

3.4. Step IV: Computing the Global Taxonomy

๐Ÿ’ก
Task affinity matrix๊ฐ€ ์™„์„ฑ๋˜๋ฉด, ์ด๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํŠน์ •ํ•œ target task์˜ ์„ฑ๋Šฅ์„ ๊ทน๋Œ€ํ™”ํ•˜๋Š” ์ตœ์ ์˜ transfer policy๋ฅผ ํƒ์ƒ‰ํ•จ ⇒ BIP(Boolean Interger Programming)๋ฅผ ์‚ฌ์šฉํ•ด์„œ ๋ฌธ์ œ ํ•ด๊ฒฐ
  • task๊ฐ€ ๋…ธ๋“œ(๋จธ๋ฆฌ)์ด๊ณ  transfer๊ฐ€ ์—ฃ์ง€(๊ฐ„์„ )์ธ ํ•˜์œ„ ๊ทธ๋ž˜ํ”„(subgraph) ์„ ํƒ์œผ๋กœ์„œ ์ •์˜๋  ์ˆ˜ ์žˆ์Œ
  • โ€ป subgraph ๊ทธ๋ฆผ ์ฐธ๊ณ 
    • ์›๋ž˜ ๊ทธ๋ž˜ํ”„ ์•ˆ์—์„œ ์ƒˆ๋กœ ๋งŒ๋“ค์–ด๋‚ธ ๊ทธ๋ž˜ํ”„

โžก๏ธ ์ตœ์ ์˜ ํ•˜์œ„ ๊ทธ๋ž˜ํ”„๋Š” ์ด์ƒ์ ์ธ tast ๋…ธ๋“œ์™€ ๋Œ€์ƒ์œผ๋กœ๋ถ€ํ„ฐ์˜ ์ตœ์ƒ์˜ edge๋ฅผ ์„ ํƒํ•˜๋ฉด์„œtast ๋…ธ๋“œ์˜ ์ˆ˜๊ฐ€ supervision budget ์„ ์ดˆ๊ณผํ•˜์ง€ ์•Š๋„๋ก ํ•จ

  • โ€ป supervision budget
    • ๋ชจ๋ธ์ด ํŠน์ • ์ž‘์—…์— ๋Œ€ํ•ด ์–ผ๋งˆ๋‚˜ ๋งŽ์€ ๋ ˆ์ด๋ธ” ํ˜น์€ ๊ฐ๋… ์‹ ํ˜ธ๋ฅผ ๋ฐ›์„ ์ˆ˜ ์žˆ๋Š”๊ฐ€์— ๋Œ€ํ•œ ๋ฒ”์œ„๋ผ๊ณ  ์ƒ๊ฐํ•˜๋ฉด ๋  ๋“ฏ.
    • Supervision Budget์ด ๋†’์„์ˆ˜๋ก ๋ชจ๋ธ์€ ํ•ด๋‹น ์ž‘์—…์— ๋Œ€ํ•ด ๋” ๋งŽ์€ ๊ฐ๋… ์‹ ํ˜ธ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๋” ์ •ํ™•ํ•˜๊ฒŒ ํ•™์Šตํ•  ์ˆ˜ ์žˆ์Œ

 

๐Ÿ’ก
BIP๋ž€ : 0๊ณผ 1์˜ ๊ฐ’์„ ๊ฐ€์ง€๋Š” ์ด์ง„ ๋ณ€์ˆ˜๋กœ ์ œ์•ฝ ์กฐ๊ฑด์„ ํฌํ•จํ•˜๋Š” ์ตœ์ ํ™” ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ๊ธฐ๋ฒ• <์ œ์•ฝ์กฐ๊ฑด> 1. task๊ฐ€ ํ•˜์œ„ ๊ทธ๋ž˜ํ”„์— ํฌํ•จ๋˜๋ฉด, ๊ทธ ๋ชจ๋“  source task(node)๋„ ํฌํ•จ๋˜์–ด์•ผ ํ•œ๋‹ค. 2. ๊ฐ target task์€ ์ •ํ™•ํžˆ ํ•˜๋‚˜์˜ transfer(source task)๋ฅผ ๊ฐ€์ ธ์•ผ ํ•œ๋‹ค. 3. ๊ฐ๋… ์˜ˆ์‚ฐ์„ ์ดˆ๊ณผํ•˜์ง€ ์•Š์•„์•ผ ํ•œ๋‹ค.

 


 

4. Experiments

  • Step 1์—์„œ ํ•™์Šต์‹œํ‚จ Task-specific network๋“ค์˜ ๊ฐ task ๋ณ„ ์„ฑ๋Šฅ ๊ฒฐ๊ณผ

 


4.1. Evaluation of Computed Taxonomies

  • Supervision budget γ ๋ฐ Transfer Order๋ฅผ ๋ณ€๊ฒฝํ•ด ๊ฐ€๋ฉด์„œ ํ•™์Šตํ•œ ๊ฒฐ๊ณผ ์–ป์–ด์ง„ ๋ช‡ ๊ฐ€์ง€ ์˜ˆ์‹œ

 


 

  • Taskonomy์— ๊ธฐ๋ฐ˜ํ•ด์„œ ์–ป์–ด์ง„ transfer ๊ทœ์น™๋“ค์„ ๊ฐ target task์— ์ ์šฉํ•ด์„œ transfer learning์„ ์ˆ˜ํ–‰ํ–ˆ์„ ๋•Œ์˜ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์คŒ (2๊ฐ€์ง€ ์ง€ํ‘œ๋ฅผ ์‚ฌ์šฉ)
  • Gain

    : Transfer network์˜ ํ•™์Šต์— ์‚ฌ์šฉํ•œ validation set(1.6๋งŒ)์œผ๋กœ, target task์˜ task-specific network๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์„ baseline์œผ๋กœ ์„ค์ •ํ–ˆ์„ ์‹œ์˜, taskonomy ๋ฐฉ๋ฒ•์˜ win rate(%)

  • Quality

    Task-specific network์˜ ํ•™์Šต์— ์‚ฌ์šฉํ•œ training set(12๋งŒ)์œผ๋กœ, target task์˜ task-specific network๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์„ baseline์œผ๋กœ ์„ค์ •ํ–ˆ์„ ์‹œ์˜, taskonomy ๋ฐฉ๋ฒ•์˜ win rate(%)

  • Maximum transfer order๋ฅผ ์ฆ๊ฐ€์‹œํ‚ฌ์ˆ˜๋ก, ๊ทธ๋ฆฌ๊ณ  supervision budget γ๋ฅผ ์ฆ๊ฐ€์‹œํ‚ฌ์ˆ˜๋ก, Gain๊ณผ Quality๊ฐ€ ์ ์ฐจ์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๋Š” ๊ฒฝํ–ฅ์„ ๋ณด์ž„

โžก๏ธ ๋” ๋งŽ์€ source task๋กœ๋ถ€ํ„ฐ ์–ป์€ ์ง€์‹์„ transferํ•  ์ˆ˜๋ก ์„ฑ๋Šฅ์ด ๋†’์•„์งˆ ๊ฒƒ์ด๋‹ค๋ผ๋Š” ๊ฐ€์„ค์ด ์„ฑ๋ฆฝ๋จ.

 


4.2. Generalization to Novel Tasks

  • ์ƒˆ๋กœ์šด task์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ ๊ฒ€์ฆ ๊ฒฐ๊ณผ
  • ์ด์ „๊นŒ์ง€์˜ ์‹คํ—˜์—์„œ๋Š” ์‚ฌ์ „์— ๊ฐ€์ •ํ•œ source-target ์กฐํ•ฉ๋“ค์„ ๊ณ ๋ คํ•˜์—ฌ taskonomy๋ฅผ ๊ณ„์‚ฐํ•˜๋Š” ๊ณผ์ •์„ ๊ฑฐ์ณค์ง€๋งŒ, ํ˜„์‹ค์ ์ธ ์ƒํ™ฉ์—์„œ๋Š” ์ƒˆ๋กœ์šด target task์— ๋Œ€ํ•ด์„œ ๊ธฐ์กด์— ์žˆ๋˜ source task๋งŒ์„ ๊ฐ€์ง€๊ณ  ์ตœ์ ์˜ transfer policy๋ฅผ ์ฐพ์•„์•ผ ํ•จ.

โžก๏ธ ๊ธฐ์กด Task๋“ค์„ ๋ชจ๋‘ Source๋กœ ์˜ฎ๊ธฐ๊ณ , ์ƒˆ๋กœ์šด Task๋ฅผ ๋‹จ์ผ Target์œผ๋กœํ•œ ์ผ๋ฐ˜ํ™” ์„ฑ์ฆ ๊ฒ€์ฆ ์‹คํ—˜์„ ์ˆ˜ํ–‰

โžก๏ธ ImageNet ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ ํ•™์Šตํ•œ AlexNet์˜ FC7์„ features๋กœ ์‚ฌ์šฉํ•œ ๊ฒฝ์šฐ ๋“ฑ์— ๋น„ํ•ด, ์™„์„ฑ๋œ taskonomy์— ๊ธฐ๋ฐ˜ํ•˜์—ฌ ์ฐพ์€ transfer policy์— ๋”ฐ๋ผ ํ•™์Šตํ•œ ๊ฒฝ์šฐ์˜ ์„ฑ๋Šฅ์ด ์ „์ฒด์ ์œผ๋กœ ๋” ์šฐ์ˆ˜ํ•œ ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚จ (์งˆ๋ฌธ- ์ž˜ ๋ชจ๋ฅด๊ฒ ์Œ)

 


 

5. Significance Test of the Structure

  • Taskonomy๋กœ ์ฐพ์€ ์ตœ์ ์˜ Transfer policy ์ ์šฉ ์„ฑ๋Šฅ ๋น„๊ต ๊ฒฐ๊ณผ (์ดˆ๋ก์ƒ‰: ์ตœ์ ์˜ transfer policy, ํšŒ์ƒ‰: ๋žœ๋ค transfer Policy)

 


 

6. Limitations and Discussion

  1. Model Dependence: ํ•™์Šต์€ DNN, ๋ฐ์ดํ„ฐ๋Š” ์ด๋ฏธ์ง€ ๋ฐ์ดํ„ฐ๋งŒ ์‚ฌ์šฉ → ์‹คํ—˜ ๊ฒฐ๊ณผ๊ฐ€ ํŠน์ • ๋ชจ๋ธ๊ณผ ๋ฐ์ดํ„ฐ์…‹์— ๋„ˆ๋ฌด ํŠนํ™”๋˜์–ด์žˆ๋‹ค๊ณ  ํ‘œํ˜„ํ•  ์ˆ˜ ์žˆ์„ ๋“ฏ
  1. Compositionality: ๋ณธ ๋…ผ๋ฌธ์—์„œ ๋‹ค๋ฃฌ task๋“ค์€ ๋ชจ๋‘ ์‚ฌ๋žŒ์ด ์ •์˜ํ•œ task์— ํ•ด๋‹น. → ์ƒˆ๋กœ์šด ๋ถ€๋ถ„ ์ž‘์—…(Subtask)์„ ๋ฐœ๊ฒฌํ•  ๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ์˜๋ฌธ์„ ์ œ๊ธฐํ•จ (๋‹ค์–‘ํ•œ task๋ฅผ ๋‹ค๋ฃจ์ง€ ๋ชปํ–ˆ๋‹ค..)
  1. Space Regularity: 26๊ฐœ์˜ task์—์„œ ์ƒ˜ํ”Œ๋ง์„ ํ†ตํ•ด ์–ป์€ task๋งŒ์„ ์‚ฌ์šฉํ–ˆ๊ธฐ์—, ๊ณผ์—ฐ ์ผ๋ฐ˜์ ์ธ task์ธ๊ฐ€์— ๋Œ€ํ•ด ์˜๋ฌธ ์ œ๊ธฐ
  1. Transferring to Non-visual and Robotic Tasks: ์ด๋ฏธ์ง€์™€ ๊ด€๋ จ๋œ visual task๋“ค์— ๋Œ€ํ•ด์„œ๋งŒ ๊ฒ€์ฆ์„ ์ˆ˜ํ–‰ํ•จ → ์‹œ๊ฐ์ ์ด์ง€ ์•Š์€ ๋ถ„์•ผ์—์„œ๋„ Taskonomy ๋ฐฉ๋ฒ•์„ ํ†ตํ•ด transferability๋ฅผ ๊ทน๋Œ€ํ™”ํ•  ์ˆ˜ ์žˆ์„์ง€์— ๋Œ€ํ•œ ์˜๋ฌธ ์ œ๊ธฐ
  1. Lifelong Learning: ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” Taskonomy๋ฅผ ์™„์„ฑํ•˜๋Š” ์ž‘์—…์„ ๋‹จ ํ•œ ๋ฒˆ์— ์ˆ˜ํ–‰ํ•จ → ์‹œ์Šคํ…œ์ด ์ง€์†์ ์œผ๋กœ ํ•™์Šต์„ ์ง„ํ–‰ํ•˜๊ณ , ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์–ด๋–ค task๋ฅผ ์ ์ง„์ ์œผ๋กœ ํ™•์žฅ์‹œํ‚ฌ ์ˆ˜ ์žˆ์„์ง€์— ๋Œ€ํ•œ ๊ฒ€์ฆ ํ•„์š”
  2.  

7. Reference

https://www.cognex.com/ko-kr/blogs/deep-learning/research/paper-review-taskonomy-disentangling-task-transfer-learning

https://rahites.tistory.com/179

๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค.

์ƒ๊ฐ๋ณด๋‹ค ์ดํ•ดํ•˜๋Š”๋ฐ ์–ด๋ ค์› ๋˜ ๋…ผ๋ฌธ.
728x90
๋ฐ˜์‘ํ˜•