๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90
๋ฐ˜์‘ํ˜•

Deep Learning/[๋…ผ๋ฌธ] Paper Review38

SMPLify(Keep it SMPL): Automatic Estimation of 3D Human Pose and Shape from a Single Image SMPLify[Keep it SMPL] ์ด๋ž€? : 2D CNN(Deepcut)์„ ํ™œ์šฉํ•ด ๊ด€์ ˆ ์œ„์น˜๋ฅผ ๋ฝ‘์€ ํ›„, 3D SMPL์— ์ ์šฉํ•ด 3D Mesh๋ฅผ ๊ตฌ์„ฑํ•˜๋Š” ๋ฐฉ์‹ ์š”์•ฝ(Abstract) : ์ด๋ฏธ์ง€์—์„œ ์ธ๊ฐ„์˜ 3D ํฌ์ฆˆ์™€ ํ˜•ํƒœ๋ฅผ ์ž๋™์œผ๋กœ ์ถ”์ •ํ•˜๊ณ ์ž ํ•จ : CNN ๊ธฐ๋ฒ• Deepcut์„ ํ™œ์šฉ, 3D SMPL์˜ ๊ฒฐํ•ฉ : Datasets์˜ ๊ฒฝ์šฐ, Leeds Spors, HumanEva, Human3.6M ์‚ฌ์šฉ ์ด์  ๋ฐ ํŠน์ง•(Introduction) : ์ด์ „ ๋ฐฉ์‹์˜ ๊ฒฝ์šฐ, ํฌ์ฆˆ ์ดˆ์ ์—๋งŒ ๋งž์ท„๊ณ , 3D ํ˜•ํƒœ๋ฅผ ๋ฌด์‹œํ–ˆ์Œ โžก๏ธ 2D ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ํฌ์ฆˆ์™€ ํ˜•ํƒœ๋ฅผ ๋ชจ๋‘ ํฌํ•จํ•˜๋Š” 3D ๋ฉ”์‰ฌ๋ฅผ ์ž๋™์œผ๋กœ ์ถ”์ •ํ•˜๋Š” ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•จ Deepcut ํ™œ์šฉํ•ด 2D ๊ด€์ ˆ ์ถ”์ • โ€ป DeepCut์ด๋ž€ ๐Ÿ“š [์ฐธ๊ณ ] https://arxiv.or.. 2023. 7. 31.
SMPL: A Skinned Multi-Person Linear Model ๐Ÿ’ก SMPL: A Skinned Multi-Person Linear Model ๋ชฉ์ฐจ SMPL์˜ ์ •์˜ SMPL ์—ฐ๊ตฌ ๋ชฉ์  ๊ธฐ์กด์˜ ๋ฐฉ๋ฒ• ๋ฐ ํ•œ๊ณ„ SMPL์˜ ์›๋ฆฌ์™€ ์ž‘๋™ ์ตœ์ข… DMPL SMPL(Skinned Multi-Person Linear) ์ด๋ž€? [์ฐธ๊ณ ] โžก๏ธ SMPL์— ๋Œ€ํ•œ ๊ฐ„๋‹จํ•œ ์„ค๋ช… : skinned vertex ๊ธฐ๋ฐ˜์˜ ๋ชจ๋ธ๋กœ์„œ, ๋‹ค์–‘ํ•œ ์ธ๊ฐ„์˜ ์ฒดํ˜•์„ ํ˜„์‹ค์ ์œผ๋กœ ๋‚˜ํƒ€๋‚ด๊ณ  ์ž์—ฐ์Šค๋Ÿฌ์šด ์ž์„ธ์— ๋”ฐ๋ฅธ ๋ณ€ํ˜•์„ ์ทจํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์—ฐ์กฐ์ง ์›€์ง์ž„์„ ๋‚˜ํƒ€๋‚ธ๋‹ค. โ€ป skinned vertex: ์Šคํ‚จ(ํ”ผ๋ถ€)์„ ์”Œ์›Œ์ง„ ๋ผˆ๊ตฌ์กฐ์— ์†ํ•œ ๋ฉ”์‹œ์˜ ์ •์  (๋ผˆ์˜ ์›€์ง์ž„์— ๋”ฐ๋ผ ๋ณ€ํ˜•๋˜๋Š” ์ •์ ) SMPL ์—ฐ๊ตฌ์˜ ๋ชฉ์  : ๋‹ค์–‘ํ•œ ์ฒดํ˜•์„ ๋Œ€ํ‘œํ•  ์ˆ˜ ์žˆ๋Š” ์‹ค์ œ์ ์ธ ์• ๋‹ˆ๋ฉ”์ด์…˜ ์ธ๊ฐ„ ์‹ ์ฒด๋ฅผ ๋งŒ๋“ค๊ณ , ์ž์—ฐ์Šค๋Ÿฌ์šด ์ž์„ธ์— ๋”ฐ๋ผ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋ณ€ํ˜•๋˜๋ฉฐ, .. 2023. 7. 28.
DETR: End-to-End Object Detection with Transformers ๐Ÿ“ ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” object detection์„ direct set prediction(์ผ๋Œ€์ผ๋Œ€์‘)์œผ๋กœ ์ •์˜, transformer์™€ bipartite matching loss๋ฅผ ์‚ฌ์šฉํ•œ DETR(DEtection TRansformer)์„ ์ œ์•ˆํ•จ. DETR์€ COCO dataset์— ๋Œ€ํ•˜์—ฌ Faster R-CNN๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€์˜ ์„ฑ๋Šฅ์„ ๋ณด์ž„ ์ถ”๊ฐ€์ ์œผ๋กœ, self-attention์„ ํ†ตํ•œ global information(์ „์—ญ ์ •๋ณด)๋ฅผ ํ™œ์šฉํ•จ์œผ๋กœ์จ ํฌ๊ธฐ๊ฐ€ ํฐ ๊ฐ์ฒด๋ฅผ Faster R-CNN๋ณด๋‹ค ํ›จ์”ฌ ์ž˜ ํฌ์ฐฉ. ๐Ÿ“ 1. Backbone(ResNet)์„ ์ž…๋ ฅํ•ด์„œ ํ”ผ์ฒ˜๋งต์„ ์ถ”์ถœ 2. ํ”ผ์ฒ˜๋งต์„ 1x1 conv์— ์ž…๋ ฅํ•ด์„œ flattenํ•œ ํ”ผ์ฒ˜๋งต์— ๋Œ€ํ•ด positional encoding ๊ตฌํ•ด์„œ ๋”ํ•จ โ€ป spatial.. 2023. 7. 23.
SRNet: Editing Text in the Wild Review 0. Abstract ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ž์—ฐ ์ด๋ฏธ์ง€์˜ ํ…์ŠคํŠธ ํŽธ์ง‘์— ๊ด€์‹ฌ์ด ์žˆ์œผ๋ฉฐ, ์›๋ณธ ์ด๋ฏธ์ง€์˜ ๋‹จ์–ด๋ฅผ ๋‹ค๋ฅธ ๋‹จ์–ด๋กœ ๊ต์ฒดํ•˜๊ฑฐ๋‚˜ ์ˆ˜์ •ํ•˜์—ฌ ์›๋ณธ ์ด๋ฏธ์ง€์™€ ์‹œ๊ฐ์ ์œผ๋กœ ๊ตฌ๋ณ„ํ•˜๊ธฐ ์–ด๋ ค์šด ํŽธ์ง‘๋œ ์ด๋ฏธ์ง€๋ฅผ ์œ ์ง€ํ•˜๋Š” ์ž‘์—…์„ ๋ชฉํ‘œ๋กœ ํ•จ ์„ธ ๊ฐ€์ง€ ๋ชจ๋“ˆ๋กœ ๊ตฌ์„ฑ๋œ end-to-end ํ•™์Šต ๊ฐ€๋Šฅํ•œ ์Šคํƒ€์ผ ๋ณด์กด ๋„คํŠธ์›Œํฌ (SRNet)๋ฅผ ์ œ์•ˆ ํ…์ŠคํŠธ ๋ณ€ํ™˜ ๋ชจ๋“ˆ: ์›๋ณธ ์ด๋ฏธ์ง€์˜ ํ…์ŠคํŠธ ๋‚ด์šฉ์„ ๋Œ€์ƒ ํ…์ŠคํŠธ๋กœ ๋ณ€๊ฒฝํ•˜๋ฉด์„œ ์›๋ž˜์˜ ํ…์ŠคํŠธ ์Šคํƒ€์ผ์„ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค. ๋ฐฐ๊ฒฝ ์ธํŽ˜์ธํŒ… ๋ชจ๋“ˆ: ์›๋ณธ ํ…์ŠคํŠธ๋ฅผ ์ง€์šฐ๊ณ  ์ ์ ˆํ•œ ํ…์Šค์ฒ˜๋กœ ํ…์ŠคํŠธ ์˜์—ญ์„ ์ฑ„์›๋‹ˆ๋‹ค. ํ“จ์ „ ๋ชจ๋“ˆ: ๋‘ ๋ชจ๋“ˆ์˜ ์ •๋ณด๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ์ˆ˜์ •๋œ ํ…์ŠคํŠธ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑ ๐Ÿ’ก 1. Text Editing(ํ…์ŠคํŠธ ํŽธ์ง‘) 2. Text Synthesis(ํ…์ŠคํŠธ ํ•ฉ์„ฑ) 3. Text Erasure(ํ…์ŠคํŠธ ์‚ญ์ œ).. 2023. 7. 17.
Taskonomy: Disentangling Task Transfer Learning ๐Ÿ’ก ๊ฐ€ ๋ญ๋ƒ? Taskonomy๋Š” ์ปดํ“จํ„ฐ ๋น„์ „ ๋ถ„์•ผ์—์„œ ๋‹ค์–‘ํ•œ ์ž‘์—… ๊ฐ„์˜ ์ƒํ˜ธ ์˜์กด์„ฑ์„ ํƒ๊ตฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ฒ”์šฉ ๋น„์ „ ์‹œ์Šคํ…œ์„ ๊ตฌ์ถ•ํ•˜๋Š” ์—ฐ๊ตฌ. Taskonomy๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•ด ํ•„์š”ํ•œ ์‹œ๊ฐ์  ํŠน์ง•๋“ค์ด ์„œ๋กœ ๊ณต์œ ๋  ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ์กฐ์‚ฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ชจ๋ธ์˜ ํ•™์Šต ํšจ์œจ์„ฑ๊ณผ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์ „์ด ํ•™์Šต ๋ฐฉ๋ฒ•์„ ํƒ๊ตฌ. Taskonomy์˜ ๋ชฉํ‘œ๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ์‹œ๊ฐ์  ํŠน์ง•์„ ํƒ์ƒ‰ํ•˜์—ฌ, ์ž‘์—… ๊ฐ„์˜ ํ•™์Šต๊ณผ ์ผ๋ฐ˜ํ™”๋ฅผ ๊ฐœ์„ ํ•˜๊ณ  ์ž‘์—… ์ „ํ™˜์— ๋”ฐ๋ฅธ ๋น„์šฉ๊ณผ ๋…ธ๋ ฅ์„ ์ตœ์†Œํ™”ํ•˜๋Š” ๊ฒƒ. ๐Ÿ€ ๋…ผ๋ฌธ ์š”์•ฝ: ์—ฌ๋Ÿฌ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ํŠน์ง•๋“ค์„ ๋ฐœ๊ฒฌํ•˜๊ณ  ์ด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ•œ ๋ชจ๋ธ์ด ๋‹ค์–‘ํ•œ ์ž‘์—…(๊ฐ์ฒด ๊ฒ€์ถœ, ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜, ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜)์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ. ์ด๋ฅผ ์œ„ํ•ด ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ ์ˆ˜์ง‘.. 2023. 7. 16.
Noisy Student: Self-training with Noisy Student improves ImageNet classification(2019) ๋ฆฌ๋ทฐ๋Š” ์•„๋ž˜์ชฝ์— ์žˆ์Šต๋‹ˆ๋‹น ! ! ๋ฒˆ์—ญ ver 0. Abstract ์šฐ๋ฆฌ๋Š” Noisy Student Training์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋ ˆ์ด๋ธ”์ด ํ’๋ถ€ํ•œ ๊ฒฝ์šฐ์—๋„ ์ž˜ ์ž‘๋™ํ•˜๋Š” ์ค€์ง€๋„ ํ•™์Šต ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. Noisy Student Training์€ ImageNet์—์„œ 88.4%์˜ top-1 ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” 35์–ต ๊ฐœ์˜ ์•ฝํ•œ ๋ ˆ์ด๋ธ”์ด ๋ถ€์ฐฉ๋œ Instagram ์ด๋ฏธ์ง€๊ฐ€ ํ•„์š”ํ•œ ์ตœ์ฒจ๋‹จ ๋ชจ๋ธ๋ณด๋‹ค 2.0% ๋” ๋†’์€ ์„ฑ๋Šฅ์ž…๋‹ˆ๋‹ค. ๊ฐ•๊ฑด์„ฑ ํ…Œ์ŠคํŠธ ์„ธํŠธ์—์„œ๋Š” ImageNet-A์˜ top-1 ์ •ํ™•๋„๋ฅผ 61.0%์—์„œ 83.7%๋กœ ํ–ฅ์ƒ์‹œํ‚ค๋ฉฐ, ImageNet-C์˜ ํ‰๊ท  ์†์ƒ ์˜ค์ฐจ๋ฅผ 45.7์—์„œ 28.3์œผ๋กœ ์ค„์ด๊ณ , ImageNet-P์˜ ํ‰๊ท  ๋’ค์ง‘๊ธฐ ๋น„์œจ์„ 27.8์—์„œ 12.2๋กœ ์ค„์ž…๋‹ˆ๋‹ค. Noisy Student Train.. 2023. 7. 14.
728x90
๋ฐ˜์‘ํ˜•