๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90
๋ฐ˜์‘ํ˜•
BodyNet: Volumetric Inference of 3D Human Body Shapes BodyNet์ด๋ž€? : ๋‹จ์ผ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ 2D pose, segmentation ์ถ”์ถœ, ๋‘ ๊ฐœ์˜ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•ด 3D pose๋ฅผ ํ•™์Šต, ์ดํ›„, 3๊ฐ€์ง€ ์ •๋ณด์— RGB ์ •๋ณด๊นŒ์ง€ ํ™œ์šฉํ•ด 3D์˜ ๋ถ€ํ”ผ ๊ธฐ๋ฐ˜ ์ฒดํ˜•์„ ๊ตฌ์„ฑํ•˜๋Š” Network๋ฅผ ๋งํ•จ : end to end ํ˜•์‹ 1. ์ž…๋ ฅ RGB ์ด๋ฏธ์ง€๋Š” ๋จผ์ € 2D ํฌ์ฆˆ ์ถ”์ •๊ณผ 2D ์‹ ์ฒด ๋ถ€์œ„ ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜์„ ์œ„ํ•œ ํ•˜์œ„ ๋„คํŠธ์›Œํฌ๋ฅผ ํ†ต๊ณผ 2. 2D pose์™€ segmentation์„ ํ›ˆ๋ จ 3. ํ•™์Šต๋œ 2D pose์™€ Segmentation ๊ฐ€์ค‘์น˜๋ฅผ ๊ณ ์ •ํ•ด์„œ 3D pose๋ฅผ ํ›ˆ๋ จ์‹œํ‚ด 4. ์ดํ›„, ์ด์ „์˜ ๋ชจ๋“  ๋„คํŠธ์›Œํฌ ๊ฐ€์ค‘์น˜๋ฅผ ๊ณ ์ •ํ•˜๊ณ  3D ํ˜•ํƒœ network๋ฅผ ํ›ˆ๋ จ 5. ์ถ”๊ฐ€ ์žฌํ”„๋กœ์ ์…˜ ์†์‹ค๋กœ ํ˜•ํƒœ ๋„คํŠธ์›Œํฌ ํ›ˆ๋ จํ•ด์„œ ๋ถ€ํ”ผ ๊ธฐ๋ฐ˜ ํ˜•ํƒœ ์ถ”์ • ์ž‘์—…์— ๋Œ€ํ•ด ์„ธ๋ฐ€ ์กฐ์ • 6. ๊ฒฐํ•ฉ๋œ ์†.. 2023. 8. 3.
mixup: Beyond Emprical Risk Minimization Mixup์ด ๋ญ์•ผ? : Beyond Emprical Risk Minimization - ๊ฒฝํ—˜์  ์œ„ํ—˜ ์ตœ์†Œํ™”๋ฅผ ๋„˜์–ด? ์ด๊ฒŒ ๋„๋Œ€์ฒด ๋ญ”๋ง์ธ๊ฐ€ : mixup ⇒ data augmentaion ๊ธฐ๋ฒ• :๋‘ ๋ฐ์ดํ„ฐ๋ฅผ ์„ ํ˜•์ ์œผ๋กœ ๊ฒฐํ•ฉํ•ด์„œ ์ƒˆ๋กœ์šด ์ƒ˜ํ”Œ์„ ์ƒ์„ฑ : ์ •๋ง ์‰ฝ๊ฒŒ ๋งํ•˜์ž๋ฉด, ์šฐ๋ฆฌ๊ฐ€ ์ผ๋ฐ˜์ ์œผ๋กœ ํ›ˆ๋ จ, ์˜ˆ์ธก๊ณผ ๊ฐ™์€ ๋ฐฉ๋ฒ•์„ ์“ฐ๋ฉด ๊ณผ์ ํ•ฉ์ด ๋ฐœ์ƒํ•˜๊ธฐ ๋งˆ๋ จ์ž„. : ์™œ๋ƒ? ํ›ˆ๋ จ๋ฐ์ดํ„ฐ๋งŒ ๋ณด๊ณ  ํ•™์Šต์„ ์‹œํ‚ค๊ธฐ ๋•Œ๋ฌธ์—, ๋‹น์—ฐํžˆ ํ•™์Šตํ•œ ๋ชจ๋ธ์€ ํ•™์Šต ๋ฐ์ดํ„ฐ์…‹์— ํŽธํ–ฅ๋จ. : ์ฆ‰, ๊ณผ์ ํ•ฉ์ด ๋‚œ๋‹ค๋Š” ๋ง. ๊ฒฐ๊ตญ, ๋‹ค๋ฅธ ์กฐ๊ธˆ๋งŒ ๋‹ค๋ฅธ ๋ถ„ํฌ๋ฅผ ๊ฐ€์ง€๋Š” ๋ฐ์ดํ„ฐ์…‹์— ์ ์šฉ๋งŒ ํ•ด(Out of Distribution) ๋ชจ๋ธ์ด ์ทจ์•ฝํ•  ์ˆ˜ ๋ฐ–์— ์—†์Œ : ๋”ฐ๋ผ์„œ, ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์…‹๋งŒ ํ•™์Šต ์‹œํ‚ค๋Š”๊ฒŒ ์•„๋‹ˆ๋ผ, ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์…‹์˜ ๊ทผ๋ฐฉ ๋ถ„ํฌ๋„ ํ•จ๊ป˜ ํ•™์Šต์„ ์‹œ์ผœ์„œ ๋ณด๋‹ค ๋” .. 2023. 8. 3.
SMPLify(Keep it SMPL): Automatic Estimation of 3D Human Pose and Shape from a Single Image SMPLify[Keep it SMPL] ์ด๋ž€? : 2D CNN(Deepcut)์„ ํ™œ์šฉํ•ด ๊ด€์ ˆ ์œ„์น˜๋ฅผ ๋ฝ‘์€ ํ›„, 3D SMPL์— ์ ์šฉํ•ด 3D Mesh๋ฅผ ๊ตฌ์„ฑํ•˜๋Š” ๋ฐฉ์‹ ์š”์•ฝ(Abstract) : ์ด๋ฏธ์ง€์—์„œ ์ธ๊ฐ„์˜ 3D ํฌ์ฆˆ์™€ ํ˜•ํƒœ๋ฅผ ์ž๋™์œผ๋กœ ์ถ”์ •ํ•˜๊ณ ์ž ํ•จ : CNN ๊ธฐ๋ฒ• Deepcut์„ ํ™œ์šฉ, 3D SMPL์˜ ๊ฒฐํ•ฉ : Datasets์˜ ๊ฒฝ์šฐ, Leeds Spors, HumanEva, Human3.6M ์‚ฌ์šฉ ์ด์  ๋ฐ ํŠน์ง•(Introduction) : ์ด์ „ ๋ฐฉ์‹์˜ ๊ฒฝ์šฐ, ํฌ์ฆˆ ์ดˆ์ ์—๋งŒ ๋งž์ท„๊ณ , 3D ํ˜•ํƒœ๋ฅผ ๋ฌด์‹œํ–ˆ์Œ โžก๏ธ 2D ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ํฌ์ฆˆ์™€ ํ˜•ํƒœ๋ฅผ ๋ชจ๋‘ ํฌํ•จํ•˜๋Š” 3D ๋ฉ”์‰ฌ๋ฅผ ์ž๋™์œผ๋กœ ์ถ”์ •ํ•˜๋Š” ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•จ Deepcut ํ™œ์šฉํ•ด 2D ๊ด€์ ˆ ์ถ”์ • โ€ป DeepCut์ด๋ž€ ๐Ÿ“š [์ฐธ๊ณ ] https://arxiv.or.. 2023. 7. 31.
SMPL: A Skinned Multi-Person Linear Model ๐Ÿ’ก SMPL: A Skinned Multi-Person Linear Model ๋ชฉ์ฐจ SMPL์˜ ์ •์˜ SMPL ์—ฐ๊ตฌ ๋ชฉ์  ๊ธฐ์กด์˜ ๋ฐฉ๋ฒ• ๋ฐ ํ•œ๊ณ„ SMPL์˜ ์›๋ฆฌ์™€ ์ž‘๋™ ์ตœ์ข… DMPL SMPL(Skinned Multi-Person Linear) ์ด๋ž€? [์ฐธ๊ณ ] โžก๏ธ SMPL์— ๋Œ€ํ•œ ๊ฐ„๋‹จํ•œ ์„ค๋ช… : skinned vertex ๊ธฐ๋ฐ˜์˜ ๋ชจ๋ธ๋กœ์„œ, ๋‹ค์–‘ํ•œ ์ธ๊ฐ„์˜ ์ฒดํ˜•์„ ํ˜„์‹ค์ ์œผ๋กœ ๋‚˜ํƒ€๋‚ด๊ณ  ์ž์—ฐ์Šค๋Ÿฌ์šด ์ž์„ธ์— ๋”ฐ๋ฅธ ๋ณ€ํ˜•์„ ์ทจํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์—ฐ์กฐ์ง ์›€์ง์ž„์„ ๋‚˜ํƒ€๋‚ธ๋‹ค. โ€ป skinned vertex: ์Šคํ‚จ(ํ”ผ๋ถ€)์„ ์”Œ์›Œ์ง„ ๋ผˆ๊ตฌ์กฐ์— ์†ํ•œ ๋ฉ”์‹œ์˜ ์ •์  (๋ผˆ์˜ ์›€์ง์ž„์— ๋”ฐ๋ผ ๋ณ€ํ˜•๋˜๋Š” ์ •์ ) SMPL ์—ฐ๊ตฌ์˜ ๋ชฉ์  : ๋‹ค์–‘ํ•œ ์ฒดํ˜•์„ ๋Œ€ํ‘œํ•  ์ˆ˜ ์žˆ๋Š” ์‹ค์ œ์ ์ธ ์• ๋‹ˆ๋ฉ”์ด์…˜ ์ธ๊ฐ„ ์‹ ์ฒด๋ฅผ ๋งŒ๋“ค๊ณ , ์ž์—ฐ์Šค๋Ÿฌ์šด ์ž์„ธ์— ๋”ฐ๋ผ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋ณ€ํ˜•๋˜๋ฉฐ, .. 2023. 7. 28.
[3์ฃผ์ฐจ] ๊ฑด๋ฌผ 3Dํ™” ๋ชจ๋ธ ์ฐพ๊ธฐ GitHub - chrise96/3D_building_reconstruction: MSc Computer Science project. Automatically enhance CityGML LOD2 buildings with facade details, by using a panoramic image sequence and building footprint data. NOTE: Amsterdam Panorama API is currently offline. MSc Computer Science project. Automatically enhance CityGML LOD2 buildings with facade details, by using a panoramic image sequence and buil.. 2023. 7. 28.
DETR: End-to-End Object Detection with Transformers ๐Ÿ“ ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” object detection์„ direct set prediction(์ผ๋Œ€์ผ๋Œ€์‘)์œผ๋กœ ์ •์˜, transformer์™€ bipartite matching loss๋ฅผ ์‚ฌ์šฉํ•œ DETR(DEtection TRansformer)์„ ์ œ์•ˆํ•จ. DETR์€ COCO dataset์— ๋Œ€ํ•˜์—ฌ Faster R-CNN๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€์˜ ์„ฑ๋Šฅ์„ ๋ณด์ž„ ์ถ”๊ฐ€์ ์œผ๋กœ, self-attention์„ ํ†ตํ•œ global information(์ „์—ญ ์ •๋ณด)๋ฅผ ํ™œ์šฉํ•จ์œผ๋กœ์จ ํฌ๊ธฐ๊ฐ€ ํฐ ๊ฐ์ฒด๋ฅผ Faster R-CNN๋ณด๋‹ค ํ›จ์”ฌ ์ž˜ ํฌ์ฐฉ. ๐Ÿ“ 1. Backbone(ResNet)์„ ์ž…๋ ฅํ•ด์„œ ํ”ผ์ฒ˜๋งต์„ ์ถ”์ถœ 2. ํ”ผ์ฒ˜๋งต์„ 1x1 conv์— ์ž…๋ ฅํ•ด์„œ flattenํ•œ ํ”ผ์ฒ˜๋งต์— ๋Œ€ํ•ด positional encoding ๊ตฌํ•ด์„œ ๋”ํ•จ โ€ป spatial.. 2023. 7. 23.
728x90
๋ฐ˜์‘ํ˜•