๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90
๋ฐ˜์‘ํ˜•

CV15

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization CAM(Class Activation Maps) ์ด๋ž€? Global Max Pooling(GMP) vs Global Average Pooling(GAP) : ์ „์ฒด ์˜์—ญ ๋‚ด์—์„œ ๊ฐ€์žฅ ํฐ ๊ฐ’์„ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ Global Max Pooling(GMP)๋ผ๊ณ  ํ•จ : ๋ฐ˜๋ฉด, ๋ชจ๋“  ๊ฐ’์„ ๊ณ ๋ คํ•˜์—ฌ ํ‰๊ท ๊ฐ’์„ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ Global Average Pooling(GAP)์ด๋ผ๊ณ  ํ•จ : ๋ณดํ†ต CNN์˜ ๊ตฌ์กฐ์—์„œ๋Š”๋งˆ์ง€๋ง‰ feature map์„ flattenํ•˜์—ฌ 1์ฐจ์› ๋ฒกํ„ฐ๋กœ ๋งŒ๋“  ๋’ค ์ด๋ฅผ Fully Connected Netowork๋ฅผ ํ†ต๊ณผํ•˜์—ฌ softmax๋กœ classification์„ ํ–ˆ์—ˆ์Œ. : ์ด FC layer๋Š” parameter์˜ ๊ฐœ์ˆ˜๋ฅผ ๋งค์šฐ ์ปค์ง€๋„๋ก ๋งŒ๋“ค๊ธฐ ๋•Œ๋ฌธ์— overfitting ์œ„ํ—˜์ด ์ฆ๊ฐ€ํ•  ์ˆ˜ ์žˆ๊ณ , F.. 2023. 8. 13.
DINO: Emerging Properties in Self-Supervised Vision Transformers (2021) Self Supervised learning https://brunch.co.kr/@b047a588c11b462/45 : ๋น„์ง€๋„ ํ•™์Šต ๋ฐฉ์‹์˜ ์ผ์ข…์œผ๋กœ์„œ ๋ผ๋ฒจ๋ง๋˜์ง€ ์•Š์€ ๋ฐ์ดํ„ฐ์…‹์„ ํ™œ์šฉํ•˜์—ฌ ์ธ๊ณต์ง€๋Šฅ์ด ์Šค์Šค๋กœ ๋ถ„๋ฅ˜์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๋„๋ก ํ•จ : ์Šค์Šค๋กœ ํƒœ์Šคํฌ๋ฅผ ์„ค์ •ํ•˜์—ฌ ๋ชจ๋ธ์„ ํ•™์Šตํ•œ๋‹ค๋Š” ์ ์—์„œ ๊ธฐ์กด์˜ ๋น„์ง€๋„ ํ•™์Šต ๋ฐฉ์‹๊ณผ ์ฐจ์ด๊ฐ€ ์กด์žฌํ•˜๋ฉฐ, ์ธํ„ฐ๋„ท์ƒ ํฌ๋กค๋ง์„ ํ†ตํ•ด ์ˆ˜์ง‘ํ•  ์ˆ˜ ์žˆ๋Š” ํ…์ŠคํŠธ, ์ด๋ฏธ์ง€, ๋น„๋””์˜ค ๋“ฑ ๋‹ค์–‘ํ•œ ์ข…๋ฅ˜์˜ ๋ฐ์ดํ„ฐ์…‹์„ ํ™œ์šฉํ•  ์ˆ˜๋„ ์žˆ์Œ : ๋ชจ๋ธ์ด ํ™•์žฅ๋˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋Œ€๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ•„์š”๋กœ ํ•˜์ง€๋งŒ, ๋ผ๋ฒจ๋ง๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์ง€์†์ ์œผ๋กœ ํ™•๋ณดํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋งŽ์€ ๋น„์šฉ์ด ์š”๊ตฌ๋œ๋‹ค๋Š” ๋‹จ์ ์ด ์กด์žฌ : ์ž๊ธฐ ์ง€๋„ ํ•™์Šต์€ ๋ผ๋ฒจ๋ง๋˜์ง€ ์•Š์€ ํ•™์Šต ๋ฐ์ดํ„ฐ๋งŒ ํ™•๋ณดํ•˜๋”๋ผ๋„ ๋ชจ๋ธ์˜ ๊ทœ๋ชจ๋ฅผ ์ฆ๊ฐ€์‹œํ‚ฌ ์ˆ˜ ์žˆ์œผ๋ฉฐ ์ด์— ๋”ฐ๋ผ ์ •ํ™•๋„ ์—ญ์‹œ ํ–ฅ์ƒ์‹œํ‚ฌ.. 2023. 8. 10.
[3์ฃผ์ฐจ] ๊ฑด๋ฌผ 3Dํ™” ๋ชจ๋ธ ์ฐพ๊ธฐ GitHub - chrise96/3D_building_reconstruction: MSc Computer Science project. Automatically enhance CityGML LOD2 buildings with facade details, by using a panoramic image sequence and building footprint data. NOTE: Amsterdam Panorama API is currently offline. MSc Computer Science project. Automatically enhance CityGML LOD2 buildings with facade details, by using a panoramic image sequence and buil.. 2023. 7. 28.
DETR: End-to-End Object Detection with Transformers ๐Ÿ“ ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” object detection์„ direct set prediction(์ผ๋Œ€์ผ๋Œ€์‘)์œผ๋กœ ์ •์˜, transformer์™€ bipartite matching loss๋ฅผ ์‚ฌ์šฉํ•œ DETR(DEtection TRansformer)์„ ์ œ์•ˆํ•จ. DETR์€ COCO dataset์— ๋Œ€ํ•˜์—ฌ Faster R-CNN๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€์˜ ์„ฑ๋Šฅ์„ ๋ณด์ž„ ์ถ”๊ฐ€์ ์œผ๋กœ, self-attention์„ ํ†ตํ•œ global information(์ „์—ญ ์ •๋ณด)๋ฅผ ํ™œ์šฉํ•จ์œผ๋กœ์จ ํฌ๊ธฐ๊ฐ€ ํฐ ๊ฐ์ฒด๋ฅผ Faster R-CNN๋ณด๋‹ค ํ›จ์”ฌ ์ž˜ ํฌ์ฐฉ. ๐Ÿ“ 1. Backbone(ResNet)์„ ์ž…๋ ฅํ•ด์„œ ํ”ผ์ฒ˜๋งต์„ ์ถ”์ถœ 2. ํ”ผ์ฒ˜๋งต์„ 1x1 conv์— ์ž…๋ ฅํ•ด์„œ flattenํ•œ ํ”ผ์ฒ˜๋งต์— ๋Œ€ํ•ด positional encoding ๊ตฌํ•ด์„œ ๋”ํ•จ โ€ป spatial.. 2023. 7. 23.
Taskonomy: Disentangling Task Transfer Learning ๐Ÿ’ก ๊ฐ€ ๋ญ๋ƒ? Taskonomy๋Š” ์ปดํ“จํ„ฐ ๋น„์ „ ๋ถ„์•ผ์—์„œ ๋‹ค์–‘ํ•œ ์ž‘์—… ๊ฐ„์˜ ์ƒํ˜ธ ์˜์กด์„ฑ์„ ํƒ๊ตฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ฒ”์šฉ ๋น„์ „ ์‹œ์Šคํ…œ์„ ๊ตฌ์ถ•ํ•˜๋Š” ์—ฐ๊ตฌ. Taskonomy๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•ด ํ•„์š”ํ•œ ์‹œ๊ฐ์  ํŠน์ง•๋“ค์ด ์„œ๋กœ ๊ณต์œ ๋  ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ์กฐ์‚ฌํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ชจ๋ธ์˜ ํ•™์Šต ํšจ์œจ์„ฑ๊ณผ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์ „์ด ํ•™์Šต ๋ฐฉ๋ฒ•์„ ํƒ๊ตฌ. Taskonomy์˜ ๋ชฉํ‘œ๋Š” ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ์‹œ๊ฐ์  ํŠน์ง•์„ ํƒ์ƒ‰ํ•˜์—ฌ, ์ž‘์—… ๊ฐ„์˜ ํ•™์Šต๊ณผ ์ผ๋ฐ˜ํ™”๋ฅผ ๊ฐœ์„ ํ•˜๊ณ  ์ž‘์—… ์ „ํ™˜์— ๋”ฐ๋ฅธ ๋น„์šฉ๊ณผ ๋…ธ๋ ฅ์„ ์ตœ์†Œํ™”ํ•˜๋Š” ๊ฒƒ. ๐Ÿ€ ๋…ผ๋ฌธ ์š”์•ฝ: ์—ฌ๋Ÿฌ ์ž‘์—…๋“ค ๊ฐ„์— ๊ณต์œ  ๊ฐ€๋Šฅํ•œ ํŠน์ง•๋“ค์„ ๋ฐœ๊ฒฌํ•˜๊ณ  ์ด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ•œ ๋ชจ๋ธ์ด ๋‹ค์–‘ํ•œ ์ž‘์—…(๊ฐ์ฒด ๊ฒ€์ถœ, ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜, ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜)์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ. ์ด๋ฅผ ์œ„ํ•ด ๋‹ค์–‘ํ•œ ์ž‘์—…๋“ค์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ ์ˆ˜์ง‘.. 2023. 7. 16.
Noisy Student: Self-training with Noisy Student improves ImageNet classification(2019) ๋ฆฌ๋ทฐ๋Š” ์•„๋ž˜์ชฝ์— ์žˆ์Šต๋‹ˆ๋‹น ! ! ๋ฒˆ์—ญ ver 0. Abstract ์šฐ๋ฆฌ๋Š” Noisy Student Training์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” ๋ ˆ์ด๋ธ”์ด ํ’๋ถ€ํ•œ ๊ฒฝ์šฐ์—๋„ ์ž˜ ์ž‘๋™ํ•˜๋Š” ์ค€์ง€๋„ ํ•™์Šต ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. Noisy Student Training์€ ImageNet์—์„œ 88.4%์˜ top-1 ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ด๋Š” 35์–ต ๊ฐœ์˜ ์•ฝํ•œ ๋ ˆ์ด๋ธ”์ด ๋ถ€์ฐฉ๋œ Instagram ์ด๋ฏธ์ง€๊ฐ€ ํ•„์š”ํ•œ ์ตœ์ฒจ๋‹จ ๋ชจ๋ธ๋ณด๋‹ค 2.0% ๋” ๋†’์€ ์„ฑ๋Šฅ์ž…๋‹ˆ๋‹ค. ๊ฐ•๊ฑด์„ฑ ํ…Œ์ŠคํŠธ ์„ธํŠธ์—์„œ๋Š” ImageNet-A์˜ top-1 ์ •ํ™•๋„๋ฅผ 61.0%์—์„œ 83.7%๋กœ ํ–ฅ์ƒ์‹œํ‚ค๋ฉฐ, ImageNet-C์˜ ํ‰๊ท  ์†์ƒ ์˜ค์ฐจ๋ฅผ 45.7์—์„œ 28.3์œผ๋กœ ์ค„์ด๊ณ , ImageNet-P์˜ ํ‰๊ท  ๋’ค์ง‘๊ธฐ ๋น„์œจ์„ 27.8์—์„œ 12.2๋กœ ์ค„์ž…๋‹ˆ๋‹ค. Noisy Student Train.. 2023. 7. 14.
728x90
๋ฐ˜์‘ํ˜•