이미지넷

이미지넷(ImageNet) 프로젝트는 시각적 개체 인식 소프트웨어 연구에 사용하도록 설계된 대규모 시각적 데이터베이스이다. 이 프로젝트에서는 어떤 물체가 묘사되어 있는지를 나타내기 위해 1,400만 개^[1]^[2]가 넘는 이미지에 손으로 주석을 달았으며, 최소 100만 개 이상의 이미지에 경계 상자도 제공되었다.^[3] 이미지넷에는 수백 개의 이미지로 구성된 "풍선" 또는 "딸기"와 같은 일반적인 범주를 포함하여 20,000개 이상^[2]의 범주가 포함되어 있다.^[4] 제3자 이미지 URL의 주석 데이터베이스는 이미지넷에서 직접 무료로 사용할 수 있지만 실제 이미지는 이미지넷의 소유가 아니다.^[5] 2010년부터 이미지넷 프로젝트는 이미지넷 ILSVRC(Large Scale Visual Recognition Challenge)라는 소프트웨어 콘테스트를 매년 개최하고 있다. 이 콘테스트에서는 소프트웨어 프로그램이 개체와 장면을 올바르게 분류하고 감지하기 위해 경쟁한다. 이 챌린지는 겹치지 않는 클래스 1,000개의 "조정된" 목록을 사용한다.^[6]

딥러닝의 중요성

2012년 9월 30일 알렉스넷(AlexNet)^[7]이라는 CNN(합성곱 신경망)은 이미지넷 2012 챌린지에서 상위 5위 오류인 15.3%를 달성했는데, 이는 준우승자보다 10.8%포인트 이상 낮은 수치이다. 이는 딥러닝 혁명의 필수 요소인 훈련 중에 그래픽 처리 장치(GPU)를 사용^[7]했기 때문에 가능해졌다. 이코노미스트에 따르면, 갑자기 사람들이 AI 커뮤니티뿐만 아니라 기술 산업 전반에 걸쳐 관심을 갖기 시작했다.^[4]^[8]^[9]

2015년에 알렉스넷은 100개 이상의 레이어를 갖춘 마이크로소프트의 매우 심층적인 CNN에 비해 성능이 뛰어났으며 이미지넷 2015 콘테스트에서 우승했다.^[10]

데이터베이스의 역사

AI 연구원 리페이페이(Fei-Fei Li)는 2006년부터 이미지넷에 대한 아이디어 작업을 시작했다. 대부분의 AI 연구가 모델과 알고리즘에 중점을 두던 당시 리페이페이는 AI 알고리즘을 훈련하는 데 사용할 수 있는 데이터를 확장하고 개선하기를 원했다.^[11] 2007년에 리페이페이는 워드넷 창시자 중 한 명인 프린스턴 대학교 교수 크리스티안 펠바움(Christiane Fellbaum)을 만나 프로젝트에 대해 논의했다. 이 회의의 결과로 리페이페이는 워드넷의 단어 데이터베이스에서 시작하여 그 기능을 많이 사용하여 이미지넷을 구축했다.^[12]

프린스턴 대학교의 조교수로서 Li는 이미지넷 프로젝트를 진행하기 위해 연구원 팀을 구성했다. 그들은 이미지 분류를 돕기 위해 아마존 메커니컬 터크(Amazon Mechanical Turk)를 사용했다.^[12]

그들은 플로리다에서 열린 2009년 컴퓨터 비전 및 패턴 인식 컨퍼런스(CVPR)에서 자신들의 데이터베이스를 포스터로 처음으로 발표했다.^[12]^[13]^[14]

같이 보기

각주

↑ “New computer vision challenge wants to teach robots to see in 3D”. 《New Scientist》. 2017년 4월 7일. 2018년 2월 3일에 확인함.
↑ ^가 ^나 Markoff, John (2012년 11월 19일). “For Web Images, Creating New Technology to Seek and Find”. 《The New York Times》. 2018년 2월 3일에 확인함.
↑ “ImageNet”. 2020년 9월 7일. 2020년 9월 7일에 원본 문서에서 보존된 문서. 2022년 10월 11일에 확인함.
↑ ^가 ^나 “From not working to neural networking”. 《The Economist》. 2016년 6월 25일. 2018년 2월 3일에 확인함.
↑ “ImageNet Overview”. ImageNet. 2022년 10월 15일에 확인함.
↑ Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. (* = equal contribution) ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.
↑ ^가 ^나 Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. (June 2017). “ImageNet classification with deep convolutional neural networks” (PDF). 《Communications of the ACM》 60 (6): 84–90. doi:10.1145/3065386. ISSN 0001-0782. S2CID 195908774. 2017년 5월 24일에 확인함.
↑ “Machines 'beat humans' for a growing number of tasks”. 《Financial Times》. 2017년 11월 30일. 2018년 2월 3일에 확인함.
↑ Gershgorn, Dave (2018년 6월 18일). “The inside story of how AI got good enough to dominate Silicon Valley”. 《Quartz》. 2018년 12월 10일에 확인함.
↑ He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian (2016). 〈Deep Residual Learning for Image Recognition〉. 《2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)》. 770–778쪽. arXiv:1512.03385. doi:10.1109/CVPR.2016.90. ISBN 978-1-4673-8851-1. S2CID 206594692.
↑ Hempel, Jesse (2018년 11월 13일). “Fei-Fei Li's Quest to Make AI Better for Humanity”. 《Wired》. 2019년 5월 5일에 확인함. When Li, who had moved back to Princeton to take a job as an assistant professor in 2007, talked up her idea for ImageNet, she had a hard time getting faculty members to help out. Finally, a professor who specialized in computer architecture agreed to join her as a collaborator.
↑ ^가 ^나 ^다 Gershgorn, Dave (2017년 7월 26일). “The data that transformed AI research—and possibly the world”. 《Quartz》. Atlantic Media Co. 2017년 7월 26일에 확인함. Having read about WordNet's approach, Li met with professor Christiane Fellbaum, a researcher influential in the continued work on WordNet, during a 2006 visit to Princeton.
↑ Deng, Jia; Dong, Wei; Socher, Richard; Li, Li-Jia; Li, Kai; Fei-Fei, Li (2009), 〈ImageNet: A Large-Scale Hierarchical Image Database〉 (PDF), 《2009 conference on Computer Vision and Pattern Recognition》, 2021년 1월 15일에 원본 문서 (PDF)에서 보존된 문서, 2017년 7월 26일에 확인함
↑ Li, Fei-Fei (2015년 3월 23일), 《How we're teaching computers to understand pictures》, 2018년 12월 16일에 확인함

외부 링크

이미지넷 - 공식 웹사이트

[New_Scientist-1] “New computer vision challenge wants to teach robots to see in 3D”. 《New Scientist》. 2017년 4월 7일. 2018년 2월 3일에 확인함.

[nytimes_2012-2] 가 ^나 Markoff, John (2012년 11월 19일). “For Web Images, Creating New Technology to Seek and Find”. 《The New York Times》. 2018년 2월 3일에 확인함.

[3] “ImageNet”. 2020년 9월 7일. 2020년 9월 7일에 원본 문서에서 보존된 문서. 2022년 10월 11일에 확인함.

[economist-4] 가 ^나 “From not working to neural networking”. 《The Economist》. 2016년 6월 25일. 2018년 2월 3일에 확인함.

[5] “ImageNet Overview”. ImageNet. 2022년 10월 15일에 확인함.

[ILJVRC-2015-6] Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. (* = equal contribution) ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.

[:0-7] 가 ^나 Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. (June 2017). “ImageNet classification with deep convolutional neural networks” (PDF). 《Communications of the ACM》 60 (6): 84–90. doi:10.1145/3065386. ISSN 0001-0782. S2CID 195908774. 2017년 5월 24일에 확인함.

[8] “Machines 'beat humans' for a growing number of tasks”. 《Financial Times》. 2017년 11월 30일. 2018년 2월 3일에 확인함.

[9] Gershgorn, Dave (2018년 6월 18일). “The inside story of how AI got good enough to dominate Silicon Valley”. 《Quartz》. 2018년 12월 10일에 확인함.

[microsoft2015-10] He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian (2016). 〈Deep Residual Learning for Image Recognition〉. 《2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)》. 770–778쪽. arXiv:1512.03385. doi:10.1109/CVPR.2016.90. ISBN 978-1-4673-8851-1. S2CID 206594692.

[WiredQuest-11] Hempel, Jesse (2018년 11월 13일). “Fei-Fei Li's Quest to Make AI Better for Humanity”. 《Wired》. 2019년 5월 5일에 확인함. When Li, who had moved back to Princeton to take a job as an assistant professor in 2007, talked up her idea for ImageNet, she had a hard time getting faculty members to help out. Finally, a professor who specialized in computer architecture agreed to join her as a collaborator.

[Gershgorn-12] 가 ^나 ^다 Gershgorn, Dave (2017년 7월 26일). “The data that transformed AI research—and possibly the world”. 《Quartz》. Atlantic Media Co. 2017년 7월 26일에 확인함. Having read about WordNet's approach, Li met with professor Christiane Fellbaum, a researcher influential in the continued work on WordNet, during a 2006 visit to Princeton.

[13] Deng, Jia; Dong, Wei; Socher, Richard; Li, Li-Jia; Li, Kai; Fei-Fei, Li (2009), 〈ImageNet: A Large-Scale Hierarchical Image Database〉 (PDF), 《2009 conference on Computer Vision and Pattern Recognition》, 2021년 1월 15일에 원본 문서 (PDF)에서 보존된 문서, 2017년 7월 26일에 확인함

[14] Li, Fei-Fei (2015년 3월 23일), 《How we're teaching computers to understand pictures》, 2018년 12월 16일에 확인함

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]