Research Seeds 2016: Information Media Science

3D Digitization through the World’s Photo

Geometric and Photometric Modeling and Inference of Large-Scale Internet Image Collections

鄭 銀強 (Yinqiang Zheng), Assistant Professor, Digital Content and Media Sciences Research Division

Research fields: Informatics / Perceptual Information Processing / Computer Vision

Research Background and Objectives

Famous landmarks and tourist sites around the world, such as the Trevi Fountain in Rome and Tokyo Station, have been photographed hundreds of thousands of times, at different times and places, by different photographers.

Now that online photo- and video-sharing sites such as Flickr and YouTube have become popular, a portion of these images is publicly available on the Internet. These Internet image collections are enormous in quantity and extremely diverse in viewpoint, color tone, lighting and shadow effects, and so on. If the embedded viewpoint and appearance variations can be properly exploited, large-scale Internet image collections can offer abundant in-depth information, including 3D geometric structure, surface reflectance properties, texture, and more, that is not available from small, scattered image sets.

Research Content

Our first-hand experience indicates that traditional models and inference algorithms are inadequate for large-scale Internet image collections because of their poor accuracy and/or computational inefficiency. The primary research topics therefore include:

  • Incremental 3D reconstruction pipeline without using intermediate 3D information;
  • Fast minimal problem solvers for relative and absolute pose estimation;
  • Reflectance and texture recovery via intrinsic image decomposition guided by sparse correspondence.
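
As a rough illustration of the last topic, the sketch below shows how correspondences across images can constrain an intrinsic decomposition. It is not the author's method. It assumes a deliberately simplified model in which the observed intensity of matched scene point i in image j is the product of a per-point reflectance R_i and a single per-image shading scalar S_j, and it fixes the scale ambiguity by forcing the shading values to have a geometric mean of 1. The function name and the data layout are hypothetical.

```python
import math


def decompose_log_intensities(intensities):
    """Toy intrinsic decomposition from matched points (illustrative only).

    intensities[i][j] is the positive intensity of matched scene point i
    observed in image j. Assumed model: I_ij = R_i * S_j, i.e.
    log I_ij = log R_i + log S_j, with the scale ambiguity removed by
    requiring sum_j log S_j = 0.
    """
    n_points = len(intensities)
    n_images = len(intensities[0])
    logs = [[math.log(v) for v in row] for row in intensities]

    # Least-squares solution for a fully observed matrix under the
    # constraint above: row means give log R_i, and centered column
    # means give log S_j.
    row_means = [sum(row) / n_images for row in logs]
    overall_mean = sum(row_means) / n_points
    col_means = [
        sum(logs[i][j] for i in range(n_points)) / n_points
        for j in range(n_images)
    ]

    reflectance = [math.exp(m) for m in row_means]
    shading = [math.exp(c - overall_mean) for c in col_means]
    return reflectance, shading
```

Calling `decompose_log_intensities` on a small matrix of matched intensities returns one reflectance estimate per point and one shading factor per image. Real Internet photos would need per-pixel shading, missing observations, and robustness to mismatches, none of which this sketch handles.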


Potential Industrial Applications

  • Photo tourism with user designations
  • Mobile image-based localization
  • Posterior high-fidelity image editing

Contact

鄭 銀強 (Yinqiang Zheng) [Assistant Professor, Digital Content and Media Sciences Research Division]
yqzheng[at]nii.ac.jp
