> HOME > Data List

Dataset List

This page presents the list of datasets that NII provides for informatcis-related researchers. Some of the datasets are under preparation.

update: 2021-07-12

Yahoo! Dataset

The NII provides Yahoo! Dataset to researchers, which was offered by Yahoo! Japan Corporation.

  1. "Yahoo! Chiebukuro" Data (3rd edition)

Rakuten Dataset

The NII distributes Rakuten Dataset to researchers, which Rakuten Group, Inc. provides.

  1. Rakuten Ichiba: item data and review data
  2. Rakuten Travel: facilities data and review data
  3. Rakuten GORA: golf course data and review data
  4. Rakuten Recipe: recipe information and recipe image
  5. Annotated data -- 2021-03-22 UPDATE!!


The NII provides LIFULL HOME'S Dataset to researchers, which was offered by LIFULL Co., Ltd.

  1. Snapshot Data of Rentals
  2. High Resolution Floor Plan Image Data
  3. Monthly Data of Rentals and Sales

NTCIR Test Collection --- 2021-04-26 Update

Test collections that NTCIR Project organized by NII built. IDR provides the following test collections. The list of test collections provided by IDR is here. For other test collections that are provided by NTCIR secretariat, please refer to "Test Collections".

Speech Corpus --- 2021-07-12 Update

Speech corpora that Speech Resources Consortium established in NII accepted from various institutions and groups. These are provided by Speech Resources Consortium for the time being.

Video Database ´╝łterminated´╝ë

Video databases for evaluation of video processing built by VDBWG, SIG-PRMU, IEICE. Distribution of the data was terminated. (Mar, 2018)