> HOME > Data List > Rakuten Data Set

Rakuten Data Set

NII provides the "Rakuten Data" to researchers according to the contract between NII and Rakuten, Inc.

update: 2017-04-03

Outline of the Data

    1. Rakuten Ichiba: All product data (Approx. 156 million items), review data (Approx. 64 million reviews)
    2. Rakuten Travel: Facility data (Approx. 128,000 facilities), review data (Approx. 5.6 million reviews)
    3. Rakuten GORA (Rakuten's golf service): Facility data (1,669 facilities), review data (320,000 reviews)
    4. Rakuten Recipe: Recipe data (Approx. 800,000 recipes), recipe images (Approx. 800,000 images), Pickup recipe (1,854 recipes), Daylicious news (362 articles)
    5. Annotated data:
      • Tsukuba sentiment-tagged corpus (TSUKUBA corpus): corpus with sentiment polarity information for each sentence of Rakuten Travel's review data provided by University of Tsukuba
      • Product images dataset with category label: image dataset of products which belong to Rakuten genres corresponding to some categories in Caltech-256 dataset
      • Images with character area: images with rectangle coordinates of character area
    6. Rakuten Viki: Data used in "Rakuten-Viki Global TV recommender challenge" of 2015 (Video data (623 videos), user behavior (Approx. 4.9 million records))
    7. PriceMinister: Data used in "Challenge Data 2016-2017: Prediction of products reviews interests" (User review (training: 80,000 records / test: 36,395 records), products reviews interests (training: 80,000 records) in French) (added on 2017-04-03)

Please see "Rakuten Data Release" for details.

The same data will be available also from Advanced Language Information Forum (ALAGIN).

Update Information

  • New data was released. Present users can download the data from the data distribution site with no additional procedure. (2017/04/03)
  • Distribution of Rakuten Auction data was terminated, because the Rakuten Auction Service was closed. (2016/10/31)
  • Update and new data were released. Present users can download the data from the data distribution site with no additional procedure. (2016/01/12)
  • Update and addition of data were released. Present users can download the data from the data distribution site with no additional procedure. (2014/09/30)
  • Update and new data were released. Present users can download the data from the data distribution site with no additional procedure. (2014/04/01)

Usage Conditions

The data set will be provided to a user group which belongs to a university or a public research institution under the following conditions. For more details, please e-mail to the IDR office shown in the "contact" section below.

  • A Researcher belonging to a university or a public research institution can apply for the data. Those belonging to a private company, etc. cannot apply.
  • An application should be made for each user group such as a laboratory of a university, and its applicant should be a full-time researcher representing the research group, e.g., a professor at a university or a head researcher at a research institution.
  • Those who can use the data are restricted to the members belonging to the abovementioned user group and doing research directly in collaboration with or under supervision of the applicant. When someone belonging to a different organization or a separate laboratory, even if in a joint research, would use the data, a separate application should be made.
  • The siger of the Agreement should be a person with the authority to sign and seal agreements on behalf of your organization and having an official seal. Please consult with your administrative section about the qualified signer beforehand.

Application

Please submit an application following the procedure shown below. The data is available free for charge. The required documents (except "Agreement form") can be downloaded from the links in "documents" section below.

  1. Please read the contents of "Agreement on the usage of Rakuten Data (sample)" carefully and confirm if it is acceptable for you (and your organization).

  2. Fill out the "Application Form" with required information, and e-mail it as an attachment file to the IDR office shown in the "contact" section below. For "Signer", please enter formal information in full as to be printed in the Agreement.

  3. *** Please note that your application will be forwarded to Rakuten, Inc. and will be used for judging the qualification, preparing an agreement, and listing up the users.

  4. Your application will be checked at the IDR office and the availability of the data will be e-mailed to you. If you do not receive an reply e-mail in a week, please contact the IDR office.

    *** Please understand that, since the user qualification conditions are set by Rakuten, Inc., there may be a case we cannot provide the data to you.

  5. You and Rakuten, Inc. conclude an agreement.

    1. Rakuten, Inc. will send two copies of "Agreement form" to you by postal mail.

    2. Fill out with required information, sign and seal, and send by postal mail both of the two copies to the "Postal mail address for agreements (Rakuten, Inc.)" shown below.
    3. *** Please follow instructions of Rakuten, Inc. if one is enclosed.

    4. Rakuten, Inc. will return one of the copies sealed by Rakuten, Inc. Please keep it in a secure place.

  6. The IDR office will provide the data when noticed by Rakuten, Inc..

Data provision

The data will be provided by downloading from the IDR's Web server. If you cannot download the data for some technical reason, please consult us.

Documents

Contact (IDR Office)

IDR Office, National Institute of Informatics

E-mail:
idr [at] nii.ac.jp
Address:
2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, JAPAN

(Please use e-mail for communicating with us if not otherwise specified.)

Postal mail address for agreements (Rakuten, Inc.)

"Rakuten Data Release" Office, Rakuten Institute of Technology (R.I.T.)

Address:
Rakuten Crimson House,
1-14-1 Tamagawa, Setagaya-ku, Tokyo 158-0094, JAPAN
E-mail:
rit-rdr [at] mail.rakuten.com