> HOME > Data List > Rakuten Data Set

Rakuten Data Set

NII provides the "Rakuten Data" to researchers according to the contract between NII and Rakuten, Inc.

update: 2019-07-05

Outline of the Data

  1. Rakuten Ichiba: All product data (Approx. 156 million items), review data (Approx. 64 million reviews)
  2. Rakuten Travel: Facility data (Approx. 128,000 facilities), review data (Approx. 5.6 million reviews)
  3. Rakuten GORA (Rakuten's golf service): Facility data (1,669 facilities), review data (320,000 reviews)
  4. Rakuten Recipe: Recipe data (Approx. 800,000 recipes), recipe images (Approx. 800,000 images), Pickup recipe (1,854 recipes), Daylicious news (362 articles)
  5. Annotated data:
    • Tsukuba sentiment-tagged corpus (TSUKUBA corpus): corpus with sentiment polarity information for each sentence of Rakuten Travel's review data provided by University of Tsukuba
    • Product images dataset with category label: image dataset of products which belong to Rakuten genres corresponding to some categories in Caltech-256 dataset
    • Images with character area: images with rectangle coordinates of character area
    • Floor plan from Rakuten Real Estate and pixel-wise wall label: floor plan images (powered by LIFULL Co., Ltd., 500 images) and annotated wall label with pixel
    • Rakuten France: user review, products reviews interests: data used in "Challenge Data 2016-2017: Prediction of products reviews interests" (in French)
    • Rakuten France: book and author information: book item information, annotated book item information with normalized author name (added on 2019-07-05)

Please see "Rakuten Data Release" for details.

The same data is available also from Advanced Language Information Forum (ALAGIN).

Update Information

  • The data of "Rakuten France: book and author information" (Annotated Data) was newly released. (2019/07/05)
  • Name of "PriceMinister: user review, products reviews interests" was changed to "Rakuten France: user review, products reviews interests", and it was moved to "Annotated Data". (2019/07/05)
  • The data of "Floor Plan from Rakuten Real Estate (powered by LIFULL Co., Ltd.) and Pixel-wise Wall Label" (Annotated Data) was newly released. (2017/11/28)
  • Distribution of "Rakuten Viki" data was terminated. (2017/11/28)
  • "PriceMinister" data was newly released. (2017/04/03)
  • Distribution of Rakuten Auction data was terminated, because the Rakuten Auction Service was closed. (2016/10/31)
  • "Rakuten Viki" data was newly released and "Rakuten Travel" data and "Rakuten Recipe" data were updated. (2016/01/12)
  • Three kinds of "Annotated data" were newly released and "Rakuten Travel" data was updated. (2014/09/30)
  • "Rakuten Auction" data was newly released and "Rakuten Ichiba" data was updated. (2014/04/01)
  • "Rakuten Recipe" data was newly released and "Rakuten Travel" data was updated. (2012/08/07)
  • "Rakuten Ichiba" data was updated. (2011/08/23)
  • Distribution of "Rakuten Data Set" was started. (2010/08/04)

User Qualification

A researcher belonging to a university or a public research institution only can apply for the use of the Data. Application from those belonging to a private company, etc. will not be accepted. For more details, please e-mail to the IDR office shown in the "contact" section below.

Application

Please submit an application following the procedure shown below. The data is available free for charge. The required documents can be downloaded from the links in "documents" section below.

  1. Please read the contents of "Agreement on the usage of Rakuten Data (sample)" carefully and confirm if it is acceptable for you (and your organization), and fill out the "Application Form" following the items below:

    1. An application should be made for each user group such as a laboratory in a university, and the applicant should be a principal investigator in the group, e.g., a professor at a university or a head researcher at a research institution.

    2. The siger of the Agreement should be a person authorized to sign and seal the agreement on behalf of your organization and having an official seal (typically, Dean of a school or the upper for the case of a university). Please consult with your administrative section about the qualified signer beforehand and enter formal information in full for "Signer" as to be printed in the Agreement.

    3. "Research group members" are restricted to the researchers and students belonging to the abovementioned user group and doing research under supervision of the applicant. When someone belonging to a different organization or a separate laboratory, even if in a joint research, would use the data, a separate application should be made.

  2. Please e-mail the application form as an attachment file to the IDR office shown in the "contact" section below.

    1. The subject of the email should be "Application for the Rakuten Data Set (Xxxx University)". If the subject is not appropriate, the email may be discarded without its content reviewed.

    2. In case you make applications for other datasets at the same time, please send each one with a separate email.

    3. Please note that your application will be forwarded to Rakuten, Inc. and will be used for judging the qualification, preparing an agreement, and managing the users.

  3. Your application will be reviewed at the IDR office and the availability of the data will be emailed to you. If you do not receive a reply email in a week, please contact the IDR office.

    *** Please understand that, since the user qualification conditions are set by Rakuten, Inc., there may be a case we cannot provide the data to you.

  4. Your organization and Rakuten, Inc. conclude an agreement.

    1. Rakuten, Inc. will send two copies of "Agreement" to you by post.

    2. Fill out with required information, sign and seal, and send by post both of the two copies to the "Postal mail address for agreements (Rakuten, Inc.)" shown below.

    3. *** Please follow instructions of Rakuten, Inc. if enclosed.

    4. Rakuten, Inc. will return one of the copies sealed by Rakuten, Inc. Please keep it safe.

  5. The IDR office will provide the data when noticed by Rakuten, Inc.. If you do not receive the download instruction within a few days after you receive the sign-and-sealed Agreement, please contact the IDR office.

Data provision

The data will be provided by downloading from the IDR's Web server. If you cannot download the data for some technical reason, please consult us.

Documents

Usage report

  • When you make the research result public, you are required to give Rakuten, Inc. notice of the publication content, date, place, etc. at least 10 days before the submission or 30 days before the presentation.
  • You are requested to submit a research report using the data once a year.

Contact (IDR Office)

IDR Office, National Institute of Informatics

E-mail:
idr [at] nii.ac.jp
Address:
2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, JAPAN

(Please use e-mail for communicating with us if not otherwise specified.)

Postal mail address for agreements (Rakuten, Inc.)

"Rakuten Data Release" Office, Rakuten Institute of Technology (R.I.T.)

Address:
Rakuten Crimson House,
1-14-1 Tamagawa, Setagaya-ku, Tokyo 158-0094, JAPAN
E-mail:
rit-rdr [at] mail.rakuten.com