Ritendra Datta, Ph.D.

Datasets for Aesthetics Inference Experiments



The datasets below can be used for aesthetics inference experiments. Please refer to the following paper for more details about them. If you use any of these datasets in your research articles, we request that you cite this paper.

R. Datta, J. Li, and J. Z. Wang, Algorithmic Inferencing of Aesthetics and Emotion in Natural Images: An Exposition, Proc. IEEE ICIP, Special Session on Image Aesthetics, Mood and Emotion, San Diego, CA, 2008. [PDF]

The images themselves are not included in the datasets due to copyright issues, but methods for constructing the URLs of the images are described. It should be straightforward to obtain the images by crawling them. Please adhere to any copyright restrictions that apply.

If you have questions, clarifications, or errata, please do not hesitate to contact me.


Photo.net Dataset

This dataset contains 20,278 images with properties similar to those of the dataset described in the above paper. The only differences are that it is larger and that it includes only photos that have received at least 10 ratings. This threshold ensures greater stability in the average aesthetics scores. Please download the dataset and its description file by clicking on the links below and saving the files.

Dataset File
Dataset Description File
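
As a rough illustration of why the 10-rating threshold for this dataset helps, here is a minimal Python sketch. It assumes you have the individual ratings for each photo in a dictionary, which is a hypothetical structure and not the format of the dataset file. The function names are my own as well. The standard error of an average shrinks with the square root of the number of ratings, so photos with few ratings have noisier averages.

```python
import math
import statistics


def mean_and_standard_error(ratings):
    """Return (mean, standard error of the mean) for a list of ratings."""
    n = len(ratings)
    if n < 2:
        raise ValueError("need at least two ratings to estimate spread")
    mean = statistics.mean(ratings)
    # Sample standard deviation divided by sqrt(n): shrinks as n grows.
    std_err = statistics.stdev(ratings) / math.sqrt(n)
    return mean, std_err


def keep_well_rated(photos, min_ratings=10):
    """Keep only photos with at least `min_ratings` ratings.

    `photos` is a hypothetical dict mapping a photo ID to its list of ratings.
    """
    return {pid: r for pid, r in photos.items() if len(r) >= min_ratings}
```

With the same spread of ratings, going from 10 to 40 ratings halves the standard error of the average score.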




DPChallenge.com Dataset

This dataset contains 16,509 images as described in the paper above. Please download the dataset and its description file by clicking on the links below and saving the files.

Dataset File
Dataset Description File


The good ol' dataset from our ECCV 2006 Paper

If you are still interested in a direct comparison of your results with those in our ECCV paper, read on. I continue to insist that the bigger, more stable dataset above is the way to go.

The data consists of the Photo.net IDs and the numerical visual features described in the ECCV paper, for 3581 images. Because of copyright issues, we provide only hyperlinks to the original images, which are straightforward to crawl from the Photo.net website. Please refer to the data description file to work out the URL corresponding to each image. The data file is gzip-compressed, so you will need a utility such as gzip to decompress it. So here goes:

(a) all_features.dat.gz - Plain text file containing 3581 rows, one per Photo.net image.

(b) description.txt - File describing what each column means. Please refer to the ECCV paper to get more technical descriptions of these features.
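
If you would rather read the compressed file directly than unzip it first, here is a minimal Python sketch. It assumes the columns are separated by whitespace, which is my assumption and should be checked against description.txt. The function name `load_rows` is hypothetical. Fields are kept as strings, since the meaning and type of each column are defined in the description file.

```python
import gzip


def load_rows(path):
    """Read a gzip-compressed plain-text file into a list of rows.

    Each row is a list of string fields. Columns are assumed to be
    whitespace-separated (an assumption; see description.txt).
    """
    rows = []
    with gzip.open(path, "rt") as handle:
        for line in handle:
            fields = line.split()
            if fields:  # skip blank lines
                rows.append(fields)
    return rows
```

Calling `load_rows("all_features.dat.gz")` should give 3581 rows if the download is intact, one per Photo.net image.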

Note that some of the original aesthetics/originality scores may have changed since we collected the data, because the Photo.net community continues to rate these pictures.