research-article

SelPh: Progressive Learning and Support of Manual Photo Color Enhancement

Authors:
Yuki Koyama

University of Tokyo, Bunkyo-ku, Tokyo, Japan

University of Tokyo, Bunkyo-ku, Tokyo, Japan
View Profile

,
Daisuke Sakamoto

University of Tokyo, Bunkyo-ku, Tokyo, Japan

University of Tokyo, Bunkyo-ku, Tokyo, Japan
View Profile

,
Takeo Igarashi

University of Tokyo, Bunkyo-ku, Tokyo, Japan

University of Tokyo, Bunkyo-ku, Tokyo, Japan
View Profile

CHI '16: Proceedings of the 2016 CHI Conference on Human Factors in Computing SystemsMay 2016Pages 2520–2532https://doi.org/10.1145/2858036.2858111

Published:07 May 2016Publication History

CHI '16: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems

Pages 2520–2532

ABSTRACT

Color enhancement is a very important aspect of photo editing. Even when photographers have tens of or hundreds of photographs, they must enhance each photo one by one by manually tweaking sliders in software such as brightness and contrast, because automatic color enhancement is not always satisfactory for them. To support this repetitive manual task, we present self-reinforcing color enhancement, where the system implicitly and progressively learns the user's preferences by training on their photo editing history. The more photos the user enhances, the more effectively the system supports the user. We present a working prototype system called SelPh, and then describe the algorithms used to perform the self-reinforcement. We conduct a user study to investigate how photographers would use a self-reinforcing system to enhance a collection of photos. The results indicate that the participants were satisfied with the proposed system and strongly agreed that the self-reinforcing approach is preferable to the traditional workflow.

Supplemental Material

pn0464-file3.mp4

mp4

39.6 MB

Download

p2520-koyama.mp4

mp4

284.8 MB

Download

Available for Download

zip

pn0464-file4.zip (13.6 MB)

pn0464-file4.zip

References

Adobe Systems Inc. 2015a. Photoshop Elements Help | Auto Smart Tone. (2015). Retrieved September 23, 2015 from https://helpx.adobe.com/ photoshop-elements/using/auto-smart-tone.html.Google Scholar
Adobe Systems Inc. 2015b. Photoshop Help | Creating actions. (2015). Retrieved September 23, 2015 from https://helpx.adobe.com/photoshop/using/ creating-actions.html.Google Scholar
Luca Benedetti, Holger Winnemöller, Massimiliano Corsini, and Roberto Scopigno. 2014. Painting with Bob: Assisted Creativity for Novices. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (UIST '14). ACM, NY, NY, USA, 419--428. DOI: http://dx.doi.org/10.1145/2642918.2647415 Google ScholarDigital Library
Yoshua Bengio and Pascal Vincent. 2004. Locally Weighted Full Covariance Gaussian Density Estimation. Cirano working papers. CIRANO.Google Scholar
Floraine Berthouzoz, Wilmot Li, Mira Dontcheva, and Maneesh Agrawala. 2011. A Framework for Content-adaptive Photo Manipulation Macros: Application to Face, Landscape, and Global Manipulations. ACM Trans. Graph. 30, 5, Article 120 (Oct. 2011), 14 pages. DOI: http://dx.doi.org/10.1145/2019627.2019639 Google ScholarDigital Library
V. Bychkovsky, S. Paris, E. Chan, and F. Durand. 2011. Learning Photographic Global Tonal Adjustment with a Database of Input/Output Image Pairs. In Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '11). IEEE Computer Society, Washington, DC, USA, 97--104. DOI: http://dx.doi.org/10.1109/CVPR.2011.5995413 Google ScholarDigital Library
J.C. Caicedo, A. Kapoor, and Sing Bing Kang. 2011. Collaborative personalization of image enhancement. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. 249--256. DOI: http://dx.doi.org/10.1109/CVPR.2011.5995439 Google ScholarDigital Library
Huiwen Chang, Ohad Fried, Yiming Liu, Stephen DiVerdi, and Adam Finkelstein. 2015. Palette-based Photo Recoloring. ACM Trans. Graph. 34, 4, Article 139 (July 2015), 11 pages. DOI: http://dx.doi.org/10.1145/2766978 Google ScholarDigital Library
Fanny Chevalier, Pierre Dragicevic, and Christophe Hurter. 2012. Histomages: Fully Synchronized Views for Image Editing. In Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology (UIST '12). ACM, NY, NY, USA, 281--286. DOI: http://dx.doi.org/10.1145/2380116.2380152 Google ScholarDigital Library
Michael A. A. Cox and Trevor F. Cox. 2008. Multidimensional Scaling. In Handbook of Data Visualization. Springer Berlin Heidelberg, 315--347. DOI: http://dx.doi.org/10.1007/978-3-540-33037-0_14Google Scholar
Allen Cypher. 1991. EAGER: Programming Repetitive Tasks by Example. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '91). ACM, NY, NY, USA, 33--39. DOI: http://dx.doi.org/10.1145/108844.108850 Google ScholarDigital Library
Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z. Wang. 2006. Studying Aesthetics in Photographic Images Using a Computational Approach. In Proceedings of the 9th European Conference on Computer Vision - Volume Part III (ECCV'06). Springer-Verlag, Berlin, Heidelberg, 288--301. DOI: http://dx.doi.org/10.1007/11744078_23 Google ScholarDigital Library
Leah Findlater and Jacob Wobbrock. 2012. Personalized Input: Improving Ten-finger Touchscreen Typing Through Automatic Adaptation. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '12). ACM, NY, NY, USA, 815--824. DOI: http://dx.doi.org/10.1145/2207676.2208520 Google ScholarDigital Library
Krzysztof Z. Gajos, Mary Czerwinski, Desney S. Tan, and Daniel S. Weld. 2006. Exploring the Design Space for Adaptive Graphical User Interfaces. In Proceedings of the Working Conference on Advanced Visual Interfaces (AVI '06). ACM, NY, NY, USA, 201--208. DOI: http://dx.doi.org/10.1145/1133265.1133306 Google ScholarDigital Library
Floraine Grabler, Maneesh Agrawala, Wilmot Li, Mira Dontcheva, and Takeo Igarashi. 2009. Generating Photo Manipulation Tutorials by Demonstration. ACM Trans. Graph. 28, 3, Article 66 (July 2009), 9 pages. DOI: http://dx.doi.org/10.1145/1531326.1531372 Google ScholarDigital Library
Yoav HaCohen, Eli Shechtman, Dan B. Goldman, and Dani Lischinski. 2011. Non-rigid Dense Correspondence with Applications for Image Enhancement. ACM Trans. Graph. 30, 4, Article 70 (July 2011), 10 pages. DOI: http://dx.doi.org/10.1145/2010324.1964965 Google ScholarDigital Library
Yoav HaCohen, Eli Shechtman, Dan B. Goldman, and Dani Lischinski. 2013. Optimizing Color Consistency in Photo Collections. ACM Trans. Graph. 32, 4, Article 38 (July 2013), 10 pages. DOI: http://dx.doi.org/10.1145/2461912.2461997 Google ScholarDigital Library
R. Jaroensri, S. Paris, A. Hertzmann, V. Bychkovsky, and F. Durand. 2015. Predicting Range of Acceptable Photographic Tonal Adjustments. In Proceedings of the 2015 IEEE International Conference on Computational Photography (ICCP '15). IEEE Computer Society, Washington, DC, USA, 1--9. DOI: http://dx.doi.org/10.1109/ICCPHOT.2015.7168372Google ScholarCross Ref
Steven G. Johnson. 2015. The NLopt nonlinear-optimization package. (2015). http://ab-initio.mit.edu/nloptGoogle Scholar
Sing Bing Kang, A. Kapoor, and D. Lischinski. 2010. Personalization of image enhancement. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '10). IEEE Computer Society, Washington, DC, USA, 1799--1806. DOI: http://dx.doi.org/10.1109/CVPR.2010.5539850Google Scholar
Ashish Kapoor, Juan C. Caicedo, Dani Lischinski, and Sing Bing Kang. 2014. Collaborative Personalization of Image Enhancement. Int. J. Comput. Vision 108, 1--2 (May 2014), 148--164. DOI: http://dx.doi.org/10.1007/s11263-013-0675--3 Google ScholarDigital Library
Yan Ke, Xiaoou Tang, and Feng Jing. 2006. The Design of High-Level Features for Photo Quality Assessment. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1 (CVPR '06). IEEE Computer Society, Washington, DC, USA, 419--426. DOI: http://dx.doi.org/10.1109/CVPR.2006.303 Google ScholarDigital Library
Yuki Koyama, Daisuke Sakamoto, and Takeo Igarashi. 2014. Crowd-powered Parameter Analysis for Visual Design Exploration. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (UIST '14). ACM, NY, NY, USA, 65--74. DOI: http://dx.doi.org/10.1145/2642918.2647386 Google ScholarDigital Library
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25. Curran Associates, Inc., 1097--1105.Google ScholarDigital Library
Brian Kulis. 2013. Metric Learning: A Survey. Foundations and Trends® in Machine Learning 5, 4 (2013), 287--364. DOI: http://dx.doi.org/10.1561/2200000019Google ScholarCross Ref
Tessa Lau, Lawrence Bergman, Vittorio Castelli, and Daniel Oblinger. 2004. Sheepdog: Learning Procedures for Technical Support. In Proceedings of the 9th International Conference on Intelligent User Interfaces (IUI '04). ACM, NY, NY, USA, 109--116. DOI: http://dx.doi.org/10.1145/964442.964464 Google ScholarDigital Library
Congcong Li, Alexander C. Loui, and Tsuhan Chen. 2010. Towards Aesthetics: A Photo Quality Assessment and Photo Selection System. In Proceedings of the International Conference on Multimedia (MM '10). ACM, NY, NY, USA, 827--830. DOI: http://dx.doi.org/10.1145/1873951.1874089 Google ScholarDigital Library
Greg Little, Tessa A. Lau, Allen Cypher, James Lin, Eben M. Haber, and Eser Kandogan. 2007. Koala: Capture, Share, Automate, Personalize Business Processes on the Web. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '07). ACM, NY, NY, USA, 943--946. DOI: http://dx.doi.org/10.1145/1240624.1240767 Google ScholarDigital Library
D. C. Liu and J. Nocedal. 1989. On the Limited Memory BFGS Method for Large Scale Optimization. Math. Program. 45, 3 (Dec. 1989), 503--528. DOI: http://dx.doi.org/10.1007/BF01589116Google ScholarCross Ref
Yiwen Luo and Xiaoou Tang. 2008. Photo and Video Quality Evaluation: Focusing on the Subject. In Proceedings of the 10th European Conference on Computer Vision: Part III (ECCV '08). Springer-Verlag, Berlin, Heidelberg, 386--399. DOI: http://dx.doi.org/10.1007/978-3-540-88690-7_29 Google ScholarDigital Library
Luca Marchesotti, Florent Perronnin, Diane Larlus, and Gabriela Csurka. 2011. Assessing the Aesthetic Quality of Photographs Using Generic Image Descriptors. In Proceedings of the 2011 International Conference on Computer Vision (ICCV '11). IEEE Computer Society, Washington, DC, USA, 1784--1791. DOI: http://dx.doi.org/10.1109/ICCV.2011.6126444 Google ScholarDigital Library
J. Marks, B. Andalman, P. A. Beardsley, W. Freeman, S. Gibson, J. Hodgins, T. Kang, B. Mirtich, H. Pfister, W. Ruml, K. Ryall, J. Seims, and S. Shieber. 1997. Design Galleries: A General Approach to Setting Parameters for Computer Graphics and Animation. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '97). ACM Press/Addison-Wesley Publishing Co., NY, NY, USA, 389--400. DOI: http://dx.doi.org/10.1145/258734.258887 Google ScholarDigital Library
Peter O'Donovan, Aseem Agarwala, and Aaron Hertzmann. 2011. Color Compatibility from Large Datasets. ACM Trans. Graph. 30, 4, Article 63 (July 2011), 12 pages. DOI: http://dx.doi.org/10.1145/2010324.1964958 Google ScholarDigital Library
Peter O'Donovan, Aseem Agarwala, and Aaron Hertzmann. 2014. Learning Layouts for Single-Page Graphic Designs. IEEE Transactions on Visualization and Computer Graphics 20, 8 (Aug. 2014), 1200--1213. DOI:http://dx.doi.org/10.1109/TVCG.2014.48 Google ScholarDigital Library
Peter O'Donovan, Aseem Agarwala, and Aaron Hertzmann. 2015. DesignScape: Design with Interactive Layout Suggestions. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). ACM, NY, NY, USA, 1221--1224. DOI: http://dx.doi.org/10.1145/2702123.2702149 Google ScholarDigital Library
Katharina Reinecke, Tom Yeh, Luke Miratrix, Rahmatri Mardiko, Yuechen Zhao, Jenny Liu, and Krzysztof Z. Gajos. 2013. Predicting Users' First Impressions of Website Aesthetics with a Quantification of Perceived Visual Complexity and Colorfulness. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13). ACM, NY, NY, USA, 2049--2058. DOI: http://dx.doi.org/10.1145/2470654.2481281 Google ScholarDigital Library
Juliane Schäfer and Korbinian Strimmer. 2005. A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics. Statistical Applications in Genetics and Molecular Biology 4, 1 (November 2005). DOI: http://dx.doi.org/10.2202/1544--6115.1175Google ScholarCross Ref
Adrian Secord, Jingwan Lu, Adam Finkelstein, Manish Singh, and Andrew Nealen. 2011. Perceptual Models of Viewpoint Preference. ACM Trans. Graph. 30, 5, Article 109 (2011), 12 pages. DOI: http://dx.doi.org/10.1145/2019627.2019628 Google ScholarDigital Library
Burr Settles. 2009. Active Learning Literature Survey. Computer Sciences Technical Report 1648. University of Wisconsin-Madison, Madison, WI, USA.Google Scholar
L. Shapira, A. Shamir, and D. Cohen-Or. 2009. Image Appearance Exploration by Model-Based Navigation. Comput. Graph. Forum 28, 2 (2009), 629--638. DOI: http: //dx.doi.org/10.1111/j.1467--8659.2009.01403.xGoogle ScholarCross Ref
Jerry O. Talton, Daniel Gibson, Lingfeng Yang, Pat Hanrahan, and Vladlen Koltun. 2009. Exploratory Modeling with Collaborative Design Spaces. ACM Trans. Graph. 28, 5, Article 167 (2009), 10 pages. DOI: http://dx.doi.org/10.1145/1618452.1618513 Google ScholarDigital Library
Joshua B. Tenenbaum, Vin de Silva, and John C. Langford. 2000. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290, 5500 (2000), 2319--2323. DOI: http://dx.doi.org/10.1126/science.290.5500.2319Google ScholarCross Ref
Michael Terry and Elizabeth D. Mynatt. 2002. Side Views: Persistent, On-demand Previews for Open-ended Tasks. In Proceedings of the 15th Annual ACM Symposium on User Interface Software and Technology (UIST '02). ACM, NY, NY, USA, 71--80. DOI: http://dx.doi.org/10.1145/571985.571996 Google ScholarDigital Library

Index Terms

SelPh: Progressive Learning and Support of Manual Photo Color Enhancement
1. Human-centered computing

Recommendations

Sequential line search for efficient visual design optimization by crowds

Parameter tweaking is a common task in various design scenarios. For example, in color enhancement of photographs, designers tweak multiple parameters such as "brightness" and "contrast" to obtain the best visual impression. Adjusting one parameter is ...
Read More
Sequential gallery for interactive visual design optimization

Visual design tasks often involve tuning many design parameters. For example, color grading of a photograph involves many parameters, some of which non-expert users might be unfamiliar with. We propose a novel user-in-the-loop optimization method that ...
Read More
Crowd-powered parameter analysis for visual design exploration
UIST '14: Proceedings of the 27th annual ACM symposium on User interface software and technology

Parameter tweaking is one of the fundamental tasks in the editing of visual digital contents, such as correcting photo color or executing blendshape facial expression control. A problem with parameter tweaking is that it often requires much time and ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CHI '16: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems
May 2016
6108 pages
ISBN:9781450333627
DOI:10.1145/2858036
General Chairs:
Jofish Kaye
Yahoo
,
Allison Druin
University of Maryland / National Park Service
,
Program Chairs:
Cliff Lampe
University of Michigan
,
Dan Morris
Microsoft
,
Juan Pablo Hourcade
University of Iowa
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 May 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
design support
photo enhancement
self-reinforcement
Qualifiers
- research-article
Conference

Acceptance Rates
CHI '16 Paper Acceptance Rate565of2,435submissions,23%Overall Acceptance Rate6,199of26,314submissions,24%
More
Upcoming Conference
CHI PLAY '24

Sponsor:

sigchi

The Annual Symposium on Computer-Human Interaction in Play

October 14 - 17, 2024

Tampere , Finland
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 18
  Total Citations
  View Citations
- 518
  Total Downloads
- Downloads (Last 12 months)31
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

SelPh: Progressive Learning and Support of Manual Photo Color Enhancement

CHI '16: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

Sequential line search for efficient visual design optimization by crowds

Sequential gallery for interactive visual design optimization

Crowd-powered parameter analysis for visual design exploration