Skip to main content
Cornell University
We gratefully acknowledge support from
the Simons Foundation and member institutions.
arxiv logo > cs > arXiv:2209.11711

Help | Advanced Search

Computer Science > Human-Computer Interaction

(cs)
[Submitted on 23 Sep 2022 (v1), last revised 2 Dec 2022 (this version, v2)]

Title:Best Prompts for Text-to-Image Models and How to Find Them

Authors:Nikita Pavlichenko, Fedor Zhdanov, Dmitry Ustalov
Download PDF
Abstract: Recent progress in generative models, especially in text-guided diffusion models, has enabled the production of aesthetically-pleasing imagery resembling the works of professional human artists. However, one has to carefully compose the textual description, called the prompt, and augment it with a set of clarifying keywords. Since aesthetics are challenging to evaluate computationally, human feedback is needed to determine the optimal prompt formulation and keyword combination. In this paper, we present a human-in-the-loop approach to learning the most useful combination of prompt keywords using a genetic algorithm. We also show how such an approach can improve the aesthetic appeal of images depicting the same descriptions.
Comments: 12 pages (4 main pages), 4 figures, 4 tables, accepted at NeurIPS 2022 Workshop on Human Evaluation of Generative Models (HEGM)
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
ACM classes: H.5.2; H.3.3
Cite as: arXiv:2209.11711 [cs.HC]
  (or arXiv:2209.11711v2 [cs.HC] for this version)
  https://doi.org/10.48550/arXiv.2209.11711
arXiv-issued DOI via DataCite

Submission history

From: Dmitry Ustalov [view email]
[v1] Fri, 23 Sep 2022 16:39:13 UTC (5,143 KB)
[v2] Fri, 2 Dec 2022 21:44:52 UTC (5,143 KB)
Full-text links:

Download:

  • PDF
  • Other formats
Current browse context:
cs.HC
< prev   |   next >
new | recent | 2209
Change to browse by:
cs
cs.CL
cs.CV

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export bibtex citation Loading...

Bookmark

BibSonomy logo Mendeley logo Reddit logo ScienceWISE logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack