PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM 🤖
Seungho Baek
Heather Hyerin Im
Jiseung Ryu
Juhyeong Park
Takyeon Lee
system development & design, qualitative research
- ICML 2023 Workshop : https://doi.org/10.48550/arXiv.2307.08985
Approach
PromptCrafter explores a mixed-initiative workflow where users and an LLM collaboratively construct prompts through small, interpretable steps.
- Understanding Cognitive Bottlenecks A formative study with users of varying expertise revealed that people struggle to identify which textual elements drive unexpected outputs and often narrow their exploration prematurely.
- Decomposing Intent Through Dialogue Instead of editing a monolithic prompt, the system breaks the process into a sequence of clarifying questions generated by an LLM. Users respond, compare visual outcomes, and refine their conceptual direction gradually-mirroring how people naturally iterate on ideas.
- Making Creative Reasoning Visible Each interaction is stored as a visual history that users can revisit or branch. This transforms prompt engineering into a reflective workflow where the evolution of thought, interpretation, and AI response becomes part of the creative process.