Creating images with ChatGPT
Generate and refine images using clear, descriptive prompts.
ChatGPT can generate original images from plain-language prompts. You can iterate quickly—request variations, adjust composition or size, or explore new visual directions—and produce production-ready assets in minutes.
This makes it easier to explore concepts, communicate ideas visually, and adapt existing assets for different audiences, formats, or channels.
A good image prompt does not need to be long. In most cases, 1–3 clear sentences are enough. The goal is to help ChatGPT understand what the image is, how it should feel, and what it needs to accomplish.
In practice, this means grounding the prompt in a few key details: the purpose of the image, the main subject, what is happening, where it takes place, and the desired visual style. If framing, lighting, or specific constraints matter, include those too.
Clarity is more effective than clever phrasing—especially for details like layout, texture, materials, or light. For example, “soft natural light from a window on the left” will usually be more reliable than something vague like “beautiful lighting.”
Constraints are especially useful when something needs to stay fixed. If you do not want extra text, logos, or visual changes, state that directly. When editing an existing image, be explicit about what should change and what should stay the same. A prompt like “Change only X. Keep everything else exactly the same” is often the clearest way to guide a precise edit.
The best way to improve an image is usually through small, targeted revisions. Start by getting the core idea right, then adjust one element at a time. Direct, specific feedback is easier to follow than broad reactions, and repeating the most important details can help prevent the image from drifting as you refine it.
Examples of useful, actionable adjustments:
- “Make it brighter,” “tone down the colors,” “simplify the background”
- “Keep the same composition, but make the style more modern / softer / more playful”
Step-by-step revisions help maintain consistency as you refine the image. You can also edit specific areas and provide targeted instructions.
You can upload multiple images to guide generation or editing, but a small set is usually easier to manage than a large one. Refer to each image by order and explain how they relate to each other.
For example: “Image 1 is a photo of my desk setup. Image 2 is a style reference. Apply image 2’s clean, minimal illustration style to image 1, while keeping the same layout and objects.”
When combining elements, use clear spatial language—such as left, right, foreground, and background—to describe relationships.
Text works best when instructions are very specific:
- Put text in quotes or ALL CAPS
- Specify font style, size, color, placement
- Keep text short
- For brand names/uncommon words, spell out letter-by-letter (e.g., “S-T-R-I-P-E”)
For example: "Add the headline “WEEKLY PLAN” in bold sans-serif, white, centered at the top, 72pt. No other text."
Infographics are useful for explainers, posters, labeled diagrams, timelines, and “visual wiki” assets. For dense layouts or heavy in-image text, emphasize “sharp text rendering,” and consider polishing in design tools if needed.
- Be mindful with likenesses: If generating images of real people (including yourself or others), use a reference photo for accuracy and make sure you have permission to use their likeness.
- Generic over specific: When in doubt, request “generic” or “ownable” versions of a design rather than imitating a specific brand, product, or artwork.
- Attribution optional: You aren’t required to credit OpenAI when using generated images, but you may do so if it helps clarify how the asset was created.
- Policy compliance: All image use should follow your organization’s guidelines and OpenAI’s usage policies.


