AI Image Aspect Ratios Explained: Which to Use and When
Aspect ratio is one of those settings that feels like a technical detail but actually shapes the entire image. A portrait (9:16) and a landscape (16:9) from the same prompt don't just differ in dimensions, they produce different compositions, different subject framing, and different visual hierarchies. The model allocates space differently when it has a tall canvas versus a wide one.
This guide covers what the standard ratios actually do to your images, which ratio fits which platform, and the exact syntax for specifying ratios in the tools you're likely using.
Why aspect ratio matters beyond just dimensions
When you change the aspect ratio of an image prompt, you're not just cropping the same content into a different shape. The model generates a completely new image optimized for that canvas orientation.
Give Midjourney the same prompt at 1:1 and 9:16. The square version will have a centered composition that fills the frame evenly. The portrait version will have a vertical composition, a taller subject, more sky or headspace, a different sense of depth. Neither is wrong; they're genuinely different images.
This matters because generating at the wrong ratio for your intended use case means you'll either have awkward composition or you'll be cropping and stretching after the fact, both of which waste the quality of the original generation.
The standard ratios and what they're good for
1:1 (Square)
The equal-sided square is the original social media format and it's still widely used. Square compositions tend to be centered and balanced, the eye lands in the middle of the frame and there's no strong left-right or top-bottom pull.
Best uses: Instagram feed posts, product catalog images, profile photos, podcast cover art, app icons. Square is the format that travels best across contexts because it doesn't impose a strong orientation on the viewer.
When to avoid it: cinematic content, any scene with a strong horizontal element (a landscape, a running figure), video thumbnails. Square will cut too much from these compositions.
In most tools, 1:1 is the default. You get it unless you specify otherwise.
16:9 (Widescreen landscape)
16:9 is the format of cinema, television, and most desktop screen content. The wide horizontal field lets you include environmental context alongside a subject, a person in a landscape, a product in a room, an architectural exterior. Wide shots and establishing shots naturally fit this ratio.
Best uses: YouTube thumbnails, website hero images, presentation slides, blog header images, desktop wallpapers, Twitter/X card images, LinkedIn banners.
What it does to composition: horizontal space encourages the eye to scan left-right. Subjects placed to one side with negative space on the other read well. Lone centered subjects can feel lost in the wide frame unless the environment around them is interesting.
16:9 is probably the single most versatile ratio for web and digital content because it fits standard screen formats without any adjustment.
9:16 (Vertical / portrait)
The inverse of 16:9, this is the smartphone screen format and it has become dominant for social media content. A 9:16 image fills a phone screen completely, which is why it's the native format for Instagram Stories, TikTok, YouTube Shorts, and Pinterest.
Best uses: Instagram Stories and Reels, TikTok content, YouTube Shorts thumbnails, Pinterest pins, mobile-first ads, vertical video cover frames.
What it does to composition: vertical space creates height and draws the eye upward. People and standing subjects fill the frame naturally. Landscape content gets compressed and often loses its key horizontal elements.
For any content that will be primarily viewed on phones, 9:16 is almost always the right choice. Generating at 16:9 and cropping to 9:16 loses significant composition quality, the left and right sides that were meant to be in frame get cut.
4:3 (Classic TV / slight landscape)
4:3 was the standard television and computer monitor format before widescreen became universal. It's a moderate landscape ratio, not as cinematic as 16:9, not as tight as square.
Best uses: some presentation formats (older PowerPoint templates), Facebook feed images (which display closer to 4:3 than 16:9), certain print formats, tablet-optimized content.
In practice, 4:3 gets used less than it used to. Most digital content has moved to either 16:9 or square. It's still relevant for certain print layouts and for matching older presentation templates, but if you're not targeting a specific format that requires it, you can generally skip it.
3:2 (35mm film / photography standard)
3:2 matches the aspect ratio of 35mm film and the standard output of most DSLR cameras. If you're prompting for photorealistic content and want it to feel like it was shot with a real camera, 3:2 is the ratio that fits. Photographers immediately recognize it as a natural format.
Best uses: editorial photography, e-commerce product photography, blog post feature images (many content management systems default to this), print ads, anything meant to feel photographic.
The slight extra height compared to 16:9 makes it comfortable for portrait subjects too, a person photographed at 3:2 in a slight vertical crop has appropriate headroom and space below without feeling compressed like they're crammed into a square.
For Flux and Stable Diffusion in particular, 3:2 often produces the most natural-looking photorealistic output because many training images were at this ratio.
21:9 (Ultrawide / cinematic)
21:9 is the ratio of ultrawide monitors and certain cinematic film formats (anamorphic lenses produce a similar ratio). It's extremely wide, the letterbox format you see when a movie has black bars at the top and bottom on a standard TV is often displaying a 21:9 or similar aspect ratio.
Best uses: desktop wallpapers for ultrawide monitors, cinematic stills meant to feel like film frames, social media cover images (Facebook covers are close to this ratio), and deliberate artistic choices where the extreme width creates a dramatic panoramic effect.
When to avoid: almost all standard web and social contexts. 21:9 gets cropped or awkwardly letterboxed in most content management systems and social media platforms. Use it only when you're specifically targeting ultrawide formats.
Per-platform quick reference
Instagram:
- Feed post: 1:1 (square) or 4:5 (slight portrait)
- Stories and Reels: 9:16
- Carousel: 1:1
- Profile photo: 1:1 (displayed as a circle, so keep subject centered)
TikTok:
- All content: 9:16. TikTok is a vertical-native platform and 9:16 content fills the screen without black bars.
YouTube:
- Video thumbnail: 16:9
- Channel art banner: approximately 16:4 (very wide), usually cropped from a wider source
- YouTube Shorts: 9:16
Pinterest:
- Standard pin: 2:3 (taller than it is wide)
- Square pin: 1:1
- Long pin: up to 1:2.1 (very tall, uncommon)
Twitter / X:
- In-feed image: 16:9 (1200x675px is the target size)
- Card image: 2:1 (wider than standard)
LinkedIn:
- Feed post image: 1.91:1 (close to 16:9)
- Article header: 16:9
- Company page banner: approximately 4:1
Facebook:
- Feed post: 1:1 or 1.91:1
- Stories: 9:16
- Cover photo: approximately 21:9 (2.7:1 specifically)
- Events image: 16:9
Website / blog:
- Hero banner: 16:9 or wider (up to 21:9 for full-width headers)
- Blog post featured image: 16:9 or 3:2 depending on your theme
- Open Graph image (social share preview): 1.91:1
Tool-specific syntax for setting ratios
Midjourney
Use --ar width:height at the end of your prompt.
A misty mountain lake at dawn, photorealistic, golden hour --ar 16:9
Common ratio flags:
--ar 1:1, square--ar 16:9, widescreen landscape--ar 9:16, vertical portrait--ar 3:2, photography standard--ar 4:5, Instagram portrait--ar 21:9, cinematic ultrawide
Midjourney also accepts non-standard ratios. --ar 4:7 or --ar 7:3 work fine. The model will generate at the nearest supported resolution for your subscription tier.
DALL-E 3 via API
DALL-E 3 uses named size parameters rather than ratio notation. The available sizes depend on whether you're using DALL-E 3 standard or HD quality:
size: "1024x1024" // 1:1 square
size: "1792x1024" // landscape (approximately 16:9)
size: "1024x1792" // portrait (approximately 9:16)
DALL-E 3 doesn't support arbitrary ratios through the API, you're limited to these three options. If you need a different ratio, you'll either need to generate at the closest available size and crop, or generate at native size and use an image editor.
Via ChatGPT, you can request aspect ratios in natural language ("generate this as a landscape image," "make it portrait orientation") and the system will select the appropriate size.
Flux Pro / Dev
Flux accepts explicit pixel dimensions. The ratio is determined by your width and height values:
width: 1344, height: 768 # 16:9 landscape
width: 768, height: 1344 # 9:16 portrait
width: 1024, height: 1024 # 1:1 square
width: 1152, height: 768 # 3:2 photography
width: 1536, height: 640 # 21:9 cinematic
Flux 1.1 Pro works best at dimensions that are multiples of 64 or 128. Total pixel counts above 1 megapixel (1024x1024 or equivalent in other ratios) produce better detail.
In the Flux interface on fal.ai and similar platforms, a dropdown typically shows common aspect ratios. Behind the scenes, these map to pixel dimension presets like those above.
Stable Diffusion SDXL
SDXL was trained at specific aspect ratios with corresponding optimal resolutions. Generating at these "native" resolutions produces better quality than arbitrary dimensions:
- 1:1, 1024x1024
- 3:2 landscape, 1216x832
- 9:16 portrait, 832x1216 or 880x1168
- 16:9 landscape, 1344x768
- 4:3 landscape, 1152x896
- 21:9, 1536x640
Generating at ratios that don't match SDXL's training distributions (very extreme ratios, or non-standard resolutions) produces worse results. Stick to the above dimensions for quality output.
In ComfyUI and Automatic1111, these appear as preset resolution options in the image generation settings. In hosted interfaces like Clipdrop, the aspect ratio selector maps to these underlying resolutions automatically.
Ideogram
Ideogram provides a ratio selector in its interface rather than requiring parameter syntax. The UI options include 1:1, 16:9, 9:16, 3:2, 2:3, and a few others. For the API, you pass aspect ratio as a string: "aspect_ratio": "ASPECT_16_9" or "aspect_ratio": "ASPECT_9_16".
Midjourney Niji (anime mode)
Same --ar syntax as standard Midjourney. The Niji mode often produces better portrait-orientation anime art at 2:3 (--ar 2:3) than at 9:16, because 2:3 matches more of the training data for manga and anime page formats.
Composition tips by ratio
Knowing the ratio is step one. Composing for the ratio is what makes the image actually work.
For square (1:1): Center your main subject, or use a tight crop with subject symmetry. Avoid long horizontal elements that will be cut off. Rule-of-thirds compositions work well here.
For landscape (16:9, 3:2): Use the horizontal space intentionally. Place your subject slightly off-center with meaningful negative space or environmental context on the wider side. Leading lines (roads, paths, horizons) work naturally in landscape ratios.
For portrait (9:16, 4:5): Think vertically. A standing person, a tall building, a falling subject, a looking-up perspective, all of these use the vertical canvas naturally. For product photography, portrait orientation puts the product first and lets you add lifestyle context above or below.
For ultrawide (21:9): Lead with panoramic environmental content. A single subject in an ultrawide frame often looks isolated in a bad way. The format works best for landscapes, cityscapes, and group compositions where horizontal spread is meaningful.
The 4:5 ratio: Instagram's sweet spot
One ratio that deserves separate attention is 4:5, the vertical rectangular format that Instagram feed posts can use. It's slightly taller than square (1.25:1 ratio), which gives portrait subjects better framing without going full 9:16 Stories format.
In Midjourney: --ar 4:5
In Stable Diffusion SDXL: 896x1152 or 1040x1296 pixels
4:5 takes up more screen real estate in the Instagram feed than a square post because of how Instagram's grid and scroll view work. For brand accounts trying to maximize feed presence, 4:5 is worth using consistently.
When generators don't behave well at certain ratios
Extreme aspect ratios, very wide like 21:9, or very narrow like 1:3, sometimes produce odd composition decisions from AI generators. The model wasn't trained heavily on extreme-ratio images, so it doesn't always know what to do with the space.
Common issues at extreme ratios:
- Subjects appear stretched or distorted
- Important compositional elements get pushed to the edges
- The model repeats patterns to fill the horizontal space (tiling artifacts)
- Background detail quality drops toward the far edges of very wide images
If you're consistently getting bad results at an unusual ratio, try generating at a closer-to-standard ratio and cropping, or using a tool like Magnific or Adobe Firefly's generative fill to extend the canvas after generation.
For the most important practical ratios, 1:1, 9:16, 16:9, and 3:2, all major generators handle these natively and well. These four cover the vast majority of real use cases.
For more on writing the prompts themselves once you've chosen your ratio, the AI image prompting guide covers the full structure of effective prompts. And if you're generating images for video, the video prompting guide covers the frame ratios relevant to each video platform.