Strawberry Matcha Drink Photo with Doodle Art Overlay
Create a realistic handheld layered drink photo featuring whimsical hand-drawn doodles, perfect for vibrant social media food content.

Create a realistic shallow-depth-of-field street photo of a hand holding a clear plastic cup of iced layered drink in front of a small outdoor beverage stall. The main drink is a tall transparent cup with a domed flat lid, filled with three distinct layers: deep red berry syrup at the bottom, creamy white milk with red streaks in the middle, and chunky bright green matcha on top; the cup is held by a left hand in a muted gray-green sleeve. Keep the background softly blurred: an outdoor counter with syrup bottles on the right, a white cloth banner reading large playful black text "Eh!" and "Chocolat", plus small menu words "Chocolate", "Maring", and "Refreshing", a pink menu board behind it, and a white illustrated drink menu board on the left. Add playful hand-drawn doodles directly interacting with the drink and scene, as if drawn over the photo: one smiling strawberry character riding a red-and-white candy-striped ribbon swirl around the cup, one smiling green leaf character riding a small red-and-white rocket launching from the cup toward the upper right, three visible candy-striped ribbon arcs wrapping around or streaming from the drink, one dotted multicolor path curling down the left side of the hand, one small handwritten "Phew!" with three blue sweat drops near the thumb, one arch of small cookie or chocolate-chip doodles above the lid with a tiny square chocolate character at the top, four yellow star doodles around the banner, and three small doodled drink icons on the banner matching the labels: a brown chocolate drink, a yellow maring drink, and a green refreshing drink. Use cheerful black marker outlines, red, green, yellow, and blue accent colors, whimsical motion lines, and integrate the doodles so they mimic the drink's flowing layers and energetic fresh flavor. The scene should feel like a candid food-and-drink social media photo enhanced with cute animated doodles, preserving natural lighting, realistic hand anatomy, and the original composition centered on the cup. Customize the drink flavor as argdrink flavor: strawberry matcha milk, the main banner text as argbanner text: Eh! Chocolat, the handwritten exclamation as argexclamation text: Phew!, the doodle mascot pair as argdoodle mascots: smiling strawberry and green leaf, and the stall setting as argstall setting: outdoor beverage stall.
About this prompt
Create a realistic handheld layered drink photo featuring whimsical hand-drawn doodles, perfect for vibrant social media food content. Use it as a Concept Art starting point for GPT Image 2: keep the visual structure and style constraints intact, then swap in your own subject, brand, or scene.
Start by replacing drink flavor, banner text, exclamation text, and doodle mascots, then keep the camera, composition, and material cues in the same order. This makes the output easier to compare across variations.
How to use this prompt
- Copy the full prompt text from the page.
- Open your preferred AI image generator (e.g., Sora 2, Midjourney, DALL-E).
- Paste the prompt into the text field and generate.
- For variations, tweak details like the drink colors, doodle characters, or background text.
- Download the result and use it for your posts, ads, or menus.



