Pool side in Tokyo Prompt. For Nano Banana Pro 2.
- Edward Frank Morris
- 6 days ago
- 3 min read
There are prompts… and then there are precision instruments.This is the latter.
What you’re looking at isn’t just a text-to-image instruction. It’s a tightly engineered framework designed to force realism, eliminate AI “guesswork,” and produce something that feels less like a render—and more like a moment that actually happened.
Let’s break down what this prompt does, why it works, and the kind of output it’s engineered to create.
1. Identity Lock: No AI “Creativity” Allowed
The very first instruction sets the tone:
Use the uploaded reference image… facial identity reproduced exactly 1:1
This is critical.
Most AI image outputs drift. They beautify. They “improve.” They subtly change bone structure or symmetry.This prompt shuts that down completely.
Result:You don’t get “a guy who looks like you.”You get you—unchanged, unfiltered, and recognisable.
That alone makes this prompt powerful for:
Personal branding
Influencer content
Corporate visuals
High-end social media presence
2. Controlled Imperfection: The Illusion of a Real Moment
Instead of a polished, posed shot, the prompt forces something much more interesting:
Handheld iPhone capture
Slight downward angle
Not looking at the camera
Relaxed posture
“Stolen shot” energy
This is deliberate.
Perfect photos look fake.Imperfect photos feel real.
By simulating:
Natural hand positioning
Casual framing
Off-guard body language
…the AI is pushed into generating something that feels accidentally captured.
Result:An image that bypasses the “AI look” and lands firmly in authentic lifestyle photography.
3. Environmental Precision: No Warped Reality Allowed
Infinity pools are one of the easiest things for AI to mess up.
This prompt doesn’t allow that.
It explicitly enforces:
Correct waterline alignment with the horizon
Straight tile geometry
Clean, undistorted reflections
And then anchors it in a very specific setting:
Tokyo at night.
Not just “a city.”Not just “lights in the background.”
But:
Depth
Atmospheric haze
A believable skyline
Result:The environment behaves like physics exists.Which, ironically, is what makes the image feel real.
4. Luxury Detail Injection: The Rolex Factor
This isn’t just a watch mention—it’s a precision constraint.
Rolex Daytona "Ghost" with black Oysterflex braceletBezel screws evenly spaced and proportionally accurate
That level of specificity matters.
AI often:
Warps luxury items
Misshapes proportions
Hallucinates details
This prompt forces product-level accuracy.
Result:The watch doesn’t look “inspired by Rolex.”It looks like it came out of a Rolex campaign.
Which subtly elevates the entire image.
5. Lighting That Breaks the Scene (In a Good Way)
Here’s where things get interesting.
The setting is Tokyo at night…But the lighting is bright natural sunlight.
That contrast is intentional.
It introduces:
Strong highlights on water
Subtle glare on the watch
Exposure imbalance between foreground and reflection
In other words: controlled visual tension.
Result:The image feels stylised—but not artificial.Cinematic—but still grounded.
It’s the kind of lighting you’d expect from a high-end editorial shoot, not a standard AI render.
6. The Intended Outcome: “This Was Taken on My Phone”
Everything in this prompt ladders up to one goal:
Make the image feel like it was casually taken… not generated.
And when it works, the reaction is predictable:
“Where was this taken?”
“Who shot this?”
“What hotel is that?”
Not:
“What AI tool did you use?”
That’s the difference.
7. Why This Prompt Works So Well (From an SEO + Content Perspective)
If you’re creating content around:
AI image generation
Hyper-realistic prompts
Personal branding visuals
Luxury lifestyle imagery
This kind of prompt hits multiple high-intent keywords naturally:
“realistic AI photo prompt”
“iPhone style AI photography”
“AI lifestyle image generation”
“cinematic AI prompt”
And more importantly…
It delivers on what people are actually searching for:
Images that don’t look like AI.
Final Thought
Most prompts tell AI what to create.This one tells AI what not to get wrong.
And that’s why it works.
It removes ambiguity. It removes creativity in the wrong places.And it forces realism where it matters.
The result?
A single image that looks like a memory. Not a generation.
Prompt