Imagen 3 arrives in the Gemini API

Imagen 3 arrives in the Gemini API

Builders can now entry Imagen 3, Google’s state-of-the-art picture era mannequin, by the Gemini API. The mannequin will likely be initially accessible to paid users, with a rollout to the free tier coming quickly.

Imagen 3 excels in producing visually interesting, artifact-free photographs in all kinds of kinds from hyperrealistic photographs to impressionistic landscapes, summary compositions to anime characters. Improved immediate following makes it simple to transform nice concepts into high-quality photographs. General, Imagen 3 achieves state-of-the-art efficiency on the number of benchmarks. Imagen 3 achieves this whereas being priced at $0.03 per picture on the Gemini API, with management over side ratios, the variety of choices to generate, and more.

To assist fight misinformation and misattribution, all photographs generated by Imagen 3 embrace a non-visible digital SynthID watermark, figuring out them as AI-generated.


See Imagen 3 in Motion

The gallery under highlights Imagen 3’s capabilities throughout a variety of kinds.

Immediate: Group of individuals wanting blissful, pure gentle, 8k

Imagen 3 generated Hyperrealistic portrait of a person dressed in 1920s flapper fashion, vintage style, black and white photograph, elegant pose, 8k

Immediate: Hyperrealistic portrait of an individual wearing Twenties flapper style, classic model, black and white {photograph}, elegant pose, 8k

Imagen 3 generated image of a close-up of a vintage watch with realistic and detailed mechanism

Immediate: Think about a close-up of a classic watch. Generate a sensible depiction with an in depth mechanism

Imagen 3 generated image of an impressionistic landscape painting of a sunset over a field of sunflowers, vibrant colors, thick brushstrokes, inspired by Monet

Immediate: Impressionistic panorama portray of a sundown over a discipline of sunflowers, vibrant colours, thick brushstrokes, impressed by Monet

Imagen 3 generated image of A surreal dreamscape featuring a giant tortoise with a lush forest growing on its back, floating through a starry sky, glowing mushrooms, bioluminescent plants, ethereal atmosphere

Immediate: A surreal dreamscape that includes a large tortoise with a lush forest rising on its again, floating by a starry sky, glowing mushrooms, bioluminescent vegetation, ethereal ambiance

Imagen 3 generated lifestyle image of freshly roasted coffee beans spilling out of a burlap sack onto a rustic wooden table next to a up of coffee with 'Awaken Your Senses' written on the cup in cursive

Immediate: Life-style picture of freshly roasted espresso beans spilling out of a burlap sack onto a country wood desk, steam rising from a close-by cup of espresso, ‘Awaken Your Senses’ is written on the cup in cursive, heat and welcoming ambiance, morning daylight, product images

Imagen 3 generated hyperrealistic portrait of a woman with piercing blue eyes, laughing, freckles, dramatic lighting, detailed skin texture, 8k

Immediate: Hyperrealistic portrait of a girl with piercing blue eyes, laughing, freckles, dramatic lighting, detailed pores and skin texture, 8k

Imagen 3 generated panoramic view of a majestic mountain range at dawn

Immediate: A panoramic view of an imposing mountain vary at daybreak.

Imagen 3 generated scene from a game where the player needs to find a specific object by looking into drawers in a messy desk

Immediate: Present a scene from a recreation the place the participant must discover a particular object by wanting into drawers in a messy desk.

Imagen 3 generated painted cityscape in the style of Van Gogh

Immediate: A cityscape painted within the model of Van Gogh, with swirling brushstrokes and vibrant colours.

Get Began with Imagen 3 within the Gemini API

This Python code snippet demonstrates tips on how to generate a picture with Imagen 3 utilizing the Gemini API.

from google import genai
from google.genai import varieties
from PIL import Picture
from io import BytesIO

shopper = genai.Consumer(api_key='GEMINI_API_KEY')

response = shopper.fashions.generate_images(
    mannequin='imagen-3.0-generate-002',
    immediate='a portrait of a sheepadoodle carrying cape',
    config=varieties.GenerateImagesConfig(
        number_of_images=1,
    )
)
for generated_image in response.generated_images:
  picture = Picture.open(BytesIO(generated_image.picture.image_bytes))
  picture.present()

You may discover extra prompting recommendation and picture kinds within the Gemini API developer docs, with additional particulars obtainable on scores, methodology, and efficiency enchancment in Appendix D of our up to date technical report.

We’re excited to take step one of increasing availability of our generative media fashions into the Gemini API and plan to make extra obtainable within the close to future in order that builders can bridge generative media and language fashions collectively.

Leave a Reply