Google Has Rolled Out Some New Features With AI Capabilities

Google has expanded its Gemini AI models by introducing three new experimental versions: Gemini 1.5 Flash-8B, Gemini 1.5 Pro, and Gemini 1.5 Flash. These updates aim to provide developers with enhanced tools for handling complex tasks, from multimodal inputs to long-context summarisation.

The Gemini 1.5 Flash-8B model, which contains eight billion parameters, is designed for tasks that require processing large volumes of data or lengthy content. This model is particularly suitable for summarising long documents or handling multimedia content.

Gemini 1.5 Pro has received updates to better handle tasks involving mathematics, complex prompts, and coding. These improvements make it a more powerful tool for developers working on intricate projects. The new version of Flash also brings performance improvements on internal benchmarks, making it more efficient for a variety of applications.

 

What Are The Main Features Of The Imagen 3 Model?

 

On another note, Google also announced its latest image generation model, Imagen 3, which is designed to create high-quality images from text descriptions. Imagen 3 allows users to generate images in different styles, from photorealistic pieces to abstract art, which brings more creative freedom than the previous Imagen 2.

The model also has safety measures to prevent the generation of inappropriate content, such as realistic images of identifiable individuals or minors, as well as violent or explicit scenes. These features are for users to explore creative possibilities while still being ethical.

Imagen 3 is currently being rolled out to Gemini Advanced, Business, and Enterprise users, with the model being made available across different languages. This is perfect for more users to be able to take advantage of its advanced image generation tools.
 

 

How Is Imagen 3 Improving Image Generation And Customisation?

 

Google’s latest in AI-driven image generationbringing users the ability to create detailed and creative images from simple text prompts. This model supports different image styles, and users have control over the creative process involved.

The fact that users can provide feedback on generated images, which allows them to refine the results until they match their expectations makes this useful, designed to make image generation more intuitive and user-friendly. While Imagen 3 has advanced capabilities, as noted, it also has restrictions on generating certain types of content, making it an incredible tool for all users.
 

How Does Gemini AI Now Support Customised Assistance With Gems?

 

Gemini AI has also introduced a new feature called Gems. On the announcement, Google shared, “You can customise Gems to act as an expert on topics or refine them toward your specific goals. Simply write instructions for your Gem, give it a name, and then chat with it whenever you want.

With Gems, you can create a team of experts to help you think through a challenging project, brainstorm ideas for an upcoming event, or write the perfect caption for a social media post. Your Gem can also remember a detailed set of instructions to help you save time on tedious, repetitive or difficult tasks.”

To help users get started, Google offers premade Gems for various scenarios, including learning coaches, brainstormers, career guides, writing editors, and coding partners. These Gems are now available on both desktop and mobile devices for Gemini Advanced, Business, and Enterprise users in over 150 countries.