Google’s much-anticipated Gemini is poised to redefine the boundaries of what AI can achieve. Touted as the next generation of AI with multimodal capabilities, Gemini has generated significant buzz for its ability to process diverse types of data, including text, images, and even entire websites, all based on a sketch or written description.
Google just delayed the rollout of Gemini until January.
Reports claim Gemini reached capabilities of GPT-4 in some areas, but the delay is due to defects in ‘non-English’ languages.
This sucks, but in reality, a month delay is likely meaningless (for Google) long-term. pic.twitter.com/46qT2UDnkg
— Rowan Cheung (@rowancheung) December 4, 2023
Originally slated for a grand unveiling in New York, Washington, and California next week, Google’s Gemini has hit a temporary roadblock, pushing its launch to early 2024. Sources reveal that concerns over the AI’s responsiveness to non-English prompts and inquiries prompted the rescheduling of these previously undisclosed launch events. Despite this setback, industry insiders eagerly await the arrival of this complex AI model, which promises to outperform its competitors, including OpenAI’s GPT-4.
Gemini’s strength lies in its unparalleled computing power, surpassing even that of GPT-4. The Information, citing anonymous sources, reports that Gemini’s capacity to understand and generate text and images is a leap forward in the AI landscape. With the ability to create images based on verbal prompts, Gemini opens new possibilities for human-AI interaction, transcending the limitations of conventional text-based communication.
Sissie Hsiao, Google’s VP and manager of Bard and Google Assistant, has expressed her awe at Gemini’s capabilities, citing instances where the AI was tasked with visualizing complex instructions. “Like, I’m trying to bake a cake, draw me 3 pictures of the steps to how to ice a three-layer cake, and Gemini will actually create those images,” said Hsiao. What sets Gemini apart is not just its ability to generate images but its knack for producing entirely original content, free from internet influences.
While Google already boasts its own generative AI model, Bard, Gemini’s delayed launch has allowed ChatGPT to maintain its consumer awareness lead. However, analysts predict a significant shift when Gemini finally steps onto the AI stage, offering a robust alternative and potentially surpassing existing models.