Google Unveils Gemini 1.5 Flash: The New Addition To The Series

On Tuesday, May 14, Google announced new additions to its Gemini series of AI models at its annual developer conference, Google I/O.

The company also unveiled an improved Gemini 1.5 Pro, which, according to CEO Sundar Pichai, “offers the longest context window of any foundational model yet.”

The launch comes a day after OpenAI announced its latest artificial intelligence model, GPT-4o.

Google used its annual developer conference to showcase what the company is calling its lightest and most efficient AI models.

The Gemini 1.5 Flash is the newest addition to the Gemini series. Google said in a blog post that the new model can quickly summarize conversations, caption images and videos and extract data from large documents and tables.

According to Demis Hassabis, CEO of Google DeepMind, the company heard from developers that they needed something faster and even more cost-effective.

The unveiling comes as tech companies increasingly refocus their product development and rollouts around generative AI, which is of particular importance to Google because the new tools give consumers more advanced and creative ways to access online information compared with traditional web search.

Meanwhile, OpenAI on Monday launched a new AI model and desktop version of ChatGPT, along with a new user interface. The new model, called GPT-4o, is twice as fast as GPT-4 Turbo and half the cost, the company said.

Google also presented its improved Gemini 1.5 Pro model, which can make sense of multiple large documents (1,500 pages in total) or summarize 100 emails, according to a vice president working on Gemini.

Gemini 1.5 Pro will soon be able to handle an hour of video content, or codebases with more than 30,000 lines, said Sissie Hsiao, a vice president at Google and the general manager for Gemini experiences.

“You can quickly get answers and insights about dense documents, like figuring out the details of the pet policy in your rental agreement or comparing key arguments of multiple long research papers,” Hsiao added.

OpenAI’s latest upgrade brings with it improved quality and speed and allows ChatGPT to handle 50 different languages. It will also be available via OpenAI’s application programming interface, or API, allowing developers to begin building applications using the new model immediately, executives said.

Google says Gemini 1.5 Pro supports 35 languages and has a 2 million token context window, a measure of how much information the model can process at once. The new model has improved local reasoning, planning and image understanding, company executives said.

“It offers the longest context window of any foundational model yet,” Alphabet CEO Sundar Pichai said in the press briefing. At the event, he gave an example of a parent asking Gemini to summarize all recent emails from their child’s school.

Gemini 1.5 Pro will initially be available for testing in Workspace Labs. Gemini 1.5 Flash will be available for testing and in Vertex AI, which is Google’s machine learning platform that lets developers train and deploy AI applications.
