Google has launched a new version of its Gemini AI application | The system analyzes long texts and videos in greater depth to fulfill user requests

Tech giant Google has launched Gemini 1.5 Pro, an artificial intelligence (AI) model that can process large amounts of information at once, including one hour of video, 11 hours of audio, 30,000 lines of code or more than 700,000 words. For now, a select group of developers has access on a trial basis; the model will be rolled out more widely later.

“A few years ago, memorizing or getting the context of hundreds of words was very difficult,” Oriol Vinyals, vice president of research at Google DeepMind and co-lead of Gemini, told the press. To demonstrate the capabilities of Gemini 1.5 Pro, Vinyals showed a video in which the model analyzed 402 pages of transcripts from the Apollo 11 mission to the moon and identified three funny quotes, which suggests that the model is beginning to analyze the meaning of sentences.

Once it is publicly available, users will be able to request the creation of images and graphics. In the video, a user shows Gemini 1.5 Pro a very simple drawing of a shoe stepping on the ground and asks: “What moment is this? Answer me with a text quote.” The model responded with the famous quote from astronaut Neil Armstrong: “That's one small step for man.”

In terms of programming, the company's statement notes that Gemini 1.5 Pro “can perform relevant troubleshooting tasks on long blocks of code. When presented with a prompt containing more than 100,000 lines of code, it can suggest useful modifications and provide explanations of how different parts of the code work.” “In some ways, it works very similarly to our brain,” Vinyals explained.


Sundar Pichai, CEO of Google and Alphabet, said Gemini 1.5 Pro will help software developers create more useful models and applications: “We are excited to offer a limited preview of this beta feature to developers and enterprise customers.”

Regarding “hallucinations” (well-organized but incorrect responses), Vinyals points out that they remain a problem for AI in general and that addressing them is still a work in progress.

Last week, Google changed the name of its artificial intelligence (AI) chatbot from Bard to Gemini. The company announced that the technology will be available in the new Gemini application for Android and through the Google application on iOS.

Gemini 1.5 Pro is a mid-size multimodal model. Its main novelty is that it can analyze very long documents, from comparing contract details to summarizing and analyzing topics and opinions in analyst reports, research studies or even a series of books.

By analyzing and comparing content across hours of video, the model can find specific details in sports footage or extract detailed information from video meeting summaries to support precise questions and answers. A chatbot built on it can carry on long conversations without forgetting details, even during complex tasks or across many follow-up interactions. It also enables highly personalized experiences by incorporating relevant user information.

Lovell Loxley

