Google Unveils Gemini 2.0, Ushering in the 'Agent Era' with Multimodal AI Expansion
December 22, 2024In December 2023, Google rebranded its chatbot from Bard to Gemini, marking a significant shift in its AI offerings.
Gemini chat has now been merged with Duet AI and is embedded in various Google products, including Android Assistant, Chrome, and Google Photos.
The Gemini Assistant has largely replaced the previous assistant on Android and has also been integrated into Google Docs and developer tools.
With the release of Gemini 2.0 in December 2024, CEO Sundar Pichai announced the beginning of the 'Agent Era' for AI.
Gemini 2.0 Flash Experimental was launched as a high-performance multimodal model that supports Google search and code generation functions.
This multimodal model is capable of processing text, images, video, audio, and computer code, similar to OpenAI's GPT-4o.
New applications associated with Gemini's launch include Project Astra for visual understanding, Project Mariner for multimodal AI, and NotebookLM for research applications.
There are over 50,000 variations of Gemini available on Hugging Face, catering to multiple languages and applications.
The Gemini models consist of various versions, including Gemini 1.5 Flash, Gemini 2.0 Flash Experimental, and Gemini 2.0 Experimental Advanced.
Gemini 1.5 introduced significant improvements, such as mixture of experts technology and a one million token context window.
Premium and developer-focused products in the Gemini ecosystem include Gemini Advanced, Google Cloud, AI Studio, Vertex AI, and Google One, each offering diverse AI functionalities.
Google's advancements in AI are rooted in its AI lab, DeepMind, which has produced notable models like LaMDA, PaLM, and Gato prior to Gemini.
Summary based on 1 source
Get a daily email with more AI stories
Source
Tom's Guide • Dec 22, 2024
Google Gemini — everything you need to know