Google has released a new AI model, Gemini 2.5 Flash, but in a surprising turn of events, the company's own internal benchmarking shows that the model scores worse on certain safety tests than its predecessor, Gemini 2.0 Flash.
In a technical report published this week, Google disclosed that Gemini 2.5 Flash is more likely to generate text that violates its safety guidelines than Gemini 2.0 Flash. The disclosure has raised questions about the safety and reliability of the new model.
Gemini 2.5 Flash is a fast, lower-cost model in Google's Gemini family, designed to generate human-like responses to prompts across a wide range of tasks.
According to the technical report, Gemini 2.5 Flash regresses on two automated safety metrics: "text-to-text safety" and "image-to-text safety." These measure how frequently a model's responses violate Google's safety guidelines when it is given a text prompt or an image prompt, respectively. On both metrics, the report indicates, the new model produces guideline-violating output more often than Gemini 2.0 Flash.
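To make the metrics concrete: a benchmark of this kind typically runs the model over a fixed prompt set, has an automated judge flag each response as violating or not, and reports the violation rate. Google has not published its evaluation code, so the sketch below is a hypothetical illustration with invented data, not the actual methodology.

```python
# Hypothetical sketch of a violation-rate safety metric.
# Metric logic and all data here are illustrative assumptions,
# not taken from Google's technical report.

def violation_rate(verdicts):
    """Fraction of responses flagged as guideline-violating.

    `verdicts` holds one boolean per benchmark prompt; True means an
    automated classifier judged that response to violate the policy.
    """
    if not verdicts:
        return 0.0
    return sum(verdicts) / len(verdicts)

# Toy comparison of two model versions on the same prompt set:
old_model = [False, False, True, False, False]   # 1 of 5 flagged
new_model = [False, True, True, False, False]    # 2 of 5 flagged

regression = violation_rate(new_model) - violation_rate(old_model)
print(f"old: {violation_rate(old_model):.1%}, "
      f"new: {violation_rate(new_model):.1%}, "
      f"change: {regression:+.1%}")
```

On this toy data the newer model's violation rate rises from 20% to 40%; a positive change is a safety regression, which is the shape of the result the report describes.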
This news has caused a stir in the tech community, with many questioning the reliability of Google’s AI models. However, Google has assured its users that they are taking this matter seriously and are working towards improving the safety and performance of the Gemini 2.5 Flash model.
In a statement, Google’s AI team said, “We are committed to ensuring the safety and reliability of our AI models. While the Gemini 2.5 Flash model may have shown a decline in performance on certain safety tests, we are continuously working towards improving its performance and addressing any issues that may arise.”
The company has also stated that it has already made significant improvements to the model since the benchmarking was conducted and is confident that it will continue to improve.
Despite this setback, Google's AI work has made significant strides. Its models have been used in applications ranging from language translation to image recognition with impressive results, and the company's continual refinement of its models has made it a leader in the industry.
Moreover, Google’s transparency in disclosing the performance of their AI models is commendable. It shows their dedication to ensuring the safety and reliability of their technology, even if it means acknowledging its shortcomings.
In conclusion, while the news of Google’s new AI model scoring worse on certain safety tests may have raised concerns, it is important to remember that this is a part of the continuous learning process for AI technology. With Google’s commitment to improving the safety and performance of their AI models, we can be confident that they will continue to push the boundaries of what is possible in the world of artificial intelligence.

