Top AI coding tools make mistakes one in four times | Waterloo Data and Artificial Intelligence Institute

Benchmarking research shows leading AI models still struggle to reliably produce structured outputs used in software development

Media Relations

New research from the University of Waterloo shows that artificial intelligence (AI) still struggles with some basic software development tasks, raising questions about how reliably AI systems can assist developers.

As Large Language Models (LLMs) are increasingly incorporated into software development, developers have struggled to ensure that AI-generated responses are accurate, consistent, and easy to integrate into larger development workflows.

To read the full article, click here!