Logic and correctness errors (1.75x), code quality and maintainability (1.64x), security (1.57) and performance (1.42x) all saw higher than average code errors, with the report criticizing AI for ...
The code generated by large language models (LLMs) has improved some over time — with more modern LLMs producing code that has a greater chance of compiling — but at the same time, it's stagnating in ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results