News

Microsoft's Debug-Gym is a Python-driven framework aimed at assessing capabilities of AI agents in handling practical ...
AI models from OpenAI, Anthropic, and other top AI labs are increasingly being used to assist with programming tasks. Google ...
AI debugging tools are completely changing the way software is built and maintained. These tools reduce time spent on error ...
Microsoft Research has introduced debug-gym, a novel environment designed to train AI coding tools in the complex art of ...
Microsoft's own Phi-2 model achieved 15.8% accuracy. The study also tested whether providing access to Python's built-in ...
Microsoft study reveals that leading AI models are struggling to successfully debug software, with success rates failing to surpass 50%.