MADQA Benchmark Reveals AI Agents Use Brute-Force Search Instead of Strategic Reasoning

Saturday, March 14, 2026