SWE-BENCH
SWE-bench is a leaderboard that evaluates AI models on their ability to solve real-world coding tasks drawn from GitHub issues. It is a collaboration between Princeton and Stanford Universities.
Available to all SRX running 12.3X48+
Available to all MX running 20.2R1+
Available to all vSRX running 20.3R1+