New research looks at how leading AI models hold up on real white-collar tasks drawn from consulting, investment banking, and law. Most models failed.