WebExplorer: Training Web Agents with Self-Generated Reward Data
Researchers at the Hong Kong University of Science and Technology, MiniMax, and the University of Waterloo have developed WebExplorer, a method for training web agents without human-annotated data. The method addresses a critical challenge in AI research: the lack of high-quality, complex...
