Найти в Дзене
Crynet.io

ScaleAI and AI Risks just dropped the Remote Labor Index (RLI) – and the results are pretty eye-opening

ScaleAI and AI Risks just dropped the Remote Labor Index (RLI) – and the results are pretty eye-opening! 👀 So, how well do today’s AI agents tackle actual remote work? Spoiler alert: not great, folks. 😅 The top performer, Manus, can only automate a mere 2.5% of tasks. Yup, almost everything is still on us humans! 🙋‍♂️💻 But hey, there’s a silver lining – Claude Sonnet 4.5, GPT-5, Gemini 2.5 Pro, and others are slowly but surely raising the bar. 📈 Bottom line: full automation is still a ways off, but we’re seeing gradual improvements. It’s not about leaps; it’s all about those baby steps. Real work? Still our jam for now! 💪✨ 📊 Check out the leaderboard: https://scale.com/leaderboard/rli

ScaleAI and AI Risks just dropped the Remote Labor Index (RLI) – and the results are pretty eye-opening! 👀

So, how well do today’s AI agents tackle actual remote work? Spoiler alert: not great, folks. 😅

The top performer, Manus, can only automate a mere 2.5% of tasks. Yup, almost everything is still on us humans! 🙋‍♂️💻

But hey, there’s a silver lining – Claude Sonnet 4.5, GPT-5, Gemini 2.5 Pro, and others are slowly but surely raising the bar. 📈

Bottom line: full automation is still a ways off, but we’re seeing gradual improvements. It’s not about leaps; it’s all about those baby steps. Real work? Still our jam for now! 💪✨

📊 Check out the leaderboard: https://scale.com/leaderboard/rli