ExploreLearning Code - Search News

verl: Volcano Engine Reinforcement Learning for LLMs

verl is a flexible, efficient and production-ready RL training library for large language models (LLMs). verl is the open-source version of HybridFlow: A Flexible and Efficient RLHF Framework paper.

GitHub

The AI Scientist: Towards Fully Automated

One of the grand challenges of artificial intelligence is developing agents capable of conducting scientific research and discovering new knowledge. While frontier models have already been used to aid ...

IEEE

Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models

Abstract: This paper studies how AI-assisted programming and large language models (LLM) improve software developers' ability via AI tools (LLM agents) like Github Copilot and Amazon CodeWhisperer, ...

Pocket Tactics

All Wuthering Waves codes (WuWa) for December 2025

December 17, 2025: We checked for any new Wuthering Waves codes and removed the expired livestream codes from our list We're huge fans of gacha games, and the available Wuthering Waves codes don't ...

Ars Technica

OpenAI built an AI coding agent and uses it to improve the agent itself

With the popularity of AI coding tools rising among some software developers, their adoption has begun to touch every aspect of the process, including human developers using the tools to improve ...

Car and Driver

Rivian R1S and R1T to Add Expanded Hands-Free Driving and AI Assistant

The latest versions of the Rivian R1S and R1T are about to be eligible for some significant upgrades that can be added through the magic of over-the-air updates. Starting this month, an OTA will be ...

TechCrunch

OpenAI fires back at Google with GPT-5.2 after ‘code red’ memo

OpenAI launched its latest frontier model, GPT-5.2, on Thursday amid increasing competition from Google, pitching it as its most advanced model yet and one designed for developers and everyday ...

Microsoft

Agent Lightning: Adding reinforcement learning to AI agents without code rewrites

AI agents are reshaping software development, from writing code to carrying out complex instructions. Yet LLM-based agents are prone to errors and often perform poorly on complicated, multi-step tasks ...

TechCrunch

Claude Code is coming to Slack, and that’s a bigger deal than it sounds

Anthropic is launching Claude Code in Slack, allowing developers to delegate coding tasks directly from chat threads. The beta feature, available Monday as a research preview, builds on Anthropic’s ...

IEEE

“Paper, Meet Code”: A Deep Learning Approach to Linking Scholarly Articles With GitHub Repositories

Abstract: Computer scientists often publish their source code accompanying their publications, prominently using code repositories across various domains. Despite the concurrent existence of scholarly ...

Ars Technica

OpenAI CEO declares “code red” as Gemini gains 200 million users in 3 months

The shoe is most certainly on the other foot. On Monday, OpenAI CEO Sam Altman reportedly declared a “code red” at the company to improve ChatGPT, delaying advertising plans and other products in the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results