HomeElectronics NewsNew Method Trains Robots Using Crowdsourced Feedback

New Method Trains Robots Using Crowdsourced Feedback

Human Guided Exploration (HuGE) facilitates rapid AI agent learning with assistance from humans despite potential human errors.

Credit: iStock, Christine Daniloff, MIT
Credit: iStock, Christine Daniloff, MIT

Training an AI agent in tasks like opening a kitchen cabinet often employs reinforcement learning, where a human expert develops and updates a reward function to guide the agent’s trial-and-error learning. This process, though effective, can be time-consuming and complex, especially for multi-step tasks.

- Advertisement -

Researchers from MIT, Harvard University, and the University of Washington have developed a new reinforcement learning method that relies on crowdsourced feedback from non-expert users worldwide. This approach allows the AI agent to learn more efficiently, overcoming the challenges of error-prone data that often impede similar methods.

Noisy feedback

The Human Guided Exploration (HuGE) method, developed for reinforcement learning, is innovative in utilising user feedback. Users are shown two images of states achieved by an AI agent and asked to choose the one closer to the goal, like a robot opening a cabinet versus a microwave. Unlike earlier methods where such binary, non-expert feedback directly optimised a reward function, often leading to errors, HuGE separates the process. It uses a goal selector algorithm, updated with human feedback, not as a reward but as guidance for the agent’s exploration. Simultaneously, the agent independently explores and collects data, refining the goal selector. This dual approach narrows the exploration field and allows asynchronous feedback, ensuring the agent continues learning without immediate feedback or amidst incorrect inputs, thus streamlining the learning process.

Faster learning

In testing their HuGE method, researchers conducted simulations and real-world experiments, using it to train robotic arms and navigate complex tasks like maze-solving and block-stacking. They gathered input from 109 non-expert users across multiple continents, finding that HuGE accelerated learning compared to other methods. Crowdsourced data proved more effective than synthetic data, with non-expert users labelling images or videos quickly. The research underscores the importance of aligning AI with human values, a crucial aspect in developing AI learning strategies.

- Advertisement -

In the future, the team intends to upgrade HuGE to learn from natural language and physical interaction with robots and to apply this method for teaching multiple agents simultaneously.

Nidhi Agarwal
Nidhi Agarwal
Nidhi Agarwal is a Senior Technology Journalist at Electronics For You, specialising in embedded systems, development boards, and IoT cloud solutions. With a Master’s degree in Signal Processing, she combines strong technical knowledge with hands-on industry experience to deliver clear, insightful, and application-focused content. Nidhi began her career in engineering roles, working as a Product Engineer at Makerdemy, where she gained practical exposure to IoT systems, development platforms, and real-world implementation challenges. She has also worked as an IoT intern and robotics developer, building a solid foundation in hardware-software integration and emerging technologies. Before transitioning fully into technology journalism, she spent several years in academia as an Assistant Professor and Lecturer, teaching electronics and related subjects. This background reflects in her writing, which is structured, easy to understand, and highly educational for both students and professionals. At Electronics For You, Nidhi covers a wide range of topics including embedded development, cloud-connected devices, and next-generation electronics platforms. Her work focuses on simplifying complex technologies while maintaining technical accuracy, helping engineers, developers, and learners stay updated in a rapidly evolving ecosystem.

SHARE YOUR THOUGHTS & COMMENTS

EFY Prime

Unique DIY Projects

Electronics News

Truly Innovative Electronics

Latest DIY Videos

Electronics Components

Electronics Jobs

Calculators For Electronics