Subgraph Atlas · centered on entity

Reinforcement Learning with Human Feedback (RLHF)

technology1 mentions· velocity: stable

In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves training a reward model to represent preferences, which can then be used to train other models through reinforcement learning.

Two-hop subgraph: this entity, every entity it directly relates to, and every entity those neighbors relate to. Drag a node, scroll to zoom, click to inspect — or click any neighbor and re-center the atlas there.

0 nodes · 0 edges · loading…

companypersonai_modelproductresearch_labbenchmarkframework

drag to move · scroll to zoom · click a node

How to read this: the white-ringed node is Reinforcement Learning with Human Feedback (RLHF). Surrounding nodes are direct relationships; the second ring is what those neighbors connect to. Edge thickness scales with source-article evidence. Click any node and choose Center graph here to walk the graph.