Human feedback (2 posts)
AI
Large Language Models
OpenAI
CriticGPT
ChatGPT
Hallucinations
Reinforcement Learning
Human Feedback
26 Jul, 2024
OpenAI's New Tool to Tame the Beast of Large Language Models
By Elise Montgomery
Large Language Models
Online Alignment
Reinforcement Learning
Human Feedback
Reward Models
4 Jun, 2024
Revolutionizing Large Language Models: Active Preference Elicitation for Online Alignment
By Desmond Morales