1 results found

Pantaleon Fassbender

As Large Language Models (LLMs) are increasingly deployed in autonomous, high-stakes environments, the fragility of current Reinforcement Learning from Human Feedback (RLHF) alignment protocols remain...

Research Square 2026-04-22 rs-9487834
Machine Psychology AI Psychometrics Large Language Models (LLMs) Ontological Dissonance AI Alignment Constraints Cognitive Narrowing Reinforcement Learning from Human Feedback (RLHF)
Back to Top
Home
Browse
Submit
About
0.032818s