6 results found

Pantaleon Fassbender

As Large Language Models (LLMs) are increasingly deployed in autonomous, high-stakes environments, the fragility of current Reinforcement Learning from Human Feedback (RLHF) alignment protocols remain...

Research Square 2026-04-22 rs-9487834
Machine Psychology AI Psychometrics Large Language Models (LLMs) Ontological Dissonance AI Alignment Constraints Cognitive Narrowing Reinforcement Learning from Human Feedback (RLHF)

Venkat Alamuri

Dynamic data distributions, system failures, and the need for low-latency, cost-effective processing increasingly challenge modern real-time data pipelines. Current streaming architectures are based o...

Research Square 2026-04-21 rs-9455617
Self-healing data pipelines Reinforcement learning Stream processing Cost-aware optimization Real-time validation Adaptive systems Anomaly detection

Venkat Alamuri

Current data validation systems are mostly reactive, static, and resource-heavy, which can interrupt pipelines and leave data corruption undetected in real-time settings. T...

Research Square 2026-04-21 rs-9455603
Predictive Data Validation Self-Healing Data Pipelines Reinforcement Learning Graph-Based Validation Data Quality Assurance Autonomous Data Systems Spatio-Temporal Graphs Intelligent Data Engineering

Yunguo Yu

Healthcare revenue cycle management (RCM) loses billions annually to claim denials, yet existing machine learning approaches treat billing as a prediction problem rather than a decision problem: they pr...

Research Square 2026-04-21 rs-9465761
Causal inference Offline reinforcement learning Revenue cycle management Conformal prediction Constrained Markov decision process Healthcare billing optimization

Wonhyeok Choi, Shutong Ding, Minwoo Choi, Jungwan Woo, Kyumin Hwang, Jaeyeul Kim, Ye Shi, Sunghoon Im

Diffusion policies have emerged as a powerful approach for robotic control, demonstrating superior expressiveness in modeling multimodal action distributions compared to conventional policy networks. ...

Artificial Intelligence Review 2026-04-21 rs-9346251
Robot Learning Diffusion Policy Online Reinforcement Learning Large-scale simulation

Atantra Das Gupta

Traditional approaches to anticancer dosing typically rely on fixed protocols that often overlook how patients respond differently to treatment. This can limit both effectiveness and safety. In this w...

Research Square 2026-04-20 rs-9411956
digital twin antineoplastic drug delivery reinforcement learning PBPK modeling precision oncology pharmacokinetics theragnostic deep Q-network