Gå direkte til innholdet
Advanced Automation for Comprehensible Causal Explanations of Reinforcement Learning Agents
Advanced Automation for Comprehensible Causal Explanations of Reinforcement Learning Agents
Spar

Advanced Automation for Comprehensible Causal Explanations of Reinforcement Learning Agents

Forfatter:
Engelsk
Les i Adobe DRM-kompatibelt e-bokleserDenne e-boka er kopibeskyttet med Adobe DRM som påvirker hvor du kan lese den. Les mer
This thesis introduces Auto-BENEDICT, a novel, fully automated methodology designed to generate human-comprehensible causal explanations for model-free Reinforcement Learning (RL) agents. The system addresses the trade-off between high performance and transparency in RL by integrating Bayesian Networks for causal inference and Recurrent Neural Networks to forecast future states and actions. The method provides answers to both “Why” and “Why not” questions, thereby increasing user trust and interpretability. The work also introduces enhanced importance metrics—including both Q-value-based and graph-based approaches—used to detect distal information, i.e., critical sequences of states or actions that are key to solving a task. These metrics are then fused with the causal explanation framework, resulting in Auto-BENEDICT, which not only explains but also recognizes high-risk or critical states automatically. Validation through computational experiments and a human evaluation study shows that Auto-BENEDICT significantly outperforms traditional methods in comprehensibility and trustworthiness, contributing a major advancement in Explainable Reinforcement Learning. 
Forfatter
Rudy Milani
ISBN
9783658504953
Språk
Engelsk
Utgivelsesdato
3.3.2026
Tilgjengelige elektroniske format
  • Epub - Adobe DRM
Les e-boka her
  • E-bokleser i mobil/nettbrett
  • Lesebrett
  • Datamaskin