VivaTechnics 2 days ago

Awesome!

Technically, we could say?

(1) Single-loop: fixes actions within fixed rules, like Reinforcement Learning.

(2) Double-loop: questions and adapts the rules, somewhat like Meta-Reinforcement Learning.