I think it'd be neat.

State: Ability Set
Transition/Policy: Available Abilities
Value: Damage

Buffs are just abilities, and skills with modifiers don't need special handling: their benefits show up in the value (damage), which ultimately shapes the policy as it converges.
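
To make that framing concrete, here is a rough sketch using tabular Q-learning. Everything specific in it is an assumption for illustration: the three abilities, their damage numbers, the rule that "buff" doubles all later damage, the one-use-per-ability rotation, and the hyperparameters. The state is the set of abilities already used, the actions are the abilities still available, and the reward is the damage dealt.

```python
import random

# Hypothetical abilities and base damage (illustrative numbers only).
# "buff" deals no damage itself but doubles every ability used after it.
BASE_DAMAGE = {"strike": 10, "heavy": 25, "buff": 0}


def available(state):
    """Abilities not yet used; state is a frozenset of used abilities."""
    return sorted(a for a in BASE_DAMAGE if a not in state)


def step(state, ability):
    """Use an ability and return (next_state, damage dealt)."""
    multiplier = 2 if "buff" in state else 1
    return state | {ability}, BASE_DAMAGE[ability] * multiplier


def train(episodes=2000, alpha=0.5, gamma=1.0, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy policy."""
    rng = random.Random(seed)
    q = {}  # maps (state, ability) -> estimated total future damage
    for _ in range(episodes):
        state = frozenset()
        while available(state):
            actions = available(state)
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                action = max(actions, key=lambda a: q.get((state, a), 0.0))
            next_state, damage = step(state, action)
            best_next = max(
                (q.get((next_state, a), 0.0) for a in available(next_state)),
                default=0.0,  # terminal state: nothing left to use
            )
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (damage + gamma * best_next - old)
            state = next_state
    return q


def greedy_rotation(q):
    """Follow the learned policy greedily; return (ability order, total damage)."""
    state, order, total = frozenset(), [], 0
    while available(state):
        action = max(available(state), key=lambda a: q.get((state, a), 0.0))
        state, damage = step(state, action)
        order.append(action)
        total += damage
    return order, total


if __name__ == "__main__":
    order, total = greedy_rotation(train())
    print(order, total)
```

In this toy setup the buff is never special-cased. It earns zero damage on its own, but the value of the states that follow it is higher, so the learned rotation should open with it.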

I wouldn't mind taking it on and building off what you've started, if that's alright. ML and AI have been a passion of mine.