There is keybind that acts as confirm, default is numpad 0, which in conversations acts as if you clicked the mouse in the subtitle box to advance. I assume it works just as well even if the graphical hud is turned of. Not complete solution but that's how I watch my cutscenes, voiced or not, tapping it forward with that key letting my mouse cursor stay out of the screen.