This would be relatively simple to do: the client already accesses the server clock constantly, so it wouldn't require any additional server resources.

All they would need to do is give us an additional macro command (much like the current 'motion' command, which removes the text description) that has the client wait until a specific interval on the server clock before initiating the emote.
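To show what I mean, here is a rough sketch in Python of the timing calculation such a command might do. The function name, the millisecond units and the 5-second interval are all my own assumptions for illustration, not anything taken from the actual client.

```python
def wait_until_next_boundary_ms(server_time_ms: int, interval_ms: int) -> int:
    """Return how many milliseconds to wait so that the emote starts on the
    next multiple of interval_ms on the server clock.

    Hypothetical helper: the name and the units are assumptions.
    If the clock is already exactly on a boundary, the wait is 0.
    """
    if interval_ms <= 0:
        raise ValueError("interval_ms must be positive")
    # Python's % always returns a non-negative result for a positive divisor,
    # so this is the distance from server_time_ms up to the next boundary.
    return (-server_time_ms) % interval_ms


# Two players press their macro at different moments (server time in ms).
INTERVAL_MS = 5_000  # assumed sync interval: every 5 seconds
player_a_pressed = 12_300
player_b_pressed = 13_900

start_a = player_a_pressed + wait_until_next_boundary_ms(player_a_pressed, INTERVAL_MS)
start_b = player_b_pressed + wait_until_next_boundary_ms(player_b_pressed, INTERVAL_MS)
```

Both players end up starting at the same server time even though they pressed the macro 1.6 seconds apart, and the server does nothing extra since each client only reads the clock it already has. Players who press on opposite sides of a boundary would still end up one interval apart.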

All the tools for this are already there; they just need to allow emotes to utilise them.