Yes, I think something like that would at least make it easier
to understand.
For example, a popular site like Facebook always shows the
bubbles pointing to specific places on the screen, so the user
can spot where they came from or what needs attention. I believe
they have made it more intuitive based on feedback.
How intuitive does it have to be? It's explained in less than ten
seconds.