Terminology: the difference between a gesture and a manipulation

All over the web I see the word gesture used to describe every type of interaction on a natural user interface. Just because you use your finger or a stylus or an accelerometer, does not make it a “gesture.” Is this crucial? Not really to users, consumers, marketing, et al. But it is in being a good scholar and interaction designer to get your terminology straight. It also helps when speaking with other developers to have your vocabulary correct so they do not misinterpret your meaning or solutions. Let’s start with the classical, dictionary definitions:

Main Entry: ¹ges·ture
Pronunciation: ˈjes-chər, ˈjesh-
Function: noun
Etymology: Middle English, from Anglo-French, from Medieval Latin gestura mode of action, from Latin gestus, past participle of gerere
Date: 15th century

1 archaic : carriage, bearing
2 : a movement usually of the body or limbs that expresses or emphasizes an idea, sentiment, or attitude
3 : the use of motions of the limbs or body as a means of expression
4 : something said or done by way of formality or courtesy, as a symbol or token, or for its effect on the attitudes of others <a political gesture to draw popular support — V. L. Parrington>

Main Entry: ma·nip·u·late
Pronunciation: mə-ˈni-pyə-ˌlāt
Function: transitive verb
Inflected Form(s): ma·nip·u·lat·ed; ma·nip·u·lat·ing
Etymology: back-formation from manipulation, from French, from manipuler to handle an apparatus in chemistry, ultimately from Latin manipulus
Date: 1834

1 : to treat or operate with or as if with the hands or by mechanical means especially in a skillful manner
2 a : to manage or utilize skillfully b : to control or play upon by artful, unfair, or insidious means especially to one’s own advantage
3 : to change by artful or unfair means so as to serve one’s purpose : doctor

You can already start to see the differences for our purposes. One is emotional, symbolic, indirect. The other is direct or mechanical. There are 4 primary differences between the two and they are easily classified after you know them.

Manipulations

contextual – they only happen at specific location(s) or on specific object(s)
react immediately – there is a direct correlation in cause and effect between your interaction and the system (this does not include visual affordance)
can be single state, but are usually 3 or more states ( see Bill Buxton’s paper on Chunking and Phrasing )
direct (could possibly be considered indirect by way of augmenting your actual interactions with the reaction of the system) – your actions directly affect the system, object, or experience in some way

Gestures

not contextual – they can be anywhere in the system in location and time
the system waits for the series of events to complete to decide on how to react (again, this does not include visual affordance)
they contain at least 2 states
indirect – they do not affect the system directly according to your action. Your action is symbolic in some way that issues a command, statement, or state.

In Dan Saffer’s book, Designing Gestural Interfaces, (O’Reilly, 2009) on page 2 he states “for the purposes of this book, is any physical movement that a digital system can sense and respond to without the aid of traditional pointing devices such as a mouse or stylus.” That may be a simple way to define the types of interaction for his book, but generalizing them in that manner is incorrect. I think Professor Shneiderman’s seminal paper in 1983 was absolutely correct. Direct manipulation is just that, direct manipulation. When we start to discuss more complex chained movements that are commands, we need a new set of terminology. (http://en.wikipedia.org/wiki/Direct_manipulation_interface)

Manipulations are the lowest common denominator and the “catch-all.” They are the most prevalent and the most widely patterned because they are easy to design for, easy to understand, and very intuitive with expected results. Gestures are more complex and is what all designers strive to achieve. When trying to decipher if something is a manipulation or a gesture, unless it passes all 4 tests for gesture, it is a manipulation. There are very few true gestures in systems currently.

These have also been called direct gestures (manipulations) and indirect gestures (gestures). Calling them this is confusing the terms and can lead to errors in design or implementation. I leave you with a graphical representation of gestures vs manipulations.

I’m eager to hear any dissenting opinions. Please comment or drop me an email. I’ll also send a copy of this to Dan as well.

9 comments on “Terminology: the difference between a gesture and a manipulation”

If we talk about “differences”, then we can consider the differences between requirements for the gesture and manipulation. For example, when designing manipulation, we need to know how movements will be carried out – with one hand or two hands, one finger, or 5 fingers and so on. And when we design the gesture is important only gesture…. it’s like when we print word on the keyboard – no matter how we did it (used blind press or a stupid way), only important word that we wrote. What do you think?

Ron on December 21, 2009 at 6:19 pm said:

I think in essence you are correct about many points. A manipulation is created from the bottom up, which means that you create the manipulation and then allow affordance for how many ways the user can interact with it. You want to give the user as much leeway as you can while staying within the design constraints.

The main issue with the differences are determining when and where a manipulation can turn into a gesture. Such as moving an object so far then activates a gesture performing a function.

Reply

I agree! Just pleasing! Your penning manner is pleasing and the way you managed the subject with grace is commendable. I am intrigued, I presume you are an expert on this topic. I am signing up for your updates from now on.

an interesting approach. One thing comes to mind. I think we have a system that can inform us about the way gestures in perticular work namely writing on tablet pc’s and mobiles. These computers have writing recognition and some use a gesture like input. I myself have tried several times to use these gesture input things and was frustrated having to learn to write again. This way of input has the problem that the accuracy needed makes it error prown. It might be interesting to do research on how many people use it.

Ron on December 30, 2009 at 2:01 am said:

One of the most interesting things I learned about actual alternative input users on the Windows platform was the use of the flicks.

Users would program their favorite actions or command into it and use it in games. To a designer with no experience, you would be designing for a corner case. After seeing the numbers, that is absolutely not the case. It was significant.

Reply

Interesting. With your work with MS Surface do you have any clear examples where gestures have been applied? Sometimes I feel they get a lot of attention, but are rarely widely used. Or am I missing something?

Ron on December 30, 2009 at 2:02 am said:

I’m not sure if any are in the system at this point. When I left, last summer, there were none implemented. That didn’t stop design though, and there are several in the works now.

There have also been some great leaps forward in the gesture learning and recognition category.

Reply

Pingback: Tweets that mention Terminology: the difference between a gesture and a manipulation- Experience Design by Ron George -- Topsy.com

Pingback: OCGM (pronounced Occam['s Razor]) is the replacement for WIMP- Experience Design by Ron George

Ron George Experience Design