Not A Wall, A World: The Future Of User Experience

I loved Minority Report’s gestural interface, as a scifi representation of what we think a police state might use to watch us, given the ability to move through a nearly infinite amount of data — and time — searching for clues.

Apparently, that interface is not just the stuff of Hollywood, as it appears that John Underkoffler, the guy that mocked up that experience for the movie, has been off actually building the system he literally is hand waving into existence.

MG Siegler thinks this represents the future of computing. I disagree, but first, MG’s thoughts:

- MG Siegler, The iPad Is Step 1 In The Future Of Computing. This Is Step 2 (Or 3).

While we may not have been at this year’s TED conference, apparently, Oblong was. And apparently, it wowed the crowd. And it should have. If you’ve seen the movie Minority Report, you’ve seen the system they’re building.

No, really. The co-founder of Oblong, John Underkoffler, is the man who came up with the gesture-based interface used in the Steven Spielberg movie. And now he’s building it in real life.

The demo I saw a couple years ago was stunning, but it was still just a video. Apparently, at TED, the audience got to see it in action. NYT’s Bits blog detailed some of it in a post yesterday. For those not at TED, Oblong has also made a few demo videos in the past, which I’ll embed below. Again, this is Minority Report.

Oblong’s coming out party couldn’t come at a better time. Following the unveiling of Apple’s iPad, there has been a lot of talk about the future of computing at a fundamental level. That is to say, after decades of dominance by the keyboard and mouse, we’re finally talking about other, more natural, methods of input. The iPad is one step to a multi-touch gesture system (as is this 10/GUI awesome demo), but this Oblong system is the next step beyond that.


I don’t believe that huge displays based on petabytes of information — like Cruise was surfing — is likely to be the prototypical user experience for normal people in the near term. In some narrowly defined industries — military, cinematography — such displays may be temporarily of interest. But the future of user experience is a logical extension of what we have been seeing in consumer electronics: a continued movement to small, mobile, and personal.

Yesterday, I posted a Nokia video that I think is much more true to life. I reproduce it here, again, in the form of the complete video, and an image pulled out.

complete video


one screenshot from the video

The Nokia example is based on a few assumptions:

Augmented reality glasses will become the standard user display — Instead of huge displays, hung on walls with giant panels, people will wear augmented reality glasses. These will display on the inside of the glass images that provide access to various sorts of information.

Displays will become less complex than today’s file/folder/desktop jumble, and interaction will be based on simple eye movements and gestures — User interaction will rely on eye tracking and gestural interfaces to represent selection, expansion, playing video or audio, and the like. In this example, the woman looks at the name of an artist in a playlist long enough and the environment interprets that as a selection. At some points in the demo she flicks her hand to represent clicking or scrolling. Note she doesn’t wear gloves or special hand gear: the glasses have cameras that watch her hands. They don’t show her doing it but either a generalized sign language could be used for more complex communications — more than selecting an emoticon, like she does — or a virtual keyboard could be displayed, and ‘keystrokes’ recorded, again, by the glasses observing our hands.

I don’t think that the grand gestural, ‘orchestra conductor’ sort of scenario that we saw in Minority Report will be the norm, although in specialized contexts — like gaming, war fighting, and brain surgery — those sorts of advanced gestural languages might be developed.

Social interaction with others will be the primary modality of all future operating environments, and other activities will principally be constructed to help filter and aggregate social channels — This is not well-represented in either the Nokia video or Minority Report. In the Nokia example, the woman is mostly dabbling with relatively conventional streams and stores — weather and news, and riffling through a music library — while occasionally being pinged by an overly attentive boyfriend. Imagine a more rich scenario of a marketing executive racing through the streets of New York, communicating with four colleagues in an open semi-public sort of way, with integrated information streams of plans, designs, and marketing campaign mockups. And at the same time receiving local augmented reality information about the streets she is passing through, like GPS coordinates, a map showing her destination and where her four colleagues are, offers from the food truck she passes, and a global stream of socialized news and information from her network of friends, fans, and connections.

***

This can be condensed to the shorthand: not a wall, a world.

The steampunk idea that we will continue to have displays like today’s TV screens or PC monitors is dubious. I would give up mine in a heartbeat. More important, there is a world out there, and amplifying what we are already looking at — like the street we are walking on — with relevant information — like where the bus stop is, or what kind of food that restaurant serves — is so obviously helpful it doesn’t really need to be motivated.

We will continue to have personal and mobile computing experiences in the near future, because mostly we work and play on personal devices. Yes, there is the occasional meeting in a face to face setting where currently we use large displays, but this will be replaced by shared augmented reality: a presentation, for example, could be controlled by one person (or more) and viewed by a larger group. But this wouldn’t be projected on the wall, necessarily. Instead, it would be shared via each attendee’s glasses. We might be looking at a blank wall, or we might be walking through a virtual representation of a building being designed, or a product being assembled.

Amplifying the social through this sort of user experience would be phenomenal. Wandering around at a business meeting, a party, or a conference, and seeing salient information about the people you are looking at — where they work, when you last talked, the names of their loved ones, their pet peeves, whether they follow you and know of your work — would be an immense help. And would potentially change the nature of our social contract in startling ways. This is what I am expecting to appear, and very soon. Not 2045.

Enhanced by Zemanta

Doing That Crazy Hand Jive: Gesture And The Future Of User Experience

The technology industry is going retro — moving away from remote controls, mice and joysticks to something that arrives without batteries, wires or a user manual.


It’s called a hand.

In the coming months, the likes of Microsoft, Hitachi and major PC makers will begin selling devices that will allow people to flip channels on the TV or move documents on a computer monitor with simple hand gestures. The technology, one of the most significant changes to human-device interfaces since the mouse appeared next to computers in the early 1980s, was being shown in private sessions during the immense Consumer Electronics Show here last week. Past attempts at similar technology have proved clunky and disappointing. In contrast, the latest crop of gesture-powered devices arrives with a refreshing surprise: they actually work.

[…]

Just as Microsoft’s gaming system hits the market, so should TVs
from Hitachi in Japan that will let people turn on their screens, scan
through channels and change the volume on their sets with simple hand
motions. Laptops and other computers should also arrive later this year
with built-in cameras that can pick up similar gestures. Such
technology could make today’s touch-screen tools obsolete as people use
gestures to control, for instance, the playback or fast-forward of a
DVD.

To bring these gesture functions to life, device makers
needed to conquer what amounts to one of computer science’s grand
challenges. Electronics had to see the world around them in fine detail
through tiny digital cameras. Such a task meant giving a TV, for
example, a way to identify people sitting on a couch and to recognize a
certain hand wave as a command and not a scratching of the nose.

Little
things like the sun, room lights and people’s annoying habit of doing
the unexpected stood as just some of the obstacles companies had to
overcome.

GestureTek, with offices in Silicon Valley and
Ottawa, has spent a quarter-century trying to perfect its technology
and has enjoyed some success. It helps TV weather people, museums and
hotels create huge interactive displays.

This past work,
however, has relied on limited, standard cameras that perceive the
world in two dimensions. The major breakthrough with the latest gesture
technology comes through the use of cameras that see the world in three
dimensions, adding that crucial layer of depth perception that helps a
computer or TV recognize when someone tilts their hand forward or nods
their head.

via Ashee Vance, www.nytimes.com

This advance is one of several that will form the basis of an entirely new experience for computing.

Gestural UI, or ‘hand jive’ as I call it, once deployed as a built in aspect of future computers, like touchpads and mouses are today, will set the stage for a rethink about user experience.

First we will see hand jive as a way to manipulate the gears of now-tradition windowed UIs: pulling down a menu in an app, moving windows around, dragging a file to the trash.

In the future, we’ll have real Minority Report stuff, without the enormous touch screens: we’ll also see the emergence of augmented reality goggles — Terminator goggles — where we can toggle back and forth between 100% computer screen sorts of display to 100% augmented reality. And the goggles — as an integrated part of the computing device — will be watching our hands for commands, and watching the world for reality to augment.

The combination of these trends will make computing primarily mobile: we’ll have an iPhone sized device we carry all the time, which will be a phone and a PC. We will be free of LCD screens — in general — courtesy of our goggles, and free of keyboards, courtesy of hand jive. A keyboard can be imaged on any flat surface by the goggles, and we can type without a physical keyboard because the gestural system is watching our fingers in 3D. And of course, a lot of things could be done without typing, especially once kids start using sign language and voice to communicate with computers. (I say kids because that’s who start first.)

The other parts of this tectonic shift in UX will include the end of the document-centric folder/file/desktop metaphor, where information in managed in documents, based on old school filing cabinets. I believe that innovation like the Litl OS, which has shifted to a TV-influenced UI of channels, treating information as something that flows and not something static, sitting in a document.

And of course, the social web will be the foundation of future computing, as opposed to a document-centric world in which people are an afterthought.

How To Hand Jive

image from files.droplr.com

via Ehow