HOME  |  NEWS  |  BLOGS  |  MESSAGES  |  FEATURES  |  VIDEOS  |  WEBINARS  |  INDUSTRIES  |  FOCUS ON FUNDAMENTALS
  |  REGISTER  |  LOGIN  |  HELP
Blogs
Guest Blogs

UX Design: The Age of Voice Is Here

Page 1 / 2 Next >
View Comments: Newest First|Oldest First|Threaded View
Page 1/2  >  >>
William K.
User Rank
Platinum
Re: The very worst part of speech input controls
William K.   4/27/2015 4:49:40 PM
NO RATINGS
JW is certainly right. I had not even considered the effect on those with any form of speech impairment. 

So I would suggest that they give the technology another ten or fifteen years to mature before attempting to make it the "only way" to control anything.

JimWagner
User Rank
Iron
Re: Not so fast
JimWagner   4/27/2015 2:14:01 PM
NO RATINGS
Privacy is yet another issue. Want to do it on a bus or subway? Restaurant? Walking down a sidewalk? Who is listening in?

technos
User Rank
Iron
Not so fast
technos   4/27/2015 1:12:32 PM
NO RATINGS
For the same reasons I do not speak the words outloud when I read a book, I don't like to speak when interacting with a smartphone. It accomplishes something that I really do not prefer and would rather not have outright. It might have some application where hands free operation is needed for something, but I do not see it as becoming the prevalent mode of control.

I'd probably look for and purchase the hardware that is not reliant on voice control.

JimWagner
User Rank
Iron
Re: The very worst part of speech input controls
JimWagner   4/27/2015 12:43:44 PM
NO RATINGS
One of the things that scares me, as a stutterer, is how will voice recognition handle my speach. The times I have tried to use it have been miserable failures. How can it possibly meet ADA criteria? My speech patterns are rarely the same, for a given phrase, on any two times. How can speech recognition "recognize" such speech? 

 

William K.
User Rank
Platinum
The very worst part of speech input controls
William K.   4/27/2015 12:02:08 PM
NO RATINGS
The really bad part of the whole concept is that there would not be adequate presentation of what the commands should be, as what would be acepted. It seems that in moany cases the entire functional structure is inflexible and designed by a programmer whose entire thought process is totally different from mine. IT can only be far worse when a system is looking for a specific command format and  either responds with the wrong choice or does not respond at all. AND it is well known that people's voices sound different when they are under stress, and attempting to work with an uncooperative computer system will certainly increase the stress level even more. 

Consider this scenario: a runaway engine and the car acelerating- "Engine off", the computer says "NO, you are driving fast so youmust have power steering", as you appraoch a traffic jam on the interstate at 90MPH. WE already know that the controls computer will not shut down an engine if you are rolling fast, because it is programmed to insist that having full power steering assist is more important. Does anybody else see a problem here?

William K.
User Rank
Platinum
Re: Hack it
William K.   4/27/2015 11:45:14 AM
NO RATINGS
Steve H. is so very correct, right on point!

Just because it can be done is no reason for something to be done. I have to listen to lots of different people and I need to understand them correctly, and the spread of speech sounds is HUGE. In addition there are lots of people who would NEVER speak up so that the system could understand them. Plus, there is a portion of humanity that is just plain too stupid to ever say what the system is wanting to hear.

So while it may be a fun project for a bunch of creative engineers with lots of time on their hands it is not a good idea as an item for widespred application.

BrianD
User Rank
Iron
HW is not the main challenge
BrianD   4/27/2015 10:13:33 AM
NO RATINGS
As someone involved in speech recognition and voice user interfaces for automotive applications for over ten years, I disagree with the premise of the author that the quality of microphone is the primary challenge.  Yes, the system needs a certain SNR for the speech reco engine to work effectively, but IBM and Nuance (now only Nuance) have made tremendous progress with noise-tolerant, far-field, speech (mic) input.

I see VUI challenges differently:

1. What is the extent of voice control?  E.g., does it make sense for the user to utter, "Turn on the front defroster" when the user can press the a single button to accomplish the same thing.

2. How structured or freeform does the voice UI have to be to be useful?  Must the system understand "natural language" and user intent, e.g., "I can't see out the window" versus "Defroster Front On High".

3. What can be controlled by voice in different states of the system?  If the user is searching for a phone contact to call, must the system understand an "out of context" command, e.g., the user utters, "Play Eric Clapton"?

4. How do VUI and GUI interact?  Are the UIs independent and asynchronous or interdependent and synchronous?  I prefer the latter, but it drives more implementation complexity.

The above challenges drive confusability (which reduces recognition success rate), processing power (more complex VUI --> more processing required; delays affect usability), and memory.

I strongly support VUI for automotive use, but few systems have "good" UIs, IMO.

mpersic
User Rank
Iron
VUI'd Barbie... a good thing?
mpersic   4/27/2015 10:05:40 AM
NO RATINGS
I do not see anything positive about a child interacting (via VUI) with their Barbie or Ken or GI Joe doll.  When a child interacts with other humans (adults or children) social behavior and language skills are developed - I doubt that same level of social development is achieved with VUI response.

laschauernd
User Rank
Iron
voice input vs touchscreen
laschauernd   4/27/2015 10:01:15 AM
NO RATINGS
voice input is getting pretty good, but there will always be a need for editing of text input. A very interesting video on youtube, search for PndEdZQl1Ec or the words Ape Sheet.

combines handwriting recognition with voice input...with a highly usable spreadsheet...on a tiny smartwatch touchscreen, the Samsung Gear 2!

RCSwan
User Rank
Iron
For the Deaf Consumer?
RCSwan   4/27/2015 9:56:08 AM
NO RATINGS
I don't agree that this technology will ever replace the entire user interface with voice, because there are Deaf consumers out there. It's a good thought for you guys but in public?

Page 1/2  >  >>
Partner Zone
More Blogs from Guest Blogs
Soft starter technology has become a way to mitigate startup stressors by moderating a motor’s voltage supply during the machine start-up phase, slowly ramping it up and effectively adjusting the machine’s load behavior to protect mechanical components.
Despite the astronomical benefits offered by 3D modeling, it is quite surprising that nearly 75% of the manufacturing industries still perform design operations using 2D CAD systems. What is the reason that keeps companies hesitant from adopting 3D technology?
Energy harvesting in particular seems to be moving at an accelerating pace. We now seem to be at a point where it is possible to run low-power systems primarily from energy harvesting sources. This is a big shift from even just a couple of years ago. Three key trends seem to have accelerated this dramatic shift.
ABI Research, a firm based in the UK that specializes in analyzing global connectivity and other emerging technologies, estimates there will be 40.9 billion active wirelessly interconnected “things” by 2020. The driving force is the usual suspect: the Internet of Things.
Design News Webinar Series
3/31/2015 11:00 a.m. California / 2:00 p.m. New York
2/25/2015 11:00 a.m. California / 2:00 p.m. New York
12/11/2014 8:00 a.m. California / 11:00 a.m. New York
5/7/2015 11:00 a.m. California / 2:00 p.m. New York
Quick Poll
The Continuing Education Center offers engineers an entirely new way to get the education they need to formulate next-generation solutions.
May 4 - 8, Designing Low Power Systems using Battery and Energy Harvesting Energy Sources
SEMESTERS: 1  |  2  |  3  |  4  |  5  |  6 |  7


Focus on Fundamentals consists of 45-minute on-line classes that cover a host of technologies. You learn without leaving the comfort of your desk. All classes are taught by subject-matter experts and all are archived. So if you can't attend live, attend at your convenience.
Last Archived Class
Sponsored by Proto Labs
Learn More   |   Login   |   Archived Classes
Twitter Feed
Design News Twitter Feed
Like Us on Facebook

Sponsored Content

Technology Marketplace

Copyright © 2015 UBM Canon, A UBM company, All rights reserved. Privacy Policy | Terms of Service