By age three, humans are already experts at speech recognition. Computers, on the other hand, still have only remedial skills after a roughly 30-year history.

That may begin to change, thanks to advances in speech recognition software from the biggest players in the market and the thriving competition among them for new voice command markets in mobile devices and automobiles.

One step forward came Tuesday, when Nuance Communications released a new version of its widely used PC-based speech-recognition technology, Dragon NaturallySpeaking 9. The software, in development for about two years, improves the accuracy of speech recognition by 20 percent over its version 8, which debuted in November 2004.

According to the company, that brings dictation accuracy to 99 percent, so people with disabilities or repetitive strain injury can control their PCs almost entirely by voice instead of using a mouse.

Nuance engineers also built in a shortcut around the once-lengthy script training that had turned off many consumers. Now people can begin using the speech software without first training the program to understand their voice. Instead, the software learns as it goes.

"People who tried it three, four, five years ago will notice massive improvements," said Matt Revis, Nuance's director of product management for dictation solutions. "Within several uses, the software catches up. It learns as you correct it."

Nuance's update comes as Microsoft tests its own speech recognition technology, which the software giant plans to offer at no charge within its new operating system, Vista. (NaturallySpeaking 9 costs about US$99 for the Standard edition and US$199 for the Preferred edition, which includes support for Microsoft Excel and syncing with digital handheld recorders.)

Like Nuance, Microsoft has worked on the accuracy of its program, so it can distinguish the word "beach" from "peach" by the context of the sentence it's in. It is also working on improvements to the user interface, so it's easier for average people to command the software to fix errors or to switch applications.
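
To picture how context can separate two similar-sounding words, here is a minimal sketch in Python. It assumes a tiny, made-up table of word-pair counts and a hypothetical helper, `pick_word`; it illustrates the general idea only and is not a description of how Microsoft's or Nuance's software works.

```python
# Toy illustration only: the counts below are invented for this example,
# and this is not how any commercial speech recogniser is implemented.
from collections import Counter

# Hypothetical counts of how often each word pair appears in sample text.
BIGRAM_COUNTS = Counter({
    ("the", "beach"): 50,
    ("the", "peach"): 20,
    ("beach", "towel"): 30,
    ("peach", "pie"): 40,
})


def pick_word(previous, candidates, following):
    """Return the candidate that fits best between its neighbouring words."""
    def score(word):
        # Add 1 to each count so an unseen pair does not zero out the score.
        left = BIGRAM_COUNTS[(previous, word)] + 1
        right = BIGRAM_COUNTS[(word, following)] + 1
        return left * right

    return max(candidates, key=score)


# The recogniser "heard" something between "beach" and "peach";
# the surrounding words decide which one is kept.
print(pick_word("the", ["beach", "peach"], "towel"))
print(pick_word("the", ["beach", "peach"], "pie"))
```

With these invented counts, the word before and the word after the uncertain sound tip the choice one way or the other, which is the sense in which a sentence's context can settle "beach" versus "peach".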

"The technology is really becoming more mature. The accuracy continues to improve at an exponential rate," Microsoft software architect Rob Chambers said.

Speech recognition is a difficult computer problem. For one, external noise can confuse the program's reception of the speaker's voice and cause it to misinterpret language. Other recognition hurdles can be the high pitch of one person's voice or the mumbling tendencies of another's. As a result, the software must learn the nuances of an individual's speech patterns to deliver the highest accuracy.

The next leap for speech recognition is in the mobility market. Handheld devices like BlackBerrys could allow people to dictate an e-mail instead of wearing out their thumbs on a tiny keyboard. Speech technology in automobiles could help drivers control the climate or navigate routes while keeping their hands on the wheel. Nuance's Revis said the company is talking to major wireless carriers and device makers about partnerships under a mobility initiative.

Microsoft is eyeing the market, too. Microsoft's Chambers said he believes that speech recognition will one day surpass the natural skills of humans. "At one point in the future, we believe that the speech recogniser will be more accurate than a human is. We already do that in numerical digits."




