Advertising

The Seattle Times Company

NWjobs | NWautos | NWhomes | NWsource | Free Classifieds | seattletimes.com

The Seattle Times

Business / Technology


Our network sites seattletimes.com | Advanced

Originally published August 28, 2008 at 12:00 AM | Page modified August 28, 2008 at 1:23 AM

Comments (0)     E-mail article     Print view

How Microsoft's spell-check gatekeepers select words to add

Microsoft's Natural Language Group is in an ongoing race to keep up with the evolution of the dozens of languages for which they produce...

Microsoft's Natural Language Group is in an ongoing race to keep up with the evolution of the dozens of languages for which they produce spell-checkers and other writing tools.

Here's how the group selects words to add:

The first step is finding possible candidates for inclusion in the spell-checker lexicon. When Mike Calcagno started at Microsoft in 1998, that was done ad hoc, with candidate words or changes sent to someone high enough on the corporate ladder to get attention.

"The number of issues that we would see at that time was so small that we could keep track of it on a single Excel spreadsheet," he said.

Now, the company uses software to monitor actual language usage across its vast properties.

"When you add a word to your custom dictionary, either in Word itself or in Hotmail, that word comes to us," Calcagno said. When a word is added hundreds of times, it becomes part of the candidate list. Words still come in on an ad hoc basis, too.

The lists are filtered with software to eliminate words the team has already considered.

Then the words are sorted by frequency and sent to outside editors who evaluate each one against a set of guidelines Microsoft has created, such as whether a new word has appeared in a major dictionary.

Rarely, editors can't decide whether a word should be added and it's sent back to the Natural Language Group for debate. The team of about 50 software engineers, computational linguists, machine learning experts and other specialists hail from around the world.

With occasional exceptions, the words to be added — often tens of thousands of new ones — are shipped out to users in the next release of Office, used by hundreds of millions of people around the world.

"Everybody's speller gets updated and few people notice," he said.

— Benjamin J. Romano

Copyright © 2008 The Seattle Times Company

More Business & Technology headlines...

E-mail article Print view      Share:    Digg     Newsvine

Comments
No comments have been posted to this article. Start the conversation.

advertising

The local, public face of Chase, Phyllis Campbell is trading on trust

10 investing missteps to avoid

Sunday Buzz: Boeing fighter to run on biofuel; Mastro bankruptcy trustee keeps job

On the Economy: Washington state has to play the add-value card, not low-cost-leader ace

How do innovators think?

Advertising

Video

Mourners gather at KeyArena for slain officer's memorial
Mourners gathered at KeyArena for the memorial service of Seattle police Officer Timothy Brenton on November 6, 2009.

Procession for slain SPD officer
Election Night: Approve R-71
Election Night: Reject R-71
Election Night: Joe Mallahan
Election Night: Mike McGinn
Election Night: Susan Hutchison
Election Night: Dow Constatine
Candlelight vigil for Officer Brenton
Flying Elephant on Aurora

Marketplace

nwautos

2009's most fuel-efficient sedansnew
Choosing a new sedan? Weigh the impact of your choice on your wallet and on the planet.
Post a comment

Open Houses

Find this weekend's open house listings.
Or search by location:

 
Most read
Most commented
Most e-mailed
 
 
Advertising