Jump to content

User:Mboverload/RegExTypoFix

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Alphachimp (talk | contribs) at 11:53, 10 August 2006 (add release misspelling). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Developer(s)mboverload
Stable release
Loaded by AWB on-demand
TypeTypofix built-in to AutoWikiBrowser
LicenseGPL
Websitesourceforge.net/.../regextypofix

RegExTypoFix (Regular Expression Typographical error Fixer, or RETF) is a set of over 1600 regular expressions used to automatically fix common typos and misspellings. It is built into AutoWikiBrowser. Anyone who can use AutoWikiBrowser can use RegExTypoFix. It is also easily ported into any application that supports regular expression strings.

The lofty goal of RETF is to be completely automatic. That is, 100% accuracy. Some day it may be built into programs that want basic spellchecking without user input.

RETF is completely manually compiled. By spellchecking articles/talk pages with Microsoft Word, and seeing which typos are the most common then adding those. I can add about 35 new words an hour. Example of how it works:
find="\b(D|d)issapoin(t|ts|ted|ting|tment|tments)\b" replacewith="$1isappoin$2" />

The only interaction needed is to review the change and hit save. You can do other stuff while it loads the page in the background.

Updates

Each time you enable RegExTypoFix in AWB it loads the typos from Wikipedia:AutoWikiBrowser/Typos. No need for updates anymore!

Using RETF

  1. Start AutoWikiBrowser
  2. More Options tab > Enable RegExTypoFix
  3. More Options tab > Skip article when no typo fixed
  4. Start tab > Summary box dropdown list > Select [[WP:RETF|RegExTypoFix]]
  5. Find a misspelling you want to fix

Communicate

If you have fewer than 50 articles in your AWB queue you're not aiming high enough.
--mboverload@

Sign up for the spam list

A weekly newsletter describing that week's changes and other comments

Talk with other users on IRC

We share an IRC channel with AutoWikiBrowser at chat.freenode.net - #AutoWikiBrowser. If you don't have an IRC client I suggest mIRC.

Have problems/suggestions/a word you want to be included?

Misspellings to be removed/fixed

Misspellings to be added

Please see RegExTypoFix/rejectedwords and the full list of fixed misspellings before you suggest a word. Thanks!
  • Publically -> publicly --Guinnog
  • (T|t)our(a|n)ment -> Tournament - Peripitus (Talk) 12:24, 7 August 2006 (UTC)


Misspellings added because of user input

Signature is of the person who suggested it
  • streching / streched -> stretc(hing|hed) --Guinnog ("strech" itself is the name of a company)
  • Capitalize all days and months —Mets501 (talk)
  • Calvanism -> Calvinism Outriggr
  • gauge: guag(e|ed|es)\b -> $1aug$2 Outriggr 10:36, 9 August 2006 (UTC)
  • (S|s)aterday -> Saturday —Mets501 (talk)
  • didnt -> didn't --Thiseye
  • managment -> management --Thiseye
  • extensivly -> extensively --Thiseye
  • proccess -> process --Thiseye
  • flamb(uo|ou)yant -> flamboyant --Thiseye
  • consquenc(e|es) -> consequence --Thiseye
  • (E|e)nviromen(t|ts|tal|tally|talist|talists) -> Environment --Outriggr
  • all intensive purposes -> all intents and purposes --Thiseye
  • offica(l|ly) -> officia$1 --Thiseye
  • pronounciation -> pronunciation --Outriggr
  • repond and variants -> respond --Guinnog
  • flourescent -> fluorescent --Guinnog
  • milennium -> millennium --Guinnog
  • flourine -> fluorine --Guinnog
  • noteable -> notable --Mets501
  • relas(e|ed|ing|es) --> releas(e|ed|ing|es) Outriggr