Skip to content

Convert Webster's Unabridged 1913 dictionary in to a more usable format

License

Notifications You must be signed in to change notification settings

brandon733/WebsterParser

 
 

Repository files navigation

WebsterParser

A better dictionary for your mac

In the a blog post named [You’re probably using the wrong dictionary] (http://jsomers.net/blog/dictionary), James Somers proposes using Webster’s Unabridged Dictionary as it provides more evocative and accurate definitions than most modern dictionaries.

The text of the 1913 version has been digitized and can be found on Project Gutenberg. Unfortunately the text files are in a very arcane format. Being created before UTF-8 was commonly used, it specifies a lot of non-standard entities to encode the all the various accents and special symbols.

This project parses these original text files and creates a reasonably clean UTF-8 XML version which can be converted into a mac dictionary file with Apple’s Dictionary Kit.

Screenshot of the dictionary

How to build

Get the submodule(s). Run:

git submodule update --init

Or, after pull, to update the submodule(s). Run:

git submodule update --remote

Build it! With NodeJS installed, run:

npm install
node index.js

Building the dictionary might take a while (around three minutes on my machine)

Just want the .dictionary file?

Download it in the releases section of this project.

Using it on iOS?

You can use the dictionary file on your iDevice if it is jailbroken. SSH into your device and navigate to /private/var/mobile/Library/Assets/com_apple_MobileAsset_DictionaryServices_dictionary2. On your iDevice dowload any new stock dictionary (Select a word -> Define -> Manage -> Download) that you don't need. In your SSH browser find out which folder was just added. Navigate to folderwithcrypticnumber/AssetData. Replace the .dictionary folder with the webster.dictionary folder, but keeping the name. You should now be able to lookup words.

I don't know how to change the name of the dictionary in the list, pointers are welcome.

Notes

  • When running index.js, the following unknown entities pop up. Maybe someone will one day track these down:
    sb
    integral2l
    colret
    filig
    ffllig
    fllig
    astascending
    astdescending
    ai
    dot
    lbrace2
    rbrace2
    Prime
    8star
    upslur
    downslur
    tsdo
    ecrl
    colbreak
    

About

Convert Webster's Unabridged 1913 dictionary in to a more usable format

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • JavaScript 82.6%
  • CSS 11.3%
  • Makefile 6.1%