-
-
Notifications
You must be signed in to change notification settings - Fork 44
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(street_name_normalization): improvements to dedupe algo, more co…
…nservative approach
- Loading branch information
1 parent
5be4d95
commit 8aabc40
Showing
15 changed files
with
328 additions
and
165 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,9 @@ | ||
state highway|s.highway|s highway|st.highway|st highway|sh|s.h.|s.h|s h|st.h|st h|s.hw|s hw|st.hw|st hw|s.hwy|s hwy|shwy|s.hgwy|s hgwy|st.hgwy|st hgwy|s.hway|s hway|st.hway|st hway|s.hwy|s hwy|st.hwy|st hwy|s.hi|s hi|st.hi|st hi|statehighway | ||
state road|sr|stateroad|s.r.|s.r|s r|s.road|s road|st.road|st road|staterd|srd|s.rd|s rd|state rd|strd|st.rd|st rd | ||
state highway|s.highway|s highway|st.highway|st highway|sh|s.h.|s.h|s h|st.h|st h|s.hw|s hw|st.hw|st hw|s.hwy|s hwy|shwy|s.hgwy|s hgwy|st.hgwy|st hgwy|s.hway|s hway|st.hway|st hway|s hwy|st.hwy|st hwy|s.hi|s hi|st.hi|st hi|statehighway | ||
state route|sr|stateroute|s.r.|s.r|s r|s.route|s route|st.route|st route|statert|srt|s.rt|s rt|srte|s.rte|s rte|state rt|state rte|strt|strte|st.rt|st rt|st.rte|st rte | ||
county highway|ch|c.h.|c.h|c h|c.hw|c hw|co.hw|co hw|cty.hw|cty hw|c.hgwy|c hgwy|co.hgwy|co hgwy|cty.hgwy|cty hgwy|c.hway|c hway|co.hway|co hway|cty.hway|cty hway|c.hwy|c hwy|co.hwy|co hwy|cty.hwy|cty hwy|c.hi|c hi|co.hi|co hi|cty.hi|cty hi | ||
county route|cr|c.r.|c.r|c r|co.r|co r|c.rt|c rt|co.rt|co rt|cty.r|cty r|cty.rt|cty rt|c.rte|c rte|co.rte|co rte|cty.rte|cty rte|county touring route | ||
rural route|rr|r.r|r r | ||
township highway|th|t.h.|t.h|t h|twp.h|twp h|tshp.h|tshp h|t.hw|t hw|twp.hw|twp hw|tshp.hw|tshp hw|t.hgwy|t hgwy|twp.hgwy|twp hgwy|tshp.hgwy|tshp hgwy|t.hway|t hway|twp.hway|twp hway|tshp.hway|tshp hway|t.hwy|t hwy|twp.hwy|twp hwy|tshp.hwy|tshp hwy|t.hi|t hi|twp.hi|twp hi|tshp.hi|tshp hi | ||
township route|tr|t.r.|t.r|t r|t rt|t.rt|trt|t.rte|t rte|twpr|twp.r|twp r|twp.rt|twp rt|twp.rte|twp rte|tshp.r|tshp r|tshp.rt|tshp rt|tshp.rte|tshp rte | ||
us highway|us hwy|u s hwy | ||
us route|us rte|u s rte |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,80 +1,2 @@ | ||
ambassador | ||
arch bishop|archbishop | ||
bishop | ||
brigadier general|brig gen | ||
cardinal | ||
colonel|col | ||
commander|cmdr | ||
congressman|congress man | ||
congresswoman|congress woman | ||
corporal|cpl | ||
captain|capt|cpt | ||
chairman|chair man | ||
chairwoman|chair woman | ||
czar|tsar | ||
dame | ||
deputy prime minister|deputy pm | ||
district judge | ||
doctor|dr|doc | ||
doctors|drs|docs | ||
duke | ||
dutchess | ||
emperor | ||
brother|br | ||
father|fr | ||
sister|sr | ||
his royal highness|hrh|h r h | ||
her royal highness|hrh|h r h | ||
general|gen | ||
his honor|his honour | ||
her honor|her honour | ||
honorable|honourable|hon | ||
judge | ||
lady | ||
lieutenant|lieut|lgt|lt | ||
lieutenant governor|lieut governor|lgt governor|lieut gov|lgt gov|lt governor|lt gov | ||
lieutenant colonel|lieut colonel|lgt colonel|lieut col|lgt col|lt colonel|lt col | ||
lieutenant commander|lieut commander|lgt commander|lieut cmdr|lgt cmdr|lt commander|lt cmdr | ||
lieutenant general|lieut general|lgt general|lieut gen|lgt gen|lt general|lt gen | ||
lord | ||
king|kg | ||
madame | ||
madames | ||
major|maj | ||
major general|maj gen|maj general|major gen | ||
messrs | ||
mp|member of parliament | ||
mps|members of parliament | ||
mr|mister | ||
mrs|misses | ||
ms|miss | ||
officer|ofcr | ||
pope | ||
president|pres | ||
prime minister|pm | ||
prince | ||
princess | ||
private first class|pfc|p f c | ||
professor|prof | ||
professors|profs | ||
queen | ||
reverend|rev | ||
right honorable|right honorable|right honourable|right hon|rt honorable|rt honorable|rt honourable|rt hon|rh|r h|r hon|right and honorable|right and honourable | ||
saint|st | ||
saints|ss | ||
sainte|ste | ||
san|s | ||
santa|sta | ||
sargeant|sgt | ||
secretary|sec | ||
sir | ||
sirs | ||
representative|rep | ||
representatives|reps | ||
senator|sen | ||
senators|sens | ||
vice chairman|vice chair man | ||
vice chairperson|vice chair|vice chair person | ||
vice chairwoman|vice chair woman | ||
vice president|vice pres | ||
vice prime minister|vice pm |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
beach|bch | ||
fort|ft | ||
mount|mt | ||
court|ct|crt | ||
square|sqr|sqre|squ|sq |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,20 +1 @@ | ||
creek|cr | ||
crossway|crwy | ||
fairway|fy | ||
flat|fl | ||
lane|ln | ||
mill|ml | ||
park|pk | ||
place|pl | ||
port|prt | ||
quadrant|qd | ||
route|rt | ||
terrasse|tr | ||
turn|tn | ||
boulevarde|bl | ||
bridge|br | ||
brook|brk | ||
center|cntr | ||
crescent|cr | ||
passage|pass | ||
pike|pk | ||
concourse|conc |
Oops, something went wrong.