Make Sorting Office smarter with spelling #90

Open
giacecco opened this issue Mar 5, 2015 · 2 comments

@giacecco
Contributor

giacecco commented Mar 5, 2015

Today Sorting Office will recognise "St. Ives" as a town, but not "St Ives", without the dot. I believe we can be smarter than that! :-)

I suggest we do the following:

  • define a series of spelling transformations on top of the ones we apply already, e.g. we're case-insensitive at the moment, but we could also ignore dots, hyphens and punctuation in general when searching for matches in the reference tables, and then
  • implement the above in Sorting Office
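
As a rough illustration of the first step, here is a minimal sketch in Python of a normalisation key that ignores case and punctuation. The names `normalise`, `find_town`, `TOWNS` and `TOWN_INDEX` are hypothetical, and this is not how Sorting Office is implemented; it only shows the idea of comparing normalised forms.

```python
import string

# Hypothetical translation table: maps every ASCII punctuation character to a space.
_PUNCT_TO_SPACE = str.maketrans({ch: " " for ch in string.punctuation})


def normalise(name: str) -> str:
    """Return a comparison key: lower-case, punctuation-free, single-spaced."""
    # Case-insensitive comparison, as Sorting Office already does.
    key = name.casefold()
    # Treat dots, hyphens and other punctuation as word separators.
    key = key.translate(_PUNCT_TO_SPACE)
    # Collapse the runs of whitespace left behind by the step above.
    return " ".join(key.split())


# Hypothetical reference table, indexed by the normalised form of each name.
TOWNS = ["St. Ives", "Stratford-upon-Avon"]
TOWN_INDEX = {normalise(town): town for town in TOWNS}


def find_town(text: str):
    """Look up a town by its normalised key; returns None when nothing matches."""
    return TOWN_INDEX.get(normalise(text))
```

With this sketch, `find_town("St Ives")` and `find_town("St. Ives")` resolve to the same reference entry. Replacing punctuation with a space rather than deleting it is a design choice: it lets "Stratford upon Avon" match the hyphenated form, but apostrophes may be better deleted outright.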

@MurrayData can help us avoid falling into any traps while doing this. @pezholio what do you reckon?

See also OpenAddressesUK/forum#2.

@pezholio
Member

pezholio commented Mar 9, 2015

We already handle this for streets; this would probably need some Elasticsearch configuration to ignore punctuation.
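
For illustration, one possible shape for such a configuration is a custom analyzer with a `pattern_replace` character filter, written here as a Python dict that could be serialised to JSON and sent as index settings. The names `strip_punctuation` and `town_name` are hypothetical, and whether this matches how Sorting Office indexes towns is an assumption.

```python
import json

# Hypothetical names: "strip_punctuation" and "town_name" are illustrative only,
# not taken from the Sorting Office code base.
INDEX_SETTINGS = {
    "settings": {
        "analysis": {
            "char_filter": {
                "strip_punctuation": {
                    # Built-in Elasticsearch character filter based on a Java regex.
                    "type": "pattern_replace",
                    "pattern": "\\p{Punct}",
                    "replacement": " ",
                }
            },
            "analyzer": {
                "town_name": {
                    "type": "custom",
                    # Replace punctuation with spaces before tokenising.
                    "char_filter": ["strip_punctuation"],
                    "tokenizer": "standard",
                    # Lower-case tokens so matching stays case-insensitive.
                    "filter": ["lowercase"],
                }
            },
        }
    }
}

if __name__ == "__main__":
    # Print the JSON body that would accompany an index-creation request.
    print(json.dumps(INDEX_SETTINGS, indent=2))
```

The analyzer would also need to be assigned to the relevant field in the index mapping, which is not shown here.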

@giacecco
Contributor Author

I believe we need to look at the problem independently of the implementation. There are many other cases we need to manage, such as abbreviations in general, e.g. see OpenAddressesUK/forum#2. If we find out that Elasticsearch is not good enough, we'll have to think of something else.
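
To make the abbreviation case concrete independently of any search engine, here is a minimal sketch in Python that expands a name into candidate spellings. The `ABBREVIATIONS` table and the `variants` function are hypothetical; a real table would need curating, since "St" can stand for either "Saint" or "Street".

```python
from itertools import product

# Hypothetical abbreviation table; each key maps to every spelling worth trying.
ABBREVIATIONS = {
    "st": ["st", "saint", "street"],
    "rd": ["rd", "road"],
}


def variants(text: str) -> set:
    """Return every candidate spelling obtained by expanding known abbreviations."""
    # Lower-case the input and treat dots as separators, so "St." becomes "st".
    words = text.casefold().replace(".", " ").split()
    # For each word, list its possible expansions (or just the word itself).
    options = [ABBREVIATIONS.get(word, [word]) for word in words]
    # Combine one option per word into full candidate strings.
    return {" ".join(combo) for combo in product(*options)}
```

Each candidate could then be checked against the reference tables. The number of candidates grows multiplicatively with the number of abbreviations in the input, so the table should stay small.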
