Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add NFC and NFD normalization options (keeping NFKC as the default) #313

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

nickjwhite
Copy link

@nickjwhite nickjwhite commented Aug 31, 2018

I'm proposing this as NFC/NFD normalization is definitely useful
for some models, and this allows users to load models which use
one of these normalizations. #257 (the proposal to use NFC by
default) didn't get enough traction to merge, but this at least allows
those of us who benefit from alternative normalization to distribute
our models to users, without having to ask them to apply a patch.

While NFKC is kept as the default, this gives the option to use NFC
and NFD normalization options. These can't be used directly, but
allow a model that has been trained with an alternative
normalization to be loaded and used. Without this patch, such a
model will throw an error when unpickling such a model.

Such a model can be built using different normalization= parameters,
as for example in #257.

While NFKC is kept as the default, this gives the option to use NFC
and NFD normalization options. These can't be used directly, but
allow a model that has been trained with an alternative
normalization to be loaded and used. Without this patch, such a
model will throw an error when unpickling such a model.

Such a model can be built using different normalization= parameters,
as for example in PR#257.
@kba
Copy link
Collaborator

kba commented Aug 31, 2018

Since this doesn't change the behavior, I see no reason not to merge this.

Sorry about #257 being stalled, maybe @tmbdev and/or @zuphilip can chime in to make sure we don't break anything.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants