Normalize character set #279

sheo0147 · 2016-12-14T02:33:33Z

General Issue.

Vuls imports text data from databases such as NVD and JVN.
However, the Vuls code does not normalize the character set before parsing that data or storing it in the DB.

Normalize the character set from any encoding to Unicode when importing text data.
Normalize the character set to Unicode (or make it selectable, with Unicode as the default) for outputs such as Mail and Slack.
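
To illustrate the import-side request, here is a minimal sketch in Go using only the standard library. `normalizeUTF8` is a hypothetical helper, not an existing Vuls function. It only guarantees that the result is valid UTF-8; actually converting a legacy encoding (for example Shift_JIS) would need a real decoder, which this sketch does not attempt.

```go
package main

import (
	"fmt"
	"strings"
	"unicode/utf8"
)

// normalizeUTF8 is a hypothetical helper (not part of Vuls).
// It returns text unchanged when it is already valid UTF-8 and otherwise
// replaces each run of invalid bytes with U+FFFD, so that later parsing
// and DB storage only ever see valid UTF-8.
// Assumption: the input is mostly UTF-8; no legacy charset is decoded here.
func normalizeUTF8(text string) string {
	if utf8.ValidString(text) {
		return text
	}
	return strings.ToValidUTF8(text, "\uFFFD")
}

func main() {
	// Valid UTF-8, including multibyte characters, passes through untouched.
	fmt.Println(normalizeUTF8("脆弱性 summary"))
	// A stray invalid byte is replaced instead of being stored as-is.
	fmt.Printf("%q\n", normalizeUTF8("bad\xffbyte"))
}
```

Calling such a helper at the point where text enters the program would keep the rest of the code free of charset checks.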

hogehuga · 2016-12-14T03:37:38Z

I think the default character encoding is UTF-8.

As for input, it seems it can be dealt with as described below.

As for output, the data from go-cve-dictionary should either already be UTF-8 or be converted to UTF-8.

The mail header requires a charset, because mail software sometimes cannot identify the character encoding.
The other report options seem to use UTF-8.

I think that if the input character encoding is unified to UTF-8, the output character encoding will be UTF-8 by default.
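
As a rough illustration of declaring the charset in mail headers, the following Go sketch builds headers with the standard `mime` package. `buildMailHeaders` is a hypothetical function written for this example; it is not the Vuls mail code and not necessarily what any pull request changes.

```go
package main

import (
	"fmt"
	"mime"
)

// buildMailHeaders is a hypothetical helper (not the Vuls implementation).
// It declares UTF-8 explicitly so that mail software does not have to guess
// the character encoding of the body or the subject.
func buildMailHeaders(subject string) map[string]string {
	return map[string]string{
		// Non-ASCII subjects become RFC 2047 encoded-words; plain ASCII is left as-is.
		"Subject":      mime.QEncoding.Encode("utf-8", subject),
		"MIME-Version": "1.0",
		// Yields "text/plain; charset=utf-8".
		"Content-Type": mime.FormatMediaType("text/plain", map[string]string{"charset": "utf-8"}),
	}
}

func main() {
	headers := buildMailHeaders("脆弱性 report")
	for _, key := range []string{"Subject", "MIME-Version", "Content-Type"} {
		fmt.Printf("%s: %s\n", key, headers[key])
	}
}
```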

hogehuga · 2016-12-14T05:47:10Z

I sent pull request #280. It fixes the Content-Type header issue.
What remains is for go-cve-dictionary to output UTF-8 when it handles data from other multibyte-language countries.

kotakanbe closed this as completed Jan 13, 2017

kotakanbe added the bug label Feb 14, 2017
