: <tab> : <tab> PU <tab> O
<unprintable char> <tab> <unprintable char> <tab> OD <tab> O
L <tab> L <tab> M <tab> O
GeniaNPParser fails on the line with the unprintable characters, as the split() method only finds 2 tokens, not 4. I have made changes to my local codebase to simply skip any line that contains less than 4 tokens, and log a message to the console consisting of the actual line, line number and file name. With this info, you can find the offending lines and determine if it is worth fixing.
I'm happy to contribute my changes, or you could contribute your own changes that give the same result.