What is the format or encoding of a file with data like this?
5. EF BF BD is the REPLACEMENT CHARACTER encoded in UTF-8. It likely means that this data was in some format other than UTF-8 (say ISO-8859-1), but was parsed at some point by a UTF-8 system that replaced the illegal bytes with REPLACEMENT CHARACTER. Without more background on how you came to have this file, it's hard to …