First character of the reading from the text file :  [duplicate]

You are getting the characters  on the first line because this sequence is the UTF-8 byte order mark (BOM). If a text file begins with a BOM, it’s likely it was generated by a Windows program like Notepad.

To solve your problem, we choose to read the file explicitly as UTF-8, instead of whatever default system character encoding (US-ASCII, etc.):

BufferedReader in = new BufferedReader(
    new InputStreamReader(
        new FileInputStream("myFile.txt"),
        "UTF-8"));

Then in UTF-8, the byte sequence  decodes to one character, which is U+FEFF. This character is optional – a legal UTF-8 file may or may not begin with it. So we will skip the first character only if it’s U+FEFF:

in.mark(1);
if (in.read() != 0xFEFF)
  in.reset();

And now you can continue with the rest of your code.

Leave a Comment