====== BOM (byte order mark) ======
===== About =====
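The byte order mark (BOM) is a magic number (header): the Unicode character U+FEFF BYTE ORDER MARK, encoded at the very start of a text stream.
It is not part of the visible text; it shows up as a short byte sequence at the beginning of the file (for example EF BB BF in UTF-8, FE FF in big-endian UTF-16, FF FE in little-endian UTF-16).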
It can be found at the start of a text file and indicates:
* the byte order, or endianness, of the text file;
* the fact that the text stream's encoding is Unicode, to a high level of confidence;
* which Unicode encoding the text stream uses.
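
The three points above can be sketched as a small detection routine. This is a minimal Python illustration, not a reference implementation: the function name ''detect_bom'' and the table ''BOMS'' are hypothetical names chosen for this example, while the byte sequences come from the standard ''codecs'' module.

```python
import codecs

# Known BOM byte sequences, longest first, so that UTF-32 LE (FF FE 00 00)
# is not mistaken for UTF-16 LE (FF FE).
BOMS = [
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
]


def detect_bom(data: bytes):
    """Return (encoding, bom_length), or (None, 0) if no BOM is present."""
    for bom, encoding in BOMS:
        if data.startswith(bom):
            return encoding, len(bom)
    return None, 0


if __name__ == "__main__":
    sample = codecs.BOM_UTF16_LE + "hi".encode("utf-16-le")
    encoding, bom_length = detect_bom(sample)
    # Decode the payload after skipping the BOM bytes.
    print(encoding, sample[bom_length:].decode(encoding))
```

The result is only a strong hint. A UTF-16 LE file whose first character after the BOM is U+0000 starts with the same four bytes as a UTF-32 LE BOM, and a file without a BOM may still be Unicode.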
===== Library =====

<note important>If you pass a BOM-aware character set to a file reader, you usually don't need to handle the BOM yourself.</note>

* https://github.com/gpakosz/UnicodeBOMInputStream

===== Documentation / Reference =====

* Byte_order_mark
* BOM in Apache Commons CSV
* CSVFormat.EXCEL.parse should handle byte order marks
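
As an illustration of the Library note about passing a character set to a reader, here is a hedged Python sketch (Python is an assumption; the library linked above targets Java). It compares the standard ''utf-8'' codec, which leaves the BOM in the decoded text, with the standard ''utf-8-sig'' codec, which consumes a leading BOM. The sample bytes and variable names are made up for the example.

```python
import io

# A UTF-8 BOM followed by two lines of CSV text (made-up sample data).
raw = b"\xef\xbb\xbfid,name\n1,Ada\n"

# Plain "utf-8" keeps the BOM as U+FEFF at the start of the decoded text.
plain = io.TextIOWrapper(io.BytesIO(raw), encoding="utf-8", newline="").read()

# "utf-8-sig" consumes a leading BOM if one is present.
stripped = io.TextIOWrapper(io.BytesIO(raw), encoding="utf-8-sig", newline="").read()

first_header_plain = plain.split(",")[0]        # starts with U+FEFF
first_header_stripped = stripped.split(",")[0]  # just "id"
```

With the plain codec the first header field carries an invisible U+FEFF prefix, which is the kind of mismatch that typically surfaces when a CSV parser looks up a column by name.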