pdfSetTextEncoding
Set the encoding to use for text extraction.
void pdfSetTextEncoding(char *encodingName)
This function sets the encoding to use for text extraction.
The default encoding is "Latin1" (ISO-8859-1). The other built-in text encodings are "ASCII7" (7-bit ASCII), "UTF-8" (Unicode in UTF-8 format), and "UCS-2" (2-byte Unicode). Additional encodings can be defined via the config file.
The text encoding is a global setting: it affects text extraction from all XpdfViewer objects.
C:
pdfSetTextEncoding("UTF-8");
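Because the setting is global and is selected by name, it may help to check a name against the four built-in encodings listed above before passing it on. The following is a minimal sketch, not part of the library: isBuiltinTextEncoding, setBuiltinTextEncoding, and lastEncoding are hypothetical names, the stub pdfSetTextEncoding stands in for the real function so the sketch compiles on its own, and case-sensitive name matching is an assumption.

```c
#include <stddef.h>
#include <string.h>

/* Stand-in so this sketch compiles without the library. In a real
   program, remove this stub and link against the library instead. */
static const char *lastEncoding = NULL;   /* hypothetical, for illustration */
void pdfSetTextEncoding(char *encodingName) {
  lastEncoding = encodingName;
}

/* Hypothetical helper: returns 1 if name is one of the four built-in
   encodings named in this section, 0 otherwise.
   Assumption: names are matched case-sensitively. */
static int isBuiltinTextEncoding(const char *name) {
  static const char *builtins[] = { "Latin1", "ASCII7", "UTF-8", "UCS-2" };
  size_t i;

  if (name == NULL) {
    return 0;
  }
  for (i = 0; i < sizeof(builtins) / sizeof(builtins[0]); ++i) {
    if (strcmp(name, builtins[i]) == 0) {
      return 1;
    }
  }
  return 0;
}

/* Hypothetical helper: sets the global text encoding only when the name
   is a built-in one. Returns 1 if the call was made, 0 if it was skipped. */
static int setBuiltinTextEncoding(char *name) {
  if (!isBuiltinTextEncoding(name)) {
    return 0;
  }
  pdfSetTextEncoding(name);
  return 1;
}
```

A check like this would also reject encodings defined via the config file, so it only suits programs that rely on the built-in set. Since the encoding applies to every XpdfViewer object, calling it once at startup is usually enough.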