pdfGetWordLength

Get the Unicode length of a word.
int pdfGetWordLength(PDFWordHandle word)
This function returns the length of a word (whose handle was obtained with pdfGetWord), i.e., the number of Unicode characters in the word.

The Unicode length is useful when calling pdfGetWordCharBox.

(The length returned by pdfGetWordLength will generally be different from the value obtained with pdfGetWordText, which converts the word to the requested output encoding.)

C:
PDFWordHandle word; char *text; int length; pdfBuildWordList(pdf, 1); for (i = 0; i < pdfGetNumWords(pdf); ++i) { word = pdfGetWord(pdf, i); for (j = 0; j < pdfGetWordLength(word); ++j) { pdfGetWordCharBox(word, i, &xMin, &yMin, &xMax, &yMax); ... } }
pdfBuildWordList
pdfGetNumWords
pdfGetWord
pdfGetWordText
pdfGetWordFontName
pdfGetWordColor
pdfGetWordBox
pdfGetWordBox2
pdfGetWordCharBox
pdfGetWordCharBox2
pdfGetWordSpaceAfter
pdfGetWordFontSize
pdfGetWordFontIsFixedWidth
pdfGetWordFontIsSerif
pdfGetWordFontIsSymbolic
pdfGetWordFontIsItalic
pdfGetWordFontIsBold
pdfGetWordRotation
pdfGetWordCharPos
pdfGetWordCharLen