pdfGetWordCharBox
Get the bounding box of a character in a word.
void pdfGetWordCharBox(PDFWordHandle word, int charIdx,
double *xMin, double *yMin, double *xMax, double *yMax)
This function returns the bounding box of the charIdx-th character in a word (whose handle was obtained with pdfGetWord).
The xMin, yMin, xMax, and yMax coordinates are in a coordinate space that places (0,0) at the top-left corner of the page and has 72 units per inch (the same coordinate system used by pdfExtractTextFromRect).
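Because the space has 72 units per inch, converting a returned box to physical dimensions is a single division. The following is a minimal sketch, not part of XpdfText: the CharBox struct and the helper names are hypothetical and exist only for illustration.

```c
/* Hypothetical helper type (not part of XpdfText): holds the four
   values written by pdfGetWordCharBox. Units are 1/72 inch and the
   origin is the top-left corner of the page. */
typedef struct {
    double xMin, yMin, xMax, yMax;
} CharBox;

/* Convert a length in page units to inches (72 units per inch). */
double unitsToInches(double units) {
    return units / 72.0;
}

/* Width of the character box in inches. */
double charBoxWidthInches(const CharBox *b) {
    return unitsToInches(b->xMax - b->xMin);
}

/* Height of the character box in inches. */
double charBoxHeightInches(const CharBox *b) {
    return unitsToInches(b->yMax - b->yMin);
}
```

For example, a box with xMin = 72, xMax = 108, yMin = 36, yMax = 54 is 0.5 inch wide and 0.25 inch tall.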
This function works on Unicode characters (used internally in XpdfText). The pdfGetWordLength function returns the number of Unicode characters.
This function is identical to pdfGetWordCharBox2 except that it returns points in a top-down coordinate space.
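The relationship between a top-down box and a bottom-up one can be sketched as a vertical flip about the page height. This is an illustration under stated assumptions, not the library's implementation: it assumes the other space has its origin at the bottom-left corner of an unrotated page, uses the same 72 units per inch, and has no crop-box offset. The Box type, the function name, and the pageHeight parameter are hypothetical.

```c
/* Hypothetical box type (not part of XpdfText). */
typedef struct {
    double xMin, yMin, xMax, yMax;
} Box;

/* Flip a top-down box (origin at top-left) into a bottom-up box
   (origin assumed at bottom-left). pageHeight is the page height in
   the same 1/72 inch units. The x values are unchanged; the y values
   are mirrored and swapped so that yMin <= yMax still holds. */
Box topDownToBottomUp(Box td, double pageHeight) {
    Box bu;
    bu.xMin = td.xMin;
    bu.xMax = td.xMax;
    bu.yMin = pageHeight - td.yMax;
    bu.yMax = pageHeight - td.yMin;
    return bu;
}
```

On a US Letter page (792 units tall), a top-down box spanning y = 100 to y = 112 maps to y = 680 to y = 692 under these assumptions.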
C:
PDFWordHandle word;
double xMin, yMin, xMax, yMax;
int i, j;

pdfBuildWordList(pdf, 1);
for (i = 0; i < pdfGetNumWords(pdf); ++i) {
  word = pdfGetWord(pdf, i);
  for (j = 0; j < pdfGetWordLength(word); ++j) {
    /* bounding box of character j in word i */
    pdfGetWordCharBox(word, j, &xMin, &yMin, &xMax, &yMax);
    ...
  }
}
pdfGetWordCharBox2
pdfBuildWordList
pdfGetNumWords
pdfGetWord
pdfGetWordText
pdfGetWordLength
pdfGetWordFontName
pdfGetWordColor
pdfGetWordBox
pdfGetWordBox2
pdfGetWordCharBox2
pdfGetWordSpaceAfter
pdfGetWordFontSize
pdfGetWordFontIsFixedWidth
pdfGetWordFontIsSerif
pdfGetWordFontIsSymbolic
pdfGetWordFontIsItalic
pdfGetWordFontIsBold
pdfGetWordRotation
pdfGetWordCharPos
pdfGetWordCharLen