Click or drag to resize

TextExtractorWord Class

TextExtractorWord object represents a word on a PDF page. Each word contains a sequence of characters in one or more styles (see TextExtractor.Style).
Inheritance Hierarchy
SystemObject
  pdftron.PDFTextExtractorWord

Namespace:  pdftron.PDF
Assembly:  pdftron (in pdftron.dll) Version: 255.255.255.255
Syntax
public sealed class TextExtractorWord : IClosable

The TextExtractorWord type exposes the following members.

Constructors
  NameDescription
Public methodTextExtractorWord
Initializes a new instance of the TextExtractorWord class
Top
Methods
  NameDescription
Public methodClose
Public methodEquals
Public methodGetBBox
Gets the b box.
Public methodGetCharStyle
return The quadrilateral representing a tight bounding box for this word (in unrotated page coordinates).
Public methodGetCurrentNum
Gets the index of this word of the current line. A word that starts the line will return 0, whereas the last word in the line will return (line.GetNumWords()-1).
Public methodGetGlyphQuads
Gets the quadpoint of the glyph at index
Public methodGetHashCode
Serves as a hash function for a particular type.
(Inherited from Object.)
Public methodGetNextWord
Gets the next object
Public methodGetNumGlyphs
Gets the num glyphs.
Public methodGetString
Gets Unicode string
Public methodGetStringLen
Gets the number of Chars in the string.
Public methodGetStyle
Gets predominant style for this word.
Public methodGetType
Gets the Type of the current instance.
(Inherited from Object.)
Public methodIsValid
Checks if valid word
Public methodSet
Sets value to given Word object
Public methodToString
Returns a string that represents the current object.
(Inherited from Object.)
Top
See Also