com.groupdocs.parser.options

Class PageTextAreaOptions

    • Constructor Detail

      • PageTextAreaOptions

        public PageTextAreaOptions(boolean useOcr)
        Initializes a new instance of the PageTextAreaOptions class with the OCR usage option.
        Parameters:
        useOcr - The value that indicates whether OCR functionality is used to extract text.
      • PageTextAreaOptions

        public PageTextAreaOptions(boolean useOcr,
                                   OcrOptions ocrOptions)
        Initializes a new instance of the PageTextAreaOptions class with the OCR usage option and additional OCR options.
        Parameters:
        useOcr - The value that indicates whether OCR functionality is used to extract text.
        ocrOptions - The additional options for OCR functionality.
      • PageTextAreaOptions

        public PageTextAreaOptions(String expression)
        Initializes a new instance of the PageTextAreaOptions class with the regular expression. All other options take their default values.

        The following properties have default values:

        • MatchCase: false
        • UniteSegments: false
        • IgnoreFormatting: false
        • Rectangle: null
        Parameters:
        expression - The regular expression.
      • PageTextAreaOptions

        public PageTextAreaOptions(String expression,
                                   Rectangle rectangle)
        Initializes a new instance of the PageTextAreaOptions class with the regular expression and rectangular area. All other options take their default values.

        The following properties have default values:

        • MatchCase: false
        • UniteSegments: false
        • IgnoreFormatting: false
        Parameters:
        expression - The regular expression.
        rectangle - The rectangular area that contains page areas.
      • PageTextAreaOptions

        public PageTextAreaOptions(String expression,
                                   boolean matchCase,
                                   boolean uniteSegments,
                                   boolean ignoreFormatting,
                                   Rectangle rectangle)
        Initializes a new instance of the PageTextAreaOptions class.
        Parameters:
        expression - The regular expression.
        matchCase - The value that indicates whether matching is case-sensitive.
        uniteSegments - The value that indicates whether segments are united.
        ignoreFormatting - The value that indicates whether text formatting is ignored.
        rectangle - The rectangular area that contains page areas.
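
The defaults documented for the shorter constructors can be read as each one delegating to the five-argument form. The sketch below illustrates that reading with a stand-in class; `PageTextAreaOptionsSketch` and its nested `Area` type are hypothetical names introduced here so the example compiles without the library, and the delegation itself is an assumption, not a description of the library's actual implementation.

```java
// Hypothetical stand-in: NOT the GroupDocs class. It mirrors only the
// constructor defaults listed in the documentation above.
public class PageTextAreaOptionsSketch {

    /** Hypothetical placeholder for the library's Rectangle type. */
    public static final class Area {
        final double left, top, width, height;

        public Area(double left, double top, double width, double height) {
            this.left = left;
            this.top = top;
            this.width = width;
            this.height = height;
        }
    }

    private final String expression;
    private final boolean matchCase;
    private final boolean uniteSegments;
    private final boolean ignoreFormatting;
    private final Area rectangle;

    // Expression only: Rectangle defaults to null.
    public PageTextAreaOptionsSketch(String expression) {
        this(expression, null);
    }

    // Expression and area: MatchCase, UniteSegments, IgnoreFormatting default to false.
    public PageTextAreaOptionsSketch(String expression, Area rectangle) {
        this(expression, false, false, false, rectangle);
    }

    // Full form: every option is set explicitly.
    public PageTextAreaOptionsSketch(String expression, boolean matchCase,
                                     boolean uniteSegments, boolean ignoreFormatting,
                                     Area rectangle) {
        this.expression = expression;
        this.matchCase = matchCase;
        this.uniteSegments = uniteSegments;
        this.ignoreFormatting = ignoreFormatting;
        this.rectangle = rectangle;
    }

    public String getExpression() { return expression; }
    public boolean isMatchCase() { return matchCase; }
    public boolean isUniteSegments() { return uniteSegments; }
    public boolean isIgnoreFormatting() { return ignoreFormatting; }
    public Area getRectangle() { return rectangle; }

    public static void main(String[] args) {
        PageTextAreaOptionsSketch options = new PageTextAreaOptionsSketch("\\d+");
        System.out.println(options.isMatchCase());        // false
        System.out.println(options.isUniteSegments());    // false
        System.out.println(options.isIgnoreFormatting()); // false
        System.out.println(options.getRectangle());       // null
    }
}
```

With the library on the classpath, the equivalent calls would use the documented signatures directly, for example `new PageTextAreaOptions("\\d+")` for the expression-only form.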
    • Method Detail

      • getExpression

        public String getExpression()
        Gets the regular expression.
        Returns:
        A string that represents the regular expression.
      • isMatchCase

        public boolean isMatchCase()
        Gets the value that indicates whether matching is case-sensitive.
        Returns:
        true if matching is case-sensitive; otherwise, false.
      • isUniteSegments

        public boolean isUniteSegments()
        Gets the value that indicates whether segments are united.
        Returns:
        true if segments are united; otherwise, false.
      • isIgnoreFormatting

        public boolean isIgnoreFormatting()
        Gets the value that indicates whether text formatting is ignored.
        Returns:
        true if text formatting is ignored; otherwise, false.
      • isUseOcr

        public boolean isUseOcr()
        Gets the value that indicates whether OCR functionality is used to extract text.
        Returns:
        true if the OCR functionality is used; otherwise, false.
      • getOcrOptions

        public OcrOptions getOcrOptions()
        Gets the additional options for OCR functionality.
        Returns:
        An instance of the OcrOptions class with the additional OCR options.
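
As a rough illustration of what the matchCase flag described above controls, the sketch below uses the standard `java.util.regex` package. It assumes that matchCase set to false corresponds to case-insensitive matching; the library's own regular expression handling may differ. The `MatchCaseSketch` class and its `matches` helper are hypothetical and are not part of the library.

```java
import java.util.regex.Pattern;

public class MatchCaseSketch {

    // Hypothetical helper: searches text for the expression, ignoring case
    // unless matchCase is true (assumed meaning of the documented flag).
    static boolean matches(String expression, boolean matchCase, String text) {
        int flags = matchCase ? 0 : Pattern.CASE_INSENSITIVE;
        return Pattern.compile(expression, flags).matcher(text).find();
    }

    public static void main(String[] args) {
        System.out.println(matches("invoice", false, "INVOICE 42")); // true
        System.out.println(matches("invoice", true, "INVOICE 42"));  // false
    }
}
```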