java.lang.Object
com.azure.ai.formrecognizer.documentanalysis.models.DocumentPage

public final class DocumentPage extends Object
Content and layout elements extracted from a single page of the input document.
  • Constructor Details

    • DocumentPage

      public DocumentPage()
  • Method Details

    • getPageNumber

      public int getPageNumber()
      Get the 1-based page number in the input document.
      Returns:
      the pageNumber value.
    • getAngle

      public Float getAngle()
      Get the general orientation of the content in the clockwise direction, measured in degrees in the range (-180, 180].
      Returns:
      the angle value.
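
The range (-180, 180] excludes -180 and includes 180. The following is a minimal sketch of how a caller might fold an arbitrary angle into that range and derive the rotation that would bring the content upright. `PageAngles`, `normalize`, and `deskewCorrection` are hypothetical helpers and not part of the SDK. Treating a null angle as 0 is an assumption made for illustration.

```java
final class PageAngles {
    private PageAngles() { }

    /** Folds any angle in degrees into the half-open range (-180, 180]. */
    public static float normalize(float degrees) {
        // Java's % keeps the sign of the dividend, so r lies in (-360, 360).
        float r = degrees % 360f;
        if (r > 180f) {
            r -= 360f;
        } else if (r <= -180f) {
            r += 360f;
        }
        return r;
    }

    /**
     * Clockwise rotation, in degrees, that would undo the reported orientation.
     * A null angle is treated as 0 (assumption, not documented behavior).
     */
    public static float deskewCorrection(Float angle) {
        float a = (angle == null) ? 0f : angle;
        return normalize(-a);
    }

    public static void main(String[] args) {
        System.out.println(normalize(270f));
        System.out.println(deskewCorrection(90f));
    }
}
```

In real code the argument to `deskewCorrection` would be the value returned by `getAngle()`.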
    • getWidth

      public Float getWidth()
      Get the width of the page, in pixels for an image or in inches for a PDF.
      Returns:
      the width value.
    • getHeight

      public Float getHeight()
      Get the height of the page, in pixels for an image or in inches for a PDF.
      Returns:
      the height value.
    • getUnit

      public DocumentPageLengthUnit getUnit()
      Get the unit used by the width, height, and boundingBox properties. For images, the unit is "pixel". For PDF, the unit is "inch".
      Returns:
      the unit value.
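
Because the width and height are reported in different units for images and PDFs, callers that draw overlays often convert both to pixels first. The following is a minimal sketch of that conversion. `PageSize`, `LengthUnit`, and `toPixels` are hypothetical names, `LengthUnit` is a local stand-in for `DocumentPageLengthUnit`, and the DPI is a rendering resolution chosen by the caller, not a value supplied by the service.

```java
final class PageSize {
    private PageSize() { }

    /** Hypothetical stand-in for the SDK's DocumentPageLengthUnit. */
    enum LengthUnit { PIXEL, INCH }

    /**
     * Converts a page dimension to pixels.
     * dpi is a caller-chosen rendering resolution (assumption).
     */
    public static float toPixels(float value, LengthUnit unit, float dpi) {
        switch (unit) {
            case PIXEL:
                return value;          // already in pixels
            case INCH:
                return value * dpi;    // inches times pixels-per-inch
            default:
                throw new IllegalArgumentException("Unknown unit: " + unit);
        }
    }

    public static void main(String[] args) {
        // A US Letter PDF page (8.5 x 11 inches) rendered at 96 DPI.
        System.out.println(toPixels(8.5f, LengthUnit.INCH, 96f));
        System.out.println(toPixels(11f, LengthUnit.INCH, 96f));
    }
}
```

In real code the `value` and `unit` arguments would come from `getWidth()` or `getHeight()` and from `getUnit()`.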
    • getSpans

      public List<DocumentSpan> getSpans()
      Get the location of the page in the reading-order concatenated content.
      Returns:
      the spans value.
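
A span locates a slice of the concatenated content. The following is a minimal sketch of recovering a page's text from its spans, assuming each span carries an offset and a length into the full content string. `PageText`, `Span`, and `textOf` are hypothetical names, and `Span` is a local stand-in for `DocumentSpan`.

```java
import java.util.Arrays;
import java.util.List;

final class PageText {
    private PageText() { }

    /** Hypothetical stand-in for DocumentSpan: an offset and a length into the full content. */
    static final class Span {
        final int offset;
        final int length;

        Span(int offset, int length) {
            this.offset = offset;
            this.length = length;
        }
    }

    /** Concatenates the slices of content covered by the given spans, in list order. */
    public static String textOf(String content, List<Span> spans) {
        StringBuilder sb = new StringBuilder();
        for (Span s : spans) {
            // append(CharSequence, start, end) copies content[start, end).
            sb.append(content, s.offset, s.offset + s.length);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Two pages of illustrative content joined by a newline.
        String content = "Page one text\nPage two text";
        List<Span> pageTwo = Arrays.asList(new Span(14, 13));
        System.out.println(textOf(content, pageTwo));
    }
}
```

In real code the spans would come from `getSpans()` and the content string from the overall analysis result.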
    • getWords

      public List<DocumentWord> getWords()
      Get the extracted words from the page.
      Returns:
      the words value.
    • getSelectionMarks

      public List<DocumentSelectionMark> getSelectionMarks()
      Get the extracted selection marks from the page.
      Returns:
      the selectionMarks value.
    • getLines

      public List<DocumentLine> getLines()
      Get the extracted lines from the page, potentially containing both textual and visual elements.
      Returns:
      the lines value.
    • getBarcodes

      public List<DocumentBarcode> getBarcodes()
      Get the extracted barcodes from the page.
      Returns:
      the barcodes value.
    • getFormulas

      public List<DocumentFormula> getFormulas()
      Get the extracted formulas from the page.
      Returns:
      the formulas value.