hotpdf.memory_map.MemoryMap

class hotpdf.memory_map.MemoryMap
__init__() → None

Initialize the MemoryMap, a 2D matrix representation of a PDF page.

Parameters:
  • width (int) – The width of a page.
  • height (int) – The height of a page.
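
To make the "2D matrix" idea concrete, here is a minimal sketch of a page stored as a grid of character cells. It is an illustration only, not hotpdf's actual implementation: the helper names `build_grid` and `place_char`, the use of nested lists, and the row-major `grid[y][x]` layout are all assumptions made for this example.

```python
from typing import List

Grid = List[List[str]]


def build_grid(width: int, height: int) -> Grid:
    """Return a height x width matrix of empty cells (hypothetical helper).

    Rows correspond to y coordinates and columns to x coordinates.
    """
    # Build each row separately so rows do not share the same list object.
    return [[""] * width for _ in range(height)]


def place_char(grid: Grid, x: int, y: int, char: str) -> None:
    """Store a single character at column x of row y (hypothetical helper)."""
    grid[y][x] = char


# A toy 5 x 3 "page" with the word "PDF" starting at x=1 on row y=2.
grid = build_grid(width=5, height=3)
for offset, char in enumerate("PDF"):
    place_char(grid, x=1 + offset, y=2, char=char)

# Render one row, showing empty cells as spaces.
row_text = "".join(cell or " " for cell in grid[2])
```

A dense nested list keeps the sketch short; a real page with thousands of mostly empty cells would likely call for a sparse structure instead.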

Methods

__init__()

Initialize the MemoryMap.

build_memory_map()

Build the memory map based on width and height.

extract_text_from_bbox(x0, x1, y0, y1)

Extract text within a specified bounding box.

find_text(query[, case_sensitive])

Find text within the memory map.

load_memory_map(page[, ...])

Load memory map data from an XML page.
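
The bounding-box extraction and text search listed above can be sketched on a toy character grid. This is a simplified illustration under stated assumptions, not the library's code: the `toy_` functions are hypothetical, the box bounds are treated as inclusive, and the search returns the `(x, y)` position of each match's first character. Only the argument order `x0, x1, y0, y1` and the `case_sensitive` flag are taken from the signatures in the method list.

```python
from typing import List, Tuple

Grid = List[List[str]]


def make_grid(width: int, height: int, spans: List[Tuple[int, int, str]]) -> Grid:
    """Build a toy grid and write each (x, y, text) span into it, one char per cell."""
    grid = [[""] * width for _ in range(height)]
    for x, y, text in spans:
        for i, ch in enumerate(text):
            grid[y][x + i] = ch
    return grid


def toy_extract_text_from_bbox(grid: Grid, x0: int, x1: int, y0: int, y1: int) -> str:
    """Return the text inside the box, one line per non-empty row.

    Assumption: x0..x1 and y0..y1 are inclusive cell indices.
    """
    lines = []
    for row in grid[y0 : y1 + 1]:
        text = "".join(cell or " " for cell in row[x0 : x1 + 1]).strip()
        if text:
            lines.append(text)
    return "\n".join(lines)


def toy_find_text(grid: Grid, query: str, case_sensitive: bool = True) -> List[Tuple[int, int]]:
    """Return (x, y) of the first character of every occurrence of query."""
    if not query:
        return []
    needle = query if case_sensitive else query.lower()
    hits = []
    for y, row in enumerate(grid):
        line = "".join(cell or " " for cell in row)
        haystack = line if case_sensitive else line.lower()
        start = haystack.find(needle)
        while start != -1:
            hits.append((start, y))
            start = haystack.find(needle, start + 1)
    return hits


grid = make_grid(12, 3, [(0, 0, "Invoice"), (2, 2, "Total 42")])
header = toy_extract_text_from_bbox(grid, 0, 11, 0, 0)
label = toy_extract_text_from_bbox(grid, 2, 6, 2, 2)
matches = toy_find_text(grid, "total", case_sensitive=False)
```

Searching row by row keeps the example simple, but it cannot match text that wraps across lines, and it says nothing about how the library itself indexes text.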