Skip to content

Explanation

These pages exist to give you a mental model. They are not tutorials and they are not recipes — there are no commands to run, and nothing here is load-bearing for getting a working OCR pipeline up. Read them when you want to understand why the SDK is shaped the way it is.

Topics

  • Layout and reading order — what a "block" is, how reading_order differs from layout, and why the Markdown renderer leans on blocks rather than raw OCR items.
  • Searchable PDF internals — the invisible-text-overlay technique, why it preserves the original page, and where the font requirement comes from.
  • HTTP vs gRPC — when each transport is the right pick, and the proto3 bool-presence caveat that subtly affects gRPC defaults.

Where to go next