PDFAnnotations
Source: AnnotationsFormat: JSON

Export Full Annotation Data to JSON

Get the complete data schema of your document markups for custom software development.

Drop your PDF here

Securely extract highlights and comments in seconds.

Example Output
preview.json
[
{
"content": "Data privacy regulations have evolved...",
"page": 12,
"author": "Reviewer A"
}
]

The Full Data Picture

This is the deep dive. While other formats simplify content, JSON reveals everything.

Advanced Data Structures

We expose the underlying PDF annotation model:

  • Subtypes: Differentiate between Highlight, Underline, StrikeOut, Text (Sticky Note), and FreeText.
  • Relationships: See inReplyTo fields that link simple text replies to their parent sticky notes.
  • Status: Track review states (e.g., 'Accepted', 'Rejected') if the PDF software saved them.

This is the raw material for building Machine Learning models to analyze document sentiment or custom archival systems for enterprises.

Need more formats?

We support Notion, Obsidian, JSON, CSV, and Text exports.

View All Tools

Frequently Asked Questions

Is it a nested structure?

Yes. We preserve parent-child relationships for threaded comments.

Are coordinates provided?

Yes. Full `rect` data is provided for every mark.

What is the date format?

ISO-8601 (e.g., `2023-10-27T10:00:00Z`).