refine.page

Fast annotation. Stable snapshots. Clean data.

Capture and annotate web page snapshots for labeling and review. Create high-quality datasets with complete, self-contained page captures.

[Screenshot: refine.page extension showing the annotation interface]

Features

High-Fidelity Snapshots

Web pages change. Screenshots and PDFs lose structure. refine.page captures complete pages as self-contained files, preserving styles, images, and layout exactly as they appeared.

Frozen Snapshots

All interactive elements (links, forms, scripts) are disabled. The snapshot stays exactly as captured; nothing drifts.

Text Annotations

Highlight and label text content with different annotation types:

  • Relevant – Mark content relevant to a query
  • Answer – Mark content containing the answer
  • No Content – Mark when no relevant content exists

Q&A Labeling

Create question-answer pairs for each snapshot. Add your query, record the expected answer, and link it to highlighted content: text, elements, or image regions.

Evaluation Metrics

Rate each Q&A pair on answer correctness, whether the answer exists in the page, and the quality of the page content.

Review Workflow

Approve or decline snapshots with optional review notes. Streamline your labeling process with built-in review tools.

Export/Import

Back up and restore your labeled data as JSON files. Your data is always portable and under your control.
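
As a rough illustration of what a JSON round trip for labeled data could look like, here is a minimal TypeScript sketch. Every name in it (SnapshotRecord, QAPair, Annotation, exportRecords, importRecords, the version field) is a hypothetical stand-in chosen for this example; the actual refine.page export schema is not documented here and may differ.

```typescript
// Hypothetical shapes for illustration only; not refine.page's actual schema.
type AnnotationType = "relevant" | "answer" | "no_content";

interface Annotation {
  type: AnnotationType;
  text: string; // highlighted text; empty for "no_content"
}

interface QAPair {
  query: string;
  expectedAnswer: string;
  annotations: Annotation[];
}

interface SnapshotRecord {
  url: string;
  capturedAt: string; // ISO 8601 timestamp
  review: "pending" | "approved" | "declined";
  qaPairs: QAPair[];
}

// Serialize records to a JSON string suitable for saving as a backup file.
function exportRecords(records: SnapshotRecord[]): string {
  return JSON.stringify({ version: 1, records }, null, 2);
}

// Parse a backup and run a minimal structural check before restoring it.
function importRecords(json: string): SnapshotRecord[] {
  const parsed: unknown = JSON.parse(json);
  if (
    typeof parsed !== "object" ||
    parsed === null ||
    !Array.isArray((parsed as { records?: unknown }).records)
  ) {
    throw new Error("Not a valid backup: missing 'records' array");
  }
  return (parsed as { records: SnapshotRecord[] }).records;
}
```

Calling importRecords(exportRecords(records)) in this sketch returns records equal to the originals, which is the property a backup-and-restore cycle needs. A real importer would likely validate each field as well, not just the top-level array.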

Local Storage

All data is stored locally using the Chrome storage API. Your annotations stay private and are fully exportable.

Use Cases

  • Building question-answering datasets from real web content
  • Training data for retrieval and RAG systems
  • Structured content review and QA workflows
  • Archiving web pages with traceable annotations

Get Started

refine.page is currently available for Chrome. Firefox support is coming soon.

Add to Chrome