Capture and annotate web page snapshots for labeling and review. Create high-quality datasets with complete, self-contained page captures.
Features
High-Fidelity Snapshots
Web pages change. Screenshots and PDFs lose structure. refine.page captures complete pages as self-contained files: preserving styles, images, and layout exactly as they appeared.
Frozen Snapshots
All interactive elements (links, forms, scripts) are disabled. The snapshot stays exactly as captured, nothing drifts.
Text Annotations
Highlight and label text content with different annotation types:
- Relevant – Mark content relevant to a query
- Answer – Mark content containing the answer
- No Content – Mark when no relevant content exists
Q&A Labeling
Create question-answer pairs for each snapshot. Add your query, record the expected answer, and link it to highlighted content—text, elements, or image regions.
Evaluation Metrics
Rate each Q&A pair with answer correctness, whether the answer exists in the page, and page content quality assessments.
Review Workflow
Approve or decline snapshots with optional review notes. Streamline your labeling process with built-in review tools.
Export/Import
Backup and restore your labeled data as JSON files. Your data is always portable and under your control.
Local Storage
All data stored locally using Chrome storage API. Your annotations stay private and are fully exportable.
Use Cases
- Building question-answering datasets from real web content
- Training data for retrieval and RAG systems
- Structured content review and QA workflows
- Archiving web pages with traceable annotations
Get Started
refine.page is currently available for Chrome. Firefox support is coming soon.
Add to Chrome