How Stepify Works
Stepify uses AI to control a real web browser, navigate applications, and document every step along the way. Here's what
happens behind the scenes.
The Generation Process
1. Goal Definition
Each tutorial starts with a clear goal — for example, "Add a new team member in NextCloud" or "Create a sales pipeline
in Odoo CRM". The goal tells Stepify what to accomplish.
2. Browser Automation
Stepify launches a real browser (Chromium) with pre-configured authentication for the target platform. It navigates the
application just like a human user would — clicking buttons, filling forms, and waiting for pages to load.
3. Screenshot Capture
At each meaningful action, Stepify captures a screenshot at 1280×800 resolution. These screenshots are processed to:
- Redact any personally identifiable information (PII) like email addresses
- Add visual annotations highlighting where the action occurred
- Generate alt text for accessibility
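The redaction step can be pictured with a minimal sketch. It assumes an earlier pass (OCR or DOM inspection) has already produced text regions with pixel bounding boxes. The `TextRegion` type, the `regions_to_blur` function, and the email pattern are hypothetical names chosen for illustration, not Stepify's actual implementation, and real PII detection would cover more than email addresses.

```python
import re
from dataclasses import dataclass

# Deliberately simple email pattern for illustration; not a full RFC 5322 matcher.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


@dataclass(frozen=True)
class TextRegion:
    """Hypothetical text region found in a screenshot (pixel coordinates)."""
    text: str
    x: int
    y: int
    width: int
    height: int


def regions_to_blur(regions: list[TextRegion]) -> list[TextRegion]:
    """Return the regions whose text contains something that looks like an email."""
    return [r for r in regions if EMAIL_RE.search(r.text)]


# Example: only the second region would be handed to the blurring step.
regions = [
    TextRegion("Invite member", 40, 120, 200, 24),
    TextRegion("jane.doe@example.com", 40, 160, 260, 24),
]
flagged = regions_to_blur(regions)
```

Returning bounding boxes rather than modified pixels keeps detection separate from the image operation, so the same list could drive a blur, a solid mask, or an audit log.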
4. Video Recording
The entire browser session is recorded as a video. After recording, Stepify generates:
- A full-length tutorial video (WebM format)
- A short preview clip for the tutorials listing page
- WebVTT captions describing each step
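As an illustration of the caption output, the sketch below builds a WebVTT file from a list of timed step descriptions. The function names and the `(start, end, description)` tuple shape are assumptions made for this example; how Stepify actually derives step timings is not described here.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as a WebVTT timestamp (hh:mm:ss.ttt)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_webvtt(steps: list[tuple[float, float, str]]) -> str:
    """Build a WebVTT document from (start_seconds, end_seconds, description) tuples."""
    lines = ["WEBVTT", ""]  # required header, then a blank line
    for index, (start, end, text) in enumerate(steps, start=1):
        lines.append(str(index))  # optional cue identifier
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)
        lines.append("")  # a blank line ends the cue
    return "\n".join(lines)


captions = build_webvtt([
    (0.0, 3.5, "Open the Users page"),
    (3.5, 8.0, "Click the New user button"),
])
```

This sketch does not sanitize cue text; descriptions containing blank lines or the sequence `-->` would need to be cleaned before being written to a caption file.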
5. Quality Review
Every generated tutorial goes through an automated quality review that checks:
- Whether steps follow a logical flow
- Whether screenshots match their descriptions
- Whether the goal was actually achieved
- Overall clarity and completeness
Tutorials scoring below the quality threshold are flagged for human review rather than published automatically.
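The publish-or-flag decision can be sketched as a simple gate. Everything numeric here is an assumption: the 0.0 to 1.0 score scale, the use of an unweighted mean, and the threshold value of 0.8 are hypothetical, as are the names `CRITERIA`, `QUALITY_THRESHOLD`, and `review_decision`. Only the four criteria themselves come from the list above.

```python
from statistics import mean

# Criteria named in the review step; the 0.0-1.0 scale is an assumption.
CRITERIA = ("logical_flow", "screenshot_match", "goal_achieved", "clarity")

# Hypothetical threshold; the actual value is not documented here.
QUALITY_THRESHOLD = 0.8


def review_decision(scores: dict[str, float], threshold: float = QUALITY_THRESHOLD) -> str:
    """Return "publish" or "human_review" based on the mean criterion score."""
    missing = [name for name in CRITERIA if name not in scores]
    if missing:
        raise ValueError(f"missing scores for: {', '.join(missing)}")
    overall = mean(scores[name] for name in CRITERIA)
    return "publish" if overall >= threshold else "human_review"


# A tutorial that failed to reach its goal is routed to a person.
decision = review_decision(
    {"logical_flow": 0.9, "screenshot_match": 0.9, "goal_achieved": 0.2, "clarity": 0.9}
)
```

A real gate might also fail a tutorial outright when a single criterion, such as goal achievement, scores low, rather than relying on the average alone.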
6. Publishing
Approved tutorials are published with a unique shareable URL. Each tutorial page includes the video, transcript, and
step-by-step guide — all generated from a single browser session.
Privacy and Security
- PII Redaction: Email addresses and other personal data are automatically detected and blurred before screenshots are
saved
- Isolated Sessions: Each tutorial generation runs in its own browser context
- No External Storage: Screenshots and videos are stored on our own infrastructure, not on third-party services