Stepify uses AI to control a real web browser, navigate applications, and document every step along the way. Here's what happens behind the scenes.
The Generation Process
1. Goal Definition
Each tutorial starts with a clear goal — for example, "Add a new team member in NextCloud" or "Create a sales pipeline in Odoo CRM". The goal tells Stepify what to accomplish.
2. Browser Automation
Stepify launches a real browser (Chromium) with pre-configured authentication for the target platform. It navigates the application just like a human user would — clicking buttons, filling forms, and waiting for pages to load.
3. Screenshot Capture
At each meaningful action, Stepify captures a screenshot at 1280×800 resolution. These screenshots are processed to:
-
Redact any personally identifiable information (PII) like email addresses
-
Add visual annotations highlighting where the action occurred
-
Generate alt text for accessibility
4. Video Recording
The entire browser session is recorded as a video. After recording, Stepify generates:
-
A full-length tutorial video (WebM format)
-
A short preview clip for the tutorials listing page
-
WebVTT captions describing each step
5. Quality Review
Every generated tutorial goes through an automated quality review that checks:
-
Whether steps follow a logical flow
-
Whether screenshots match their descriptions
-
Whether the goal was actually achieved
-
Overall clarity and completeness
Tutorials scoring below the quality threshold are flagged for human review rather than published automatically.
6. Publishing
Approved tutorials are published with a unique shareable URL. Each tutorial page includes the video, transcript, and step-by-step guide — all generated from a single browser session.
Privacy and Security
-
PII Redaction: Email addresses and other personal data are automatically detected and blurred in screenshots before saving
-
Isolated Sessions: Each tutorial generation runs in its own browser context
-
No External Data: Screenshots and videos are stored on our own infrastructure, not third-party services