
How Stepify Works

Last updated on Feb 15, 2026

Stepify uses AI to control a real web browser, navigate applications, and document every step along the way. Here's what happens behind the scenes.

The Generation Process

1. Goal Definition

Each tutorial starts with a clear goal, for example "Add a new team member in Nextcloud" or "Create a sales pipeline in Odoo CRM". The goal tells Stepify what to accomplish.

2. Browser Automation

Stepify launches a real browser (Chromium) with pre-configured authentication for the target platform. It navigates the application just as a human user would: clicking buttons, filling in forms, and waiting for pages to load.
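
To make this step concrete, here is a minimal sketch of how a scripted browser session of this kind could look. It assumes the Playwright library for Python, which this page does not name as Stepify's actual tooling. The Action class, the run_actions and record_session functions, and the auth.json, shots, and videos paths are all hypothetical names chosen for illustration. The 1280×800 viewport matches the screenshot resolution described in the next step.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Action:
    """One scripted browser action (hypothetical structure)."""
    kind: str                    # "goto", "click", or "fill"
    target: str                  # URL for "goto", CSS selector otherwise
    value: Optional[str] = None  # text to type for "fill"


def run_actions(page, actions: List[Action], shot_dir: str = "shots") -> List[str]:
    """Replay actions on a Playwright-style page, one screenshot per action."""
    paths = []
    for i, action in enumerate(actions, start=1):
        if action.kind == "goto":
            page.goto(action.target)
        elif action.kind == "click":
            page.click(action.target)
        elif action.kind == "fill":
            page.fill(action.target, action.value or "")
        else:
            raise ValueError(f"unknown action kind: {action.kind}")
        page.wait_for_load_state("load")  # wait for the page to settle
        path = f"{shot_dir}/step-{i:02d}.png"
        page.screenshot(path=path)
        paths.append(path)
    return paths


def record_session(actions: List[Action],
                   auth_state: str = "auth.json",
                   video_dir: str = "videos") -> List[str]:
    """Run the actions in a fresh Chromium context (assumes Playwright is installed)."""
    # Third-party dependency, imported lazily so run_actions stays usable without it.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        context = browser.new_context(
            viewport={"width": 1280, "height": 800},
            storage_state=auth_state,    # saved login state (assumed file)
            record_video_dir=video_dir,  # Playwright records the session video here
        )
        page = context.new_page()
        try:
            return run_actions(page, actions)
        finally:
            context.close()  # closing the context finalizes the video file
            browser.close()
```

Keeping run_actions separate from the browser setup means the replay logic can be exercised with a stand-in page object, without launching a real browser.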

3. Screenshot Capture

At each meaningful action, Stepify captures a screenshot at 1280×800 resolution. These screenshots are processed to:

  • Redact any personally identifiable information (PII), such as email addresses

  • Add visual annotations highlighting where the action occurred

  • Generate alt text for accessibility
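
As an illustration of the redaction step, the sketch below finds which screenshot regions contain an email address so that they can be blurred. It assumes that text and bounding boxes have already been extracted from the screenshot (for example by OCR or from the page's DOM); the TextBox type, the regions_to_blur function, and the simplified email pattern are hypothetical and not taken from Stepify itself.

```python
import re
from typing import List, NamedTuple, Tuple

# Simplified email pattern for illustration; real PII detection would cover more cases.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class TextBox(NamedTuple):
    """A piece of on-screen text and its position in pixels (hypothetical structure)."""
    text: str
    x: int
    y: int
    width: int
    height: int


def regions_to_blur(boxes: List[TextBox]) -> List[Tuple[int, int, int, int]]:
    """Return (x, y, width, height) for every box whose text contains an email address."""
    return [
        (box.x, box.y, box.width, box.height)
        for box in boxes
        if EMAIL_RE.search(box.text)
    ]


boxes = [
    TextBox("Invite ada@example.com", 10, 20, 200, 16),
    TextBox("Save", 300, 20, 40, 16),
]
print(regions_to_blur(boxes))  # [(10, 20, 200, 16)]
```

An image library would then blur each returned rectangle before the screenshot is written to disk.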

4. Video Recording

The entire browser session is recorded as a video. After recording, Stepify generates:

  • A full-length tutorial video (WebM format)

  • A short preview clip for the tutorials listing page

  • WebVTT captions describing each step
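
For readers unfamiliar with WebVTT, the following sketch shows how per-step captions could be assembled into a caption file. The format itself (a WEBVTT header followed by cues with hh:mm:ss.mmm timestamps) is standard; the function names and the (start, end, description) step tuples are assumptions made for this example.

```python
from typing import List, Tuple


def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as a WebVTT timestamp (hh:mm:ss.mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_webvtt(steps: List[Tuple[float, float, str]]) -> str:
    """Build a WebVTT document from (start_seconds, end_seconds, description) tuples."""
    lines = ["WEBVTT", ""]
    for number, (start, end, text) in enumerate(steps, start=1):
        lines.append(str(number))  # optional cue identifier
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)         # cue text must not contain blank lines or "-->"
        lines.append("")           # a blank line ends the cue
    return "\n".join(lines)


print(build_webvtt([
    (0, 3.5, "Open the Users page"),
    (3.5, 8, "Click the New user button"),
]))
```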

5. Quality Review

Every generated tutorial goes through an automated quality review that checks:

  • Whether steps follow a logical flow

  • Whether screenshots match their descriptions

  • Whether the goal was actually achieved

  • Overall clarity and completeness

Tutorials scoring below the quality threshold are flagged for human review rather than published automatically.
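
The gating rule above can be sketched as follows. This page does not say how the four checks are scored or combined, so the 0-to-1 scale, the simple average, the 0.8 threshold, and all names here are assumptions for illustration only.

```python
from dataclasses import dataclass

QUALITY_THRESHOLD = 0.8  # hypothetical value; the real threshold is not documented here


@dataclass
class ReviewScores:
    """Scores from 0.0 to 1.0 for each automated check (assumed scale)."""
    logical_flow: float
    screenshot_match: float
    goal_achieved: float
    clarity: float

    def overall(self) -> float:
        # Assumption: a plain average; a real reviewer might weight checks differently.
        total = (self.logical_flow + self.screenshot_match
                 + self.goal_achieved + self.clarity)
        return total / 4


def review_decision(scores: ReviewScores, threshold: float = QUALITY_THRESHOLD) -> str:
    """Return "publish" at or above the threshold, otherwise "human_review"."""
    return "publish" if scores.overall() >= threshold else "human_review"


print(review_decision(ReviewScores(1.0, 1.0, 1.0, 1.0)))  # publish
print(review_decision(ReviewScores(0.5, 0.5, 0.5, 0.5)))  # human_review
```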

6. Publishing

Approved tutorials are published with a unique shareable URL. Each tutorial page includes the video, transcript, and step-by-step guide, all generated from a single browser session.

Privacy and Security

  • PII Redaction: Email addresses and other personal data are automatically detected and blurred in screenshots before they are saved

  • Isolated Sessions: Each tutorial generation runs in its own browser context

  • No External Data: Screenshots and videos are stored on our own infrastructure, not on third-party services