AUTONOMOUS DESKTOP INTELLIGENCE

CORTEX

An AI agent with full control over your Windows desktop. It sees your screen, operates any app, browses the web, runs terminal commands, hears your voice — and learns new skills from JSON definitions.

<2s Avg Action

Six Ways to Control

Every interface your computer exposes — unified under one AI brain.

Desktop UIA

Full Windows UI Automation. Click buttons by name, type into fields, read element trees — no coordinates needed.

ClickByName · TypeByName · FindElements · InvokeElement · SkillScaffold
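To illustrate what finding an element by name rather than by coordinates can look like, here is a minimal sketch over a mock element tree. The `Element` class and `find_elements` function are hypothetical stand-ins written for this example; they are not Cortex's FindElements tool or the Windows UI Automation API.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """Hypothetical stand-in for one node in a UI element tree."""
    name: str
    control_type: str
    children: list["Element"] = field(default_factory=list)


def find_elements(root: Element, name: str) -> list[Element]:
    """Depth-first search for elements whose name matches, ignoring case."""
    matches = []
    stack = [root]
    while stack:
        node = stack.pop()
        if node.name.lower() == name.lower():
            matches.append(node)
        # Reverse so children are visited in their original order.
        stack.extend(reversed(node.children))
    return matches


# Mock window tree, invented for illustration only.
window = Element("Spotify", "Window", [
    Element("Search", "Edit"),
    Element("Player", "ToolBar", [Element("Play", "Button")]),
])

play_buttons = find_elements(window, "play")
```

A click-by-name tool would then act on the matched element directly, so no screen coordinates are involved.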

Browser CDP

Chrome DevTools Protocol. Navigate, click elements, fill forms, intercept network, execute JavaScript.

NavigateToUrl · ClickElement · GetPageContent · ExecuteJavaScript
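Chrome DevTools Protocol commands are JSON messages carrying an `id`, a `method`, and `params`. The sketch below only builds such payloads. Mapping NavigateToUrl to `Page.navigate` and ExecuteJavaScript to `Runtime.evaluate` is an assumption made for illustration, not a description of how Cortex is implemented, and the `cdp_message` helper is hypothetical.

```python
import itertools
import json

# Each command needs a unique id so replies can be matched to requests.
_next_id = itertools.count(1)


def cdp_message(method: str, **params) -> str:
    """Serialize one CDP command as a JSON string (hypothetical helper)."""
    return json.dumps({"id": next(_next_id), "method": method, "params": params})


# Assumed mapping: NavigateToUrl -> Page.navigate
navigate = cdp_message("Page.navigate", url="https://example.com")

# Assumed mapping: ExecuteJavaScript -> Runtime.evaluate
evaluate = cdp_message(
    "Runtime.evaluate", expression="document.title", returnByValue=True
)
```

In a real client these strings would be sent over the browser's debugging WebSocket, which is left out here.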

Terminal

Native Windows Pseudo Console (ConPTY). Run commands and manage processes in isolated terminals, each with a unique tracking ID.

SendCommand · StartProcess · StopProcess · CLIScaffold
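The tracking ID idea can be sketched with a small process registry. This is an assumption-laden illustration: it uses ordinary pipes from Python's `subprocess` module rather than a Windows Pseudo Console, and `TerminalRegistry` with its methods is a hypothetical name, not Cortex's StartProcess or StopProcess.

```python
import subprocess
import sys
import uuid


class TerminalRegistry:
    """Hypothetical registry that hands out one tracking ID per process."""

    def __init__(self):
        self._procs = {}

    def start_process(self, argv):
        tracking_id = uuid.uuid4().hex
        self._procs[tracking_id] = subprocess.Popen(
            argv, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, text=True
        )
        return tracking_id

    def read_output(self, tracking_id, timeout=10):
        # Waits for the process to exit and returns everything it printed.
        output, _ = self._procs[tracking_id].communicate(timeout=timeout)
        return output

    def stop_process(self, tracking_id):
        proc = self._procs.pop(tracking_id)
        if proc.poll() is None:
            proc.terminate()
        proc.wait()
        if proc.stdout:
            proc.stdout.close()
        return proc.returncode


registry = TerminalRegistry()
first = registry.start_process([sys.executable, "-c", "print('hello')"])
second = registry.start_process([sys.executable, "-c", "print('world')"])

first_output = registry.read_output(first)
exit_code = registry.stop_process(first)
registry.stop_process(second)
```

Because every call is keyed by ID, output from one process cannot be confused with output from another.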

Camera Vision

OpenCV-powered webcam capture. The AI can see the physical world — read documents, identify objects, observe the environment.

CaptureCamera · ListCameras

Voice Control

Whisper-powered speech recognition with wake word detection. Say "Hey Cortex" and speak your command naturally.
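One simple way to picture the wake word step is as a filter on transcribed text. The sketch below assumes the speech has already been turned into a string and that the wake word is matched in that text. Cortex may well detect the wake word on the audio itself, and `extract_command` is a hypothetical helper.

```python
import re

WAKE_WORD = "hey cortex"


def extract_command(transcript: str, wake_word: str = WAKE_WORD):
    """Return the text spoken after the wake word, or None if it is absent."""
    # Allow punctuation or extra spaces between the wake word's parts.
    words = r"\W+".join(re.escape(part) for part in wake_word.split())
    match = re.search(
        rf"\b{words}\b[\s,.!?:;-]*(.*)", transcript, flags=re.IGNORECASE | re.DOTALL
    )
    if match is None:
        return None
    command = match.group(1).strip()
    return command or None


command = extract_command("Hey Cortex, open Spotify")
ignored = extract_command("open Spotify")
```

Speech without the wake word yields `None`, so it is never treated as a command.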

"Hey Cortex" · Whisper STT · Natural Language

Skill Engine

JSON-driven deterministic skills. The AI inspects any app, generates a skill definition, saves it, and executes it — all in one session.

SkillScaffold · CLIScaffold · WriteFile · RunNewSkill

Watch It Think

From inspecting an app to creating and running a skill — in seconds.


Teach It Once

Define app automations as JSON. The agent executes deterministic step sequences — no fumbling, no screenshots.

Spotify.json
{
  "App": "Spotify",
  "WindowTitleMatch": "Spotify",
  "Actions": [{
    "Name": "SpotifySearch",
    "Parameters": [{
      "Name": "Query",
      "Required": true
    }],
    "Steps": [
      { "Type": "FocusWindow" },
      { "Type": "PressKeyCombination",
        "Keys": ["Control", "L"] },
      { "Type": "TypeText",
        "Text": "{Query}" },
      { "Type": "PressKey",
        "Key": "Return" }
    ]
  }]
}
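To make the `{Query}` placeholder concrete, here is a minimal sketch of how a runner might resolve the steps of the skill above before executing them. The `resolve_steps` function and its substitution rules are assumptions for illustration. The actual engine is written in C# and may behave differently.

```python
import json

# The Spotify skill shown above, embedded so the example is self-contained.
SKILL_JSON = """
{
  "App": "Spotify",
  "WindowTitleMatch": "Spotify",
  "Actions": [{
    "Name": "SpotifySearch",
    "Parameters": [{ "Name": "Query", "Required": true }],
    "Steps": [
      { "Type": "FocusWindow" },
      { "Type": "PressKeyCombination", "Keys": ["Control", "L"] },
      { "Type": "TypeText", "Text": "{Query}" },
      { "Type": "PressKey", "Key": "Return" }
    ]
  }]
}
"""


def resolve_steps(skill: dict, action_name: str, arguments: dict) -> list:
    """Return the action's steps with {Name} placeholders filled in."""
    action = next(a for a in skill["Actions"] if a["Name"] == action_name)

    # Fail early if a required parameter was not supplied.
    for param in action.get("Parameters", []):
        if param.get("Required") and param["Name"] not in arguments:
            raise ValueError(f"missing required parameter: {param['Name']}")

    def fill(value):
        if isinstance(value, str):
            for name, arg in arguments.items():
                value = value.replace("{" + name + "}", str(arg))
            return value
        if isinstance(value, list):
            return [fill(item) for item in value]
        return value

    return [{key: fill(val) for key, val in step.items()} for step in action["Steps"]]


skill = json.loads(SKILL_JSON)
steps = resolve_steps(skill, "SpotifySearch", {"Query": "lofi beats"})
```

The resolved list is plain data, which is what makes the sequence deterministic: the same arguments always produce the same steps.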
01 Inspect

Use SkillScaffold or CLIScaffold to discover an app's UI elements, keyboard shortcuts, and supported patterns.

02 Define

Write a JSON skill with deterministic steps: FocusWindow, PressKey, TypeText, InvokeElement, and more.

03 Save

The agent writes the JSON file via WriteFile, which runs in native C# with no terminal involved. Deduplication prevents loops.

04 Execute

Call RunNewSkill to execute the skill immediately in the same session. No restart. No rebuild. Instant.
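The save and execute steps can be sketched as a small store that skips identical rewrites and makes a new skill runnable right away. Everything here is hypothetical: `SkillStore`, the content-hash approach to deduplication, and the dry-run `run_new_skill` are assumptions for illustration, not the behavior of Cortex's WriteFile or RunNewSkill.

```python
import hashlib
import json
import tempfile
from pathlib import Path


class SkillStore:
    """Hypothetical store: writes skill files and registers their actions."""

    def __init__(self, directory):
        self.directory = Path(directory)
        self.directory.mkdir(parents=True, exist_ok=True)
        self._hashes = {}  # file name -> hash of the last content written
        self.actions = {}  # action name -> list of steps

    def write_skill(self, file_name: str, skill: dict) -> bool:
        """Write the skill; return False if the same content was already saved."""
        text = json.dumps(skill, indent=2, sort_keys=True)
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if self._hashes.get(file_name) == digest:
            return False  # assumed dedup rule: identical rewrite is skipped
        (self.directory / file_name).write_text(text, encoding="utf-8")
        self._hashes[file_name] = digest
        # Register actions in memory so they can run without a restart.
        for action in skill.get("Actions", []):
            self.actions[action["Name"]] = action["Steps"]
        return True

    def run_new_skill(self, action_name: str) -> list:
        """Dry run: return the step types that would be executed, in order."""
        return [step["Type"] for step in self.actions[action_name]]


skill = {
    "App": "Spotify",
    "Actions": [{
        "Name": "SpotifySearch",
        "Steps": [{"Type": "FocusWindow"}, {"Type": "TypeText", "Text": "{Query}"}],
    }],
}

with tempfile.TemporaryDirectory() as tmp:
    store = SkillStore(tmp)
    first_write = store.write_skill("Spotify.json", skill)
    second_write = store.write_skill("Spotify.json", skill)
    file_exists = (Path(tmp) / "Spotify.json").exists()
    trace = store.run_new_skill("SpotifySearch")
```

Skipping the second, identical write is one plausible way an agent avoids saving the same file in a loop.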

Under the Hood

Runtime: .NET 10 / C# 13
Desktop Automation: UIA3 Native
Browser: Chrome DevTools Protocol (CDP)
Terminal: Pseudo Console
Vision: Camera Capture
Voice: Local Whisper
AI Framework: Microsoft.Extensions.AI
Extensibility: MCP Servers + JSON Skills

Give Your AI Hands. Eyes. Ears.

Give your AI full autonomy over your desktop.