Keyboard API

Overview

The Keyboard API endpoints provide tools for simulating keyboard input on Windows. These endpoints allow an agent system to type text at the current cursor position or simulate keyboard shortcuts and hotkey combinations. This functionality is essential for automating user interactions with applications, filling out forms, and triggering system functionality through keyboard shortcuts.

The keyboard functions enable agents to interact with applications just as a human would through the keyboard, making them powerful tools for workflow automation and application control.

Available Endpoints

Type Text

POST /tools-api/keyboard/type

Types the specified text at the current cursor position. You can include "\n" in the text to simulate pressing the Enter key after typing.

View Details

Press Hotkey

POST /tools-api/keyboard/press

Presses a hotkey combination (e.g., Win+R for Run dialog, Alt+F4 to close window, Ctrl+Alt+Delete for system menu). Supports various modifier keys including Ctrl, Alt, Shift, Win, and AltGr.

View Details