Mouse (Coordinates) API Endpoints

Overview

The Mouse (Coordinates) API category provides endpoints for controlling the mouse using precise screen coordinates. These endpoints enable click operations, dragging, scrolling, and other mouse actions at exact positions on the screen.

Use these endpoints when you need to interact with specific screen locations and have the exact X and Y coordinates. For more context-aware mouse control with AI assistance, see the Mouse (AI Vision) category.

Available Endpoints

Mouse Click

POST /tools-api/mouse/click

Performs a left mouse button click at the specified coordinates.

View Details

Mouse Double Click

POST /tools-api/mouse/doubleclick

Performs a double click at the specified coordinates.

View Details

Mouse Right Click

POST /tools-api/mouse/rightclick

Performs a right mouse button click at the specified coordinates.

View Details

Mouse Drag

POST /tools-api/mouse/drag

Performs a drag operation from one set of coordinates to another.

View Details

Mouse Scroll

POST /tools-api/mouse/scroll

Performs a scroll operation at the specified coordinates with the given number of clicks.

View Details

Usage Notes

When using these endpoints, keep in mind that screen coordinates are measured in pixels from the top-left corner of the screen (0,0). The exact screen resolution will depend on the system's display settings.

For most reliable results, make sure the target window is in the foreground before performing mouse operations. You can use the bring-to-front endpoint from the Windows Automation category for this purpose.

If you don't know the exact coordinates of an element you want to interact with, consider using the Mouse (AI Vision) category which allows you to describe UI elements instead of providing exact coordinates.