/
Server configuration and capabilities from the MCP manifest
Size: 5,994 bytes (1,499 tokens)
Tools Found: 15
Analysis ID: 3ow70v6v
Windows MCP is an open-source project that enables seamless integration between AI agents and the Windows operating system, providing a rich toolset for UI automation and interaction.
Stars
Registry ID: 357d625e-8b12-4ae1-8915-ee77d3dc9a38
Added to Registry: 9/21/2025
Last Updated: 9/22/2025
Last Seen: 9/22/2025
License: MIT License
Default Branch: main
Last Push: 9/21/2025
Last Release: 8/25/2025
Open Issues: 6
Historical performance and growth metrics over time
{ "server.json": { "name": "io.github.CursorTouch/Windows-MCP", "status": "active", "$schema": "https://static.modelcontextprotocol.io/schemas/2025-07-09/server.schema.json", "version": "1.0.0", "packages": [ { "version": "0.3.0", "transport": { "type": "stdio" }, "identifier": "windows_mcp", "runtime_hint": "uvx", "registry_type": "pypi", "registry_base_url": "https://pypi.org" } ], "repository": { "url": "https://github.com/CursorTouch/Windows-MCP", "source": "github" }, "description": "An MCP Server for computer-use in Windows OS" }, "manifest.json": { "icon": "./assets/logo.png", "name": "Windows-MCP", "tools": [ { "name": "Launch-Tool", "description": "Launch an application from the Windows Start Menu by name (e.g., \"notepad\", \"calculator\", \"chrome\")" }, { "name": "Powershell-Tool", "description": "Execute PowerShell commands and return the output with status code" }, { "name": "State-Tool", "description": "Capture comprehensive desktop state including focused/opened applications, interactive UI elements (buttons, text fields, menus), informative content (text, labels, status), and scrollable areas. Optionally includes visual screenshot when use_vision=True. Essential for understanding current desktop context and available UI interactions." }, { "name": "Clipboard-Tool", "description": "Copy text to clipboard or retrieve current clipboard content. Use \"copy\" mode with text parameter to copy, \"paste\" mode to retrieve." }, { "name": "Click-Tool", "description": "Click on UI elements at specific coordinates. Supports left/right/middle mouse buttons and single/double/triple clicks. Use coordinates from State-Tool output." }, { "name": "Type-Tool", "description": "Type text into input fields, text areas, or focused elements. Set clear=True to replace existing text, False to append. Click on target element coordinates first." }, { "name": "Switch-Tool", "description": "Switch to a specific application window (e.g., \"notepad\", \"calculator\", \"chrome\", etc.) and bring to foreground." }, { "name": "Resize-Tool", "description": "Resize a specific application window (e.g., \"notepad\", \"calculator\", \"chrome\", etc.) to specific size (WIDTHxHEIGHT) or move to specific location (X,Y)." }, { "name": "Scroll-Tool", "description": "Scroll at specific coordinates or current mouse position. Use wheel_times to control scroll amount (1 wheel = ~3-5 lines). Essential for navigating lists, web pages, and long content." }, { "name": "Drag-Tool", "description": "Drag and drop operation from source coordinates to destination coordinates. Useful for moving files, resizing windows, or drag-and-drop interactions." }, { "name": "Move-Tool", "description": "Move mouse cursor to specific coordinates without clicking. Useful for hovering over elements or positioning cursor before other actions." }, { "name": "Shortcut-Tool", "description": "Execute keyboard shortcuts using key combinations. Pass keys as list (e.g., ['ctrl', 'c'] for copy, ['alt', 'tab'] for app switching, ['win', 'r'] for Run dialog)." }, { "name": "Key-Tool", "description": "Press individual keyboard keys. Supports special keys like 'enter', 'escape', 'tab', 'space', 'backspace', 'delete', arrow keys 'up', 'down', 'left', 'right'), function keys ('f1'-'f12')." }, { "name": "Wait-Tool", "description": "Pause execution for specified duration in seconds. Useful for waiting for applications to load, animations to complete, or adding delays between actions." }, { "name": "Scrape-Tool", "description": "Fetch and convert webpage content to markdown format. Provide full URL including protocol (http/https). Returns structured text content suitable for analysis." } ], "author": { "name": "CursorTouch", "email": "jeogeoalukka@gmail.com" }, "server": { "type": "python", "mcp_config": { "env": {}, "args": [ "--directory", "${__dirname}", "run", "main.py" ], "command": "uv" }, "entry_point": "main.py" }, "license": "MIT", "version": "0.3.0", "homepage": "https://github.com/CursorTouch", "keywords": [ "windows", "automation", "ai", "mcp" ], "repository": { "url": "https://github.com/CursorTouch/Windows-MCP", "type": "git" }, "description": "MCP Server that enables Claude to interact with Windows OS", "dxt_version": "0.1", "screenshots": [ "./assets/screenshots", "./assets/screenshots/screenshot_1.png", "./assets/screenshots/screenshot_2.png", "./assets/screenshots/screenshot_3.png" ], "compatibility": { "platforms": [ "win32" ] }, "documentation": "https://github.com/CursorTouch/Windows-MCP", "tools_generated": true, "long_description": "Windows MCP is an open-source project that enables seamless integration between AI agents and the Windows operating system. Acting as an MCP server bridges the gap between LLMs and the Windows operating system, allowing agents to perform tasks such as **file navigation, application control, UI interaction, QA testing,** and more.\n\n## Key Features\n\n- **Seamless Windows Integration**: Interacts natively with Windows UI elements, opens apps, controls windows, simulates user input, and more.\n- **Use Any LLM (Vision Optional)**: Unlike many automation tools, Windows MCP doesn't rely on any traditional computer vision techniques or specific fine-tuned models; it works with any LLMs, reducing complexity and setup time.\n- **Rich Toolset for UI Automation**: Includes tools for basic keyboard, mouse operation and capturing window/UI state.\n- **Lightweight & Open-Source**: Minimal dependencies and easy setup with full source code available under MIT license.\n- **Customizable & Extendable**: Easily adapt or extend tools to suit your unique automation or AI integration needs.\n- **Real-Time Interaction**: Typical latency between actions (e.g., from one mouse click to the next) ranges from **1.5 to 2.3 secs**, and may slightly vary based on the number of active applications and system load, also the inferencing speed of the llm.\n\n## Requirements\n\n### UV Package Manager\nThis MCP server requires [UV](https://github.com/astral-sh/uv), a fast Python package manager. \n\n```bash\npip install uv\n```\n\nFor detailed installation instructions, see the [UV documentation](https://github.com/astral-sh/uv#installation)." } }
Forks
Watchers
Contributors
Last Push: 9/21/2025
Last Release: 8/25/2025
Open Issues: 6