This guide walks you through integrating OpenAI’s Computer Use Agent (CUA) with Browserbase to run browser agents in the cloud. CUA is an AI model that can see the screen, understand context, and take actions within a browser, which enables advanced interaction with web applications. Pairing CUA with Browserbase’s remote browser infrastructure lets you run AI-powered browser agents in the cloud at scale. Try out the Computer Use Agent now: cua.browserbase.com
Prerequisites
- OpenAI API key with Computer Use Agent access
- Browserbase account and API key
- Python 3.8+
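Before running anything, it can help to confirm the prerequisites above locally. The following is a minimal sketch, not part of the sample project: the environment variable names `OPENAI_API_KEY` and `BROWSERBASE_API_KEY` are assumptions about where the keys are stored, and the helper `missing_prerequisites` is hypothetical. It only checks that the values are present; it cannot verify that the OpenAI key actually has Computer Use Agent access.

```python
import os
import sys
from typing import List, Mapping, Tuple

# Assumed variable names; adjust them to match how your project stores its keys.
REQUIRED_ENV_VARS = ("OPENAI_API_KEY", "BROWSERBASE_API_KEY")
MIN_PYTHON = (3, 8)  # from the prerequisites list


def missing_prerequisites(
    env: Mapping[str, str],
    python_version: Tuple[int, int] = sys.version_info[:2],
) -> List[str]:
    """Return a human-readable list of unmet prerequisites (empty if all are met)."""
    problems = []
    if tuple(python_version) < MIN_PYTHON:
        problems.append(
            "Python %d.%d+ is required, found %d.%d"
            % (MIN_PYTHON[0], MIN_PYTHON[1], python_version[0], python_version[1])
        )
    for name in REQUIRED_ENV_VARS:
        # Treat empty or whitespace-only values as missing.
        if not env.get(name, "").strip():
            problems.append("%s is not set" % name)
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites(os.environ):
        print(problem)
```

Passing the environment in as a mapping, rather than reading `os.environ` inside the function, keeps the check easy to test with a plain dictionary.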
Basic integration
This basic setup gets you up and running with a CUA agent that uses Browserbase as the underlying browser automation platform.
Customizing the CUA agent
You can customize the CUA agent by updating the flags in the CLI:
- --input: The initial input to the agent (optional; the CLI prompts you for input if it is not provided).
- --debug: Enable debug mode.
- --show: Show images (screenshots) during execution.
- --start-url: Start the browsing session at a specific URL (browser environments only). By default, the CLI starts the browsing session at https://bing.com.
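To make the flag semantics concrete, here is a minimal sketch of how a CLI with these four flags could be declared using Python's standard `argparse` module. It is an illustration only, not the sample project's actual source: the function names `build_parser` and `parse_flags` are hypothetical, and the only behavior taken from this guide is the flag names, the optional `--input`, and the `https://bing.com` default for `--start-url`.

```python
import argparse
from typing import List, Optional

DEFAULT_START_URL = "https://bing.com"  # default stated in this guide


def build_parser() -> argparse.ArgumentParser:
    """Declare the four flags described above (illustrative, not the real CLI)."""
    parser = argparse.ArgumentParser(description="Sketch of a CUA agent CLI")
    parser.add_argument(
        "--input",
        default=None,
        help="Initial input to the agent; if omitted, the CLI would prompt for it.",
    )
    parser.add_argument("--debug", action="store_true", help="Enable debug mode.")
    parser.add_argument(
        "--show",
        action="store_true",
        help="Show images (screenshots) during execution.",
    )
    parser.add_argument(
        "--start-url",
        default=DEFAULT_START_URL,
        help="URL to open when the browsing session starts.",
    )
    return parser


def parse_flags(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse an explicit argument list, or sys.argv[1:] when argv is None."""
    return build_parser().parse_args(argv)
```

For example, `parse_flags(["--show", "--start-url", "https://example.com"])` yields a namespace with `show` set to `True`, `debug` set to `False`, `input` set to `None`, and `start_url` set to the given URL. Note that `argparse` exposes `--start-url` as the attribute `start_url`.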
Related resources
Browserbase documentation
Explore the full Browserbase documentation
OpenAI API documentation
Learn more about OpenAI’s APIs
Example projects on GitHub
Find the sample project for this integration