Amazon Nova Act - An SDK for web agents
Amazon AGI Labs introduced Nova Act, a new AI model designed to perform actions in a web browser. Alongside the model, the lab released the Nova Act SDK, an agent framework for building and deploying web agents.

The details
- According to the reported reliability benchmarks for browser tasks, Nova Act outperforms similar AI systems such as Claude 3.7 Sonnet and OpenAI’s Computer Use Agent (CUA) (see the image below).
- The new SDK lets developers experiment with an early version of Nova Act. With it, they can build agents that carry out tasks in a web browser, such as submitting an out-of-office request or creating calendar events.
- According to Amazon AGI Labs, Nova Act SDK focuses on reliable building blocks that can be combined into more complex workflows.
- The SDK works with Playwright, an open-source browser automation framework created by Microsoft. Playwright lets developers control web browsers through code.
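The idea of combining small, reliable building blocks into a larger workflow can be sketched in a few lines of Python. This is only an illustration, not the Nova Act SDK's API: `run_workflow`, `StepResult`, and `fake_act` are hypothetical names, and `fake_act` is a stand-in that prints the instruction instead of driving a browser.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class StepResult:
    prompt: str  # the instruction given to the agent
    ok: bool     # whether the agent reported success


def run_workflow(act: Callable[[str], bool], steps: List[str]) -> List[StepResult]:
    """Run small steps in order and stop at the first failure."""
    results: List[StepResult] = []
    for prompt in steps:
        ok = act(prompt)
        results.append(StepResult(prompt, ok))
        if not ok:
            break  # later steps depend on earlier ones, so do not continue
    return results


def fake_act(prompt: str) -> bool:
    """Hypothetical stand-in for a browser agent: prints and reports success."""
    print(f"agent would now: {prompt}")
    return True


# A calendar task split into small, individually checkable steps.
steps = [
    "open the calendar",
    "create an event titled 'Team sync' for tomorrow at 10:00",
    "save the event",
]
results = run_workflow(fake_act, steps)
print(f"{sum(r.ok for r in results)} of {len(steps)} steps succeeded")
```

Keeping each step short makes it easier to see where a workflow breaks, and stopping at the first failure avoids acting on a page that is in an unexpected state.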

Our thoughts
The Nova Act SDK is a modern agent framework designed to perform tasks reliably and help automate complex workflows. Amazon will also use it in future Alexa updates.
In addition, Amazon is planning a feature that lets users shop at other online stores through the Amazon Shopping app. An agent enters data from the user's Amazon account on the external website. Strong data protection is essential here, because the agent accesses sensitive information such as bank details.
More information: 🔗 Amazon AGI Labs | Amazon
Magic AI tool
This week’s Magic AI tool is ElevenLabs*. With this tool, you can create realistic speech from text in seconds!
ElevenLabs is a platform that uses advanced AI to generate realistic speech, and the results sound convincingly natural. As a blogger, you can turn your posts into audio tracks and offer the audio in different languages, so language barriers become much less of an obstacle.
Step-by-Step Guide:
- Sign up for free at ElevenLabs.com*. The free plan includes 10,000 characters per month (about 10 minutes of audio).
- Click on the “Text to Speech” tab to navigate to the Speech Synthesis tool.
- Enter your text and select a voice of your choice.
- Optional: You can adjust the voice in the settings menu.
- Click “Generate speech” to create your audio file. That’s it! 🎉🎉
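Before pasting a long post into the tool, it can help to check whether it fits the free quota. The sketch below uses only the two figures from the guide above (10,000 characters, roughly 10 minutes). The function name `estimate` is hypothetical, and counting every character including spaces is an assumption about how the quota is measured.

```python
FREE_CHARS_PER_MONTH = 10_000   # free quota from the guide above
APPROX_MINUTES_FOR_QUOTA = 10   # rough audio length for the full quota


def estimate(text: str, used_chars: int = 0):
    """Return (characters, approx. minutes of audio, characters left this month)."""
    chars = len(text)  # assumption: spaces and punctuation count as characters
    minutes = chars * APPROX_MINUTES_FOR_QUOTA / FREE_CHARS_PER_MONTH
    remaining = FREE_CHARS_PER_MONTH - used_chars - chars
    return chars, minutes, remaining


# Placeholder text standing in for a blog post (13 characters repeated 200 times).
post = "Hello world. " * 200
chars, minutes, remaining = estimate(post)
print(f"{chars} characters, about {minutes:.1f} min of audio, {remaining} characters left")
```

A negative value for the remaining characters means the text does not fit into the free quota for the month.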
Hand-picked articles
- Build a Local AI Agent to Chat with Financial Charts Using Agno
- Build a Multi-Agent Stock Market Analyst to Compare Stock Price Performance
- Portfolio Allocation - How to Analyze a Stock Portfolio Using Python