Abstract
With the rapid development of large language model technology, Web agents, as a key technology for automated Web interaction, have gradually become a research hotspot. In this study, a LangGraph - based Web agent was designed and implemented. Driven by the multimodal large language model GPT-4o, and through the automated Web browsing environment Playwright, multiple Web page operation tools were realized. The research demonstrated successful cases of the agent in Web interaction, and at the same time, revealed its challenges in aspects such as page navigation and hallucination handling. Future research will focus on optimizing the agent to improve its stability and execution efficiency in the Web environment.