webqa-agent

WebQA Agent is an autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators.

Maintainers

mmmay0722

WebQA Agent

Join us on 🎮Discord | 💬WeChat

If you like WebQA Agent, please give us a ⭐ on GitHub!

🤖 WebQA Agent is a fully automated web testing agent that understands the web like a human — generating test cases, evaluating functionality, performance, and UX end-to-end. ✨ Available as GUI/CLI for direct use, or as an OpenClaw skill.

🚀 Core Features

📋 Feature Overview

WebQA Agent provides two testing modes for different scenarios: 🤖 Generate Mode and 📋 Run Mode.

Capability	🤖 Generate Mode	📋 Run Mode
Core Features	AI-driven discovery -> Dynamic generation -> Precise execution	Execute based on instructions and expected verification
Use Cases	New features, comprehensive quality assurance	Repeatable and regression testing scenarios
User Input	Minimal: Only URL or a one-sentence business goal	Structured: Simple natural language step descriptions
Advantages	Reflection-based planning, adaptive to UI changes; Configurable functional / performance / security / UX evaluation for comprehensive QA	Stable and predictable results; No selector maintenance; Real-time Console and Network monitoring

Usage & Deployment: Supports CLI execution (see CLI Usage); also supports full-stack deployment (Local / Docker / K8s) with a web interface for visual management. See Deployment.

🛠️ Tool System

Default Tools (Always Enabled):

UI Actions: Browser interactions (click, type, navigate)
UI Assertions: State verification
UX Verification: Text typo checking, layout analysis

Custom Tools (Optional, Configuration-Enabled):

Performance: Lighthouse-based performance testing
Security: Nuclei vulnerability scanning
Link Detection: Dynamic link discovery

Enable custom tools in config.yaml:

test_config:
  custom_tools:
    enabled:
      - lighthouse
      - nuclei

🧭 Architecture

WebQA Agent Architecture

📹 Examples

🎬 Watch Demo: One-click testing of Baidu.com

🚀 Quick Start

Choose between 🛠️ CLI Quick Start or 🖥️ Full-stack Deployment (Web Dashboard).

🛠️ CLI Quick Start (Recommended for Developers)

We recommend using uv (requires Python >= 3.11):

# 1) Create project and install
uv init my-webqa && cd my-webqa
uv add webqa-agent

# 2) Install browser (Required)
uv run playwright install chromium

# 3) Generate Mode
uv run webqa-agent init -m gen  # Init config, edit config.yaml with URL & API Key
uv run webqa-agent gen          # Start AI-driven testing

# 4) Run Mode
uv run webqa-agent init -m run  # Init config, write natural language cases
uv run webqa-agent run          # Start execution

See CLI Usage for more CLI details.

🖥️ Full-stack Deployment (Recommended for Teams)

For a visual dashboard, test management, and execution history, start with Docker Compose:

git clone https://github.com/MigoXLab/webqa-agent.git
cd webqa-agent/deploy/docker-compose
cp .env.example .env
# Edit .env: fill in your LLM API Key
./start.sh

Access via http://localhost. For other deployment methods, see Deployment.

⚙️ CLI Usage

CLI Parameter Details

WebQA Agent provides a concise command-line interface for initialization, autonomous exploration, case execution, and launching the Web UI.

Command	Description	Common Arguments
`init`	Initialize configuration file	`-m <gen/run>`: Specify mode; `-o <path>`: Output path; `--force`: Overwrite existing
`gen`	Generate Mode: AI-driven test generation & execution	`-c <path>`: Config path; `-w <n>`: Parallel workers
`run`	Run Mode: Execute YAML-defined test cases	`-c <path/dir>`: Config file or folder; `-w <n>`: Parallel workers

Examples:

# Initialize Run mode configuration
webqa-agent init -m run

# Run all cases in a directory with 4 parallel workers
webqa-agent run -c ./my_cases -w 4
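
When the CLI is driven from a script (for example in CI), the same invocation can be assembled programmatically. The following is a minimal sketch, not part of WebQA Agent itself: the helper names `build_run_command` and `run_cases` are hypothetical, and it uses only the `-c` and `-w` arguments from the table above. It assumes `webqa-agent` is on the PATH.

```python
import subprocess


def build_run_command(config: str, workers: int = 1) -> list[str]:
    """Build the argument list for `webqa-agent run` from the documented -c and -w flags."""
    if workers < 1:
        raise ValueError("workers must be a positive integer")
    return ["webqa-agent", "run", "-c", config, "-w", str(workers)]


def run_cases(config: str, workers: int = 1) -> int:
    """Invoke the CLI and return its exit code.

    How WebQA Agent maps test outcomes to exit codes is not documented here,
    so the caller should not assume more than "0 usually means success".
    """
    completed = subprocess.run(build_run_command(config, workers), check=False)
    return completed.returncode


if __name__ == "__main__":
    # Mirrors the shell example: all cases in ./my_cases with 4 parallel workers.
    print(build_run_command("./my_cases", workers=4))
```

Passing the arguments as a list avoids shell quoting issues when the config path contains spaces.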

Generate Mode - Configuration

🔧 Optional Dependencies (Custom Tools)

Performance testing (Lighthouse): npm install lighthouse chrome-launcher (requires Node.js ≥18)
Security testing (Nuclei):

  brew install nuclei      # macOS
  nuclei -ut               # Update templates
  # Linux/Windows: https://github.com/projectdiscovery/nuclei/releases
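
Before enabling these tools, it can help to confirm the prerequisites are in place. The sketch below is an illustrative pre-flight check, not something WebQA Agent ships: the function names are hypothetical, and it only checks what the notes above state (Node.js 18 or newer for Lighthouse, and a `nuclei` binary for security scans). It does not verify that the `lighthouse` npm package itself is installed, since a local `npm install` does not place it on the PATH.

```python
import re
import shutil
import subprocess
from typing import Optional


def parse_node_major(version_output: str) -> Optional[int]:
    """Extract the major version from `node --version` output such as 'v18.17.0'."""
    match = re.match(r"v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def node_meets_minimum(minimum_major: int = 18) -> bool:
    """Return True if Node.js is on PATH and its major version is >= minimum_major."""
    if shutil.which("node") is None:
        return False
    result = subprocess.run(
        ["node", "--version"], capture_output=True, text=True, check=False
    )
    major = parse_node_major(result.stdout)
    return major is not None and major >= minimum_major


def nuclei_available() -> bool:
    """Return True if a `nuclei` executable can be found on PATH."""
    return shutil.which("nuclei") is not None


if __name__ == "__main__":
    print("Node.js >= 18:", node_meets_minimum())
    print("nuclei on PATH:", nuclei_available())
```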

📄 Configuration Details

The configuration file must include the test_config field to define test types.

Business Objectives: Specifies business goals to steer AI test focus and coverage.
Custom Tools: Optional tools like Performance (Lighthouse), Security (Nuclei), button checks, and link detection.
Dynamic Step Generation: Automatically generates additional test steps when new UI elements are detected during execution.
Filter Model: Configures a lightweight model for pre-filtering page elements to improve planning efficiency.

For more details, please refer to docs/MODES&CLI.md

target:
  url: https://example.com              # Website URL to test
  description: Website QA testing

test_config:
  business_objectives: Test search functionality, generate 3 test cases
  custom_tools:                         # Optional: Enable custom testing tools (by step_type)
    enabled:
      # - lighthouse                    # Lighthouse performance testing
                                        # Requires: npm install lighthouse chrome-launcher (local, recommended)
                                        # or: npm install -g lighthouse chrome-launcher (global)
      # - nuclei                        # Nuclei security scanning
                                        # Requires: go install -v github.com/projectdiscovery/nuclei/v3/cmd/nuclei@latest
                                        # or download from: https://github.com/projectdiscovery/nuclei/releases
      # - traverse_clickable_elements   # Clickable element traversal testing
      # - detect_dynamic_links          # Dynamic link discovery and validation

llm_config:                             # LLM configuration, supports OpenAI, Anthropic Claude, Google Gemini, and OpenAI-compatible models (e.g., Doubao, Qwen)
  model: gpt-5.4                        # Primary model
  filter_model: gpt-5-mini              # Lightweight model for element filtering (optional)
  api_key: your_api_key                 # Or set via environment variable (OPENAI_API_KEY)
  base_url: https://api.openai.com/v1   # Optional, API endpoint. For OpenAI-compatible models (Doubao, Qwen, etc.), set to their API endpoint

browser_config:
  headless: False                       # Auto True in Docker
  language: en-US

report:
  language: en-US                       # zh-CN or en-US
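
The two modes are distinguished by their required top-level field: `test_config` for Generate Mode, and `cases` for Run Mode. As a rough illustration, a config that has already been parsed into a Python dict could be inspected as follows. The helper names are hypothetical, and treating a config with both or neither field as an error is an assumption of this sketch, not documented behavior.

```python
def detect_mode(config: dict) -> str:
    """Return 'gen' or 'run' depending on which required top-level field is present."""
    has_gen = "test_config" in config
    has_run = "cases" in config
    if has_gen and not has_run:
        return "gen"
    if has_run and not has_gen:
        return "run"
    # Assumption: ambiguous or empty configs are rejected rather than guessed at.
    raise ValueError("config must contain exactly one of 'test_config' or 'cases'")


def enabled_custom_tools(config: dict) -> list[str]:
    """List the entries under test_config.custom_tools.enabled.

    When every entry is commented out in YAML, `enabled:` parses as None,
    so None is normalised to an empty list.
    """
    test_config = config.get("test_config") or {}
    custom_tools = test_config.get("custom_tools") or {}
    return list(custom_tools.get("enabled") or [])


if __name__ == "__main__":
    sample = {
        "target": {"url": "https://example.com"},
        "test_config": {"custom_tools": {"enabled": ["lighthouse", "nuclei"]}},
    }
    print(detect_mode(sample), enabled_custom_tools(sample))
```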

Run Mode - Configuration

Run Mode configuration must include the cases field.

Multi-modal Interaction: Use action to describe visible text, images, or relative positions on the page. Supported browser actions include click, hover, input, clear, keyboard input, scrolling, mouse movement, file upload, drag-and-drop, and wait; page actions include navigation and back.
Multi-modal Verification: Use verify to ensure the agent stays on track, validating visual content, URLs, paths, and combined image–element conditions.
End-to-End Monitoring: Monitors Console logs and Network request status. Configure ignore_rules to suppress known errors.

For more details and test case writing specifications, please refer to docs/MODES&CLI.md

target:
  url: https://example.com              # Target website URL

llm_config:                             # LLM configuration
  api: openai
  model: gpt-5-mini
  api_key: your_api_key_here
  base_url: https://api.openai.com/v1

browser_config:
  viewport: {"width": 1280, "height": 720}
  headless: False                       # Auto True in Docker
  language: en-US
  # cookies: /path/to/cookie.json

ignore_rules:                           # Ignore rules configuration (optional)
  network:                              # Network request ignore rules
    - pattern: ".*\\.google-analytics\\.com.*"
      type: "domain"
  console:                              # Console log ignore rules
    - pattern: "Failed to load resource.*favicon"
      match_type: "regex"
    - pattern: "Warning:"
      match_type: "contains"

cases:                                  # Test case list
  - name: Image Upload                  # Test case name
    steps:                              # Test steps
      - action: Upload icon is the image icon in the input box, located next to the Baidu search button, used for uploading files
        args:
          file_path: ./tests/data/test.jpeg
      - action: Wait for image upload
      - verify: Verify that the input field displays an open palm/hand icon image
      - action: Enter "How many fingers are in the image?" in the search input box, then press Enter, wait 2 seconds
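
To make the two console `match_type` values in `ignore_rules` concrete, here is a small sketch of how such rules could be evaluated against a console message. This is an illustration only and not WebQA Agent's actual implementation: the function name is hypothetical, and it assumes that `regex` rules use an unanchored search and that a missing `match_type` falls back to `contains`.

```python
import re


def console_message_ignored(message: str, rules: list[dict]) -> bool:
    """Return True if any console ignore rule matches the message."""
    for rule in rules:
        pattern = rule["pattern"]
        # Assumption: default to substring matching when match_type is omitted.
        match_type = rule.get("match_type", "contains")
        if match_type == "regex" and re.search(pattern, message):
            return True
        if match_type == "contains" and pattern in message:
            return True
    return False


# The console rules from the sample configuration above.
CONSOLE_RULES = [
    {"pattern": "Failed to load resource.*favicon", "match_type": "regex"},
    {"pattern": "Warning:", "match_type": "contains"},
]

if __name__ == "__main__":
    for msg in (
        "Failed to load resource: 404 (favicon.ico)",
        "Warning: componentWillMount is deprecated",
        "Uncaught TypeError: x is undefined",
    ):
        print(msg, "->", console_message_ignored(msg, CONSOLE_RULES))
```

Under these assumptions, the first two messages would be suppressed and the `TypeError` would still be reported.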

📊 View Results

Test reports are generated in the reports/ directory. Open the HTML file to view detailed results.
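
If several runs have accumulated, a short script can locate the most recent report. The sketch below assumes only what is stated above (HTML files somewhere under `reports/`). The function name is hypothetical, and the exact sub-directory layout and file names that WebQA Agent produces are not assumed.

```python
from pathlib import Path
from typing import Optional


def latest_html_report(reports_dir: str = "reports") -> Optional[Path]:
    """Return the most recently modified .html file under reports_dir, or None."""
    root = Path(reports_dir)
    if not root.is_dir():
        return None
    # Search recursively, since reports may be grouped into per-run folders.
    candidates = list(root.rglob("*.html"))
    if not candidates:
        return None
    return max(candidates, key=lambda p: p.stat().st_mtime)


if __name__ == "__main__":
    report = latest_html_report()
    print(report if report else "No HTML report found under reports/")
```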

🛠️ Extending WebQA Agent Tools

WebQA Agent supports custom tool development for domain-specific testing capabilities.

Document	Description
Custom Tool Development	Quick reference for creating custom tools
LLM Context Document	Comprehensive guide for AI-assisted development, useful for vibe coding

We welcome contributions! Check out existing tools for examples.

🖥️ Deployment

For teams that need a persistent web dashboard with test management, scheduled tasks, and execution history, deploy the full-stack platform:

Method	Use Case	Guide
Local Development	Personal dev & debugging	deploy/README.md
Docker Compose	Single-machine / Team trial	deploy/README.md
Kubernetes	Production cluster	deploy/k8s/README.md

💡 Extending Internal Logic: WebQA Agent can be extended to fit your team's infrastructure, for example by integrating internal SSO, OSS object storage, or internal LLMs. You are free to customize and develop it to fit your needs. See deploy/README.md.

Note: The web dashboard platform is currently only available in Chinese.

🗺️ Roadmap

Interaction & Visualization: Real-time display of reasoning processes
Generate Mode Expansion: Integration of additional evaluation dimensions
Tool Agent Context Integration: More comprehensive and precise execution

🙏 Acknowledgements

natbot: Drive a browser with GPT-3
Midscene.js: AI Operator for Web, Android, Automation & Testing
browser-use: AI Agent for Browser control

📄 License

This project is licensed under the Apache 2.0 License.

Release history

Version	Release date
0.3.0.1 (this version)	Mar 31, 2026
0.3.0	Mar 30, 2026
0.2.3.3	Jan 29, 2026
0.2.3.2	Jan 28, 2026
0.2.3.1	Jan 27, 2026
0.2.3	Jan 16, 2026
0.2.2.post1	Dec 31, 2025
0.2.2	Dec 31, 2025
0.2.1	Dec 25, 2025

Download files

Source Distribution

webqa_agent-0.3.0.1.tar.gz (3.6 MB view details)

Uploaded Mar 31, 2026 Source

Built Distribution

webqa_agent-0.3.0.1-py3-none-any.whl (3.7 MB view details)

Uploaded Mar 31, 2026 Python 3

File details

Details for the file webqa_agent-0.3.0.1.tar.gz.

File metadata

Download URL: webqa_agent-0.3.0.1.tar.gz
Upload date: Mar 31, 2026
Size: 3.6 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webqa_agent-0.3.0.1.tar.gz
Algorithm	Hash digest
SHA256	`4d02130d3c7f15170518664c4daa51863feb288bffa3cad54efbecd291693a4b`
MD5	`02078ad281874a091feb15e619d3e425`
BLAKE2b-256	`242525d5c5cf22c45adbc8ac118255d1591a5d98a72ca54897f2c7d491c3f651`

Provenance

The following attestation bundles were made for webqa_agent-0.3.0.1.tar.gz:

Publisher: workflow.yml on MigoXLab/webqa-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: webqa_agent-0.3.0.1.tar.gz
- Subject digest: 4d02130d3c7f15170518664c4daa51863feb288bffa3cad54efbecd291693a4b
- Sigstore transparency entry: 1202810230
- Sigstore integration time: Mar 31, 2026
Source repository:
- Permalink: MigoXLab/webqa-agent@4723594da7ba81766e67ac40b14ef48102572a88
- Branch / Tag: refs/tags/v0.3.0.1
- Owner: https://github.com/MigoXLab
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@4723594da7ba81766e67ac40b14ef48102572a88
- Trigger Event: push

File details

Details for the file webqa_agent-0.3.0.1-py3-none-any.whl.

File metadata

Download URL: webqa_agent-0.3.0.1-py3-none-any.whl
Upload date: Mar 31, 2026
Size: 3.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webqa_agent-0.3.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c9c4b40c20cc09b330ed96fff6a6812170649d460337e5dcc3dc31298461ef1e`
MD5	`b999fef31908128ca5629033e11c32f4`
BLAKE2b-256	`8276574fa7f1dfeddd9de01d102b1c04a28ec06397015a8b55d9dc88d57705e4`

Provenance

The following attestation bundles were made for webqa_agent-0.3.0.1-py3-none-any.whl:

Publisher: workflow.yml on MigoXLab/webqa-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: webqa_agent-0.3.0.1-py3-none-any.whl
- Subject digest: c9c4b40c20cc09b330ed96fff6a6812170649d460337e5dcc3dc31298461ef1e
- Sigstore transparency entry: 1202810241
- Sigstore integration time: Mar 31, 2026
Source repository:
- Permalink: MigoXLab/webqa-agent@4723594da7ba81766e67ac40b14ef48102572a88
- Branch / Tag: refs/tags/v0.3.0.1
- Owner: https://github.com/MigoXLab
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@4723594da7ba81766e67ac40b14ef48102572a88
- Trigger Event: push
