[FEATURE] Implement take_screenshot skill #6

@edenreich

Summary

Implement the take_screenshot skill so the agent can capture screenshots. The skill should support three capture targets (the full page, a specific element, or the current viewport) and expose options for image format, JPEG quality, and output file path.

Acceptance Criteria

  • Capture full page screenshots with scrolling
  • Support element-specific screenshots using selectors
  • Support PNG and JPEG image formats
  • Add quality configuration for JPEG images (0-100)
  • Support custom file path specification
  • Handle file system permissions and path validation
  • Implement viewport-only screenshots (default behavior)
  • Add image metadata and capture timestamp
  • Handle screenshot failures gracefully
  • Support screenshot masking for sensitive content
  • Add comprehensive logging for capture operations
  • Add unit tests covering various screenshot scenarios
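
To make the criteria above more concrete, here is a minimal sketch of how the option validation could look before any browser call is made. It is only an illustration under stated assumptions: the names `ScreenshotOptions` and `build_capture_plan`, the field names, and the rule that `selector` and `full_page` are mutually exclusive are all hypothetical and not taken from the skill's real schema. The actual capture, masking, and logging are out of scope here.

```python
import os
from dataclasses import dataclass, field
from datetime import datetime, timezone
from pathlib import Path
from typing import List, Optional

# Extensions accepted for each supported format (assumed mapping).
EXTENSIONS = {"png": {"png"}, "jpeg": {"jpg", "jpeg"}}


@dataclass
class ScreenshotOptions:
    """Hypothetical option set; field names are illustrative only."""
    path: str
    full_page: bool = False            # False means viewport-only (the default)
    selector: Optional[str] = None     # element-specific capture
    image_format: str = "png"
    quality: Optional[int] = None      # JPEG only, 0-100
    mask_selectors: List[str] = field(default_factory=list)


def build_capture_plan(opts: ScreenshotOptions) -> dict:
    """Validate the options and return a normalized plan, or raise on bad input."""
    fmt = opts.image_format.lower()
    if fmt == "jpg":
        fmt = "jpeg"
    if fmt not in EXTENSIONS:
        raise ValueError(f"unsupported format: {opts.image_format!r}")

    if opts.quality is not None:
        if fmt != "jpeg":
            raise ValueError("quality applies to JPEG images only")
        if not 0 <= opts.quality <= 100:
            raise ValueError("quality must be between 0 and 100")

    # Assumption: an element capture and a full-page capture cannot be combined.
    if opts.selector and opts.full_page:
        raise ValueError("selector and full_page are mutually exclusive")

    target = Path(opts.path)
    if target.suffix.lower().lstrip(".") not in EXTENSIONS[fmt]:
        raise ValueError(f"file extension does not match format {fmt!r}")
    if not target.parent.is_dir():
        raise ValueError(f"directory does not exist: {target.parent}")
    if not os.access(target.parent, os.W_OK):
        raise PermissionError(f"directory is not writable: {target.parent}")

    if opts.selector:
        mode = "element"
    elif opts.full_page:
        mode = "full_page"
    else:
        mode = "viewport"

    return {
        "mode": mode,
        "format": fmt,
        "quality": opts.quality,
        "path": str(target),
        "mask_selectors": list(opts.mask_selectors),
        # Capture timestamp in UTC, ISO 8601, for the image metadata.
        "captured_at": datetime.now(timezone.utc).isoformat(),
    }
```

Validating up front keeps failure handling simple: invalid input is rejected with a clear error before a capture is attempted, so only genuine capture failures need to be handled around the browser call itself. The unit tests from the last criterion could start by exercising exactly these branches.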

Labels

enhancement (New feature or request)