Skip to content

Screen Control Definition Format: Describe an app's UI for more accurate screen control, reducing number of prompts to AI control application

Notifications You must be signed in to change notification settings

joshakaw/screen-control-definition-format

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Screen Control Definition Format (scdf)

A format that allows for a standard way to describe a user interface for efficient AI screen control. It can describe any app regardless of platform (Windows, Linux, Web, etc).

It can describe:

  • Location of buttons
  • Functionality of a certain window
  • Links/connections between different windows

And it's not just read-only. An AI screen control app could also update the SCDF (either locally or on a hosted database) if it encounters a new UI with primative information.

Draft 1 is available Features:

  • Applications & Windows
  • Controls (buttons, inputs, lists, etc.)
  • Control location & type
  • Events/actions associated with controls
  • Links between windows (navigation, relationships)
  • Metadata (AI-detected vs human-defined, timestamps, notes)

An example of how it would work:

  • An AI agent detects a new app window → inserts into scdf_windows
  • It scans for buttons → inserts into scdf_controls
  • It sees a "Login" button that opens a new window → inserts into scdf_window_links
  • It describes what the button does → scdf_control_events
  • AI logs an annotation for low confidence → scdf_annotations

About

Screen Control Definition Format: Describe an app's UI for more accurate screen control, reducing number of prompts to AI control application

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published