This project, eye-in-hand, aims to understand a person's action based on their gaze, hand trajectory, and hand shape. From these cues, we aim to predict which object the person is likely to pick up.
We use pre-trained models to identify what is happening in the frame with regard to the person's actions.
The approach is inspired by the NVIDIA Cosmos-Reason1 paper: https://d1qx31qr3h6wln.cloudfront.net/publications/Cosmos_Reason1_Paper.pdf
Stages (a code sketch follows this list):
- System starts
- Check the eye gaze direction -> does an object exist along the gaze? -> set value
- Check whether hand movement exists -> plot the hand movement trajectory -> set value
- Check the hand shape -> which object matches this hand shape? -> set value
- Use the three values to predict the next action -> repeat until the action is complete
- Check whether the action has been performed -> if not, repeat the process
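The loop below is a minimal Python sketch of these stages. The helper functions (check_gaze, check_hand_trajectory, check_hand_shape, predict_action, action_completed) are hypothetical placeholders standing in for the real detection and reasoning modules, not the actual implementation in main.py.

```python
# Minimal sketch of the anticipation loop described above.
# All helper functions are hypothetical placeholders.

def check_gaze(frame):
    """Return a value for the object (if any) found along the gaze direction."""
    return None  # placeholder

def check_hand_trajectory(frame):
    """Return a value describing the hand movement trajectory, if movement exists."""
    return None  # placeholder

def check_hand_shape(frame):
    """Return a value for the object that the current hand shape fits."""
    return None  # placeholder

def predict_action(gaze_value, trajectory_value, shape_value):
    """Combine the three values to predict the person's next action."""
    return None  # placeholder

def action_completed(frame, action):
    """Check whether the predicted action has been performed."""
    return False  # placeholder

def run_anticipation_loop(frames):
    for frame in frames:
        gaze_value = check_gaze(frame)
        trajectory_value = check_hand_trajectory(frame)
        shape_value = check_hand_shape(frame)
        action = predict_action(gaze_value, trajectory_value, shape_value)
        if action_completed(frame, action):
            break  # the anticipated action was performed
        # otherwise the process repeats on the next frame
```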
How to run this:
Step 1 Clone the repository:
git clone https://github.com/SuryaViswanath/eye-in-hand.git
Step 2 Install the dependencies:
pip install -r requirements.txt
Step 3 Download and set up Ollama:
https://ollama.com/download
After installing Ollama, download and run the DeepSeek-R1 model:
ollama run deepseek-r1:1.5b
Start the Ollama server, which provides LLM inference for the reasoning step:
ollama serve
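As a quick check that the local LLM endpoint is reachable, the sketch below queries Ollama's REST API directly (assuming the `requests` package is installed). The endpoint and model name follow standard Ollama usage; the prompt is only illustrative and is not the exact query that main.py sends.

```python
# Sketch: query the local Ollama server via its REST API.
# Requires `ollama serve` to be running and `pip install requests`.
import requests

response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "deepseek-r1:1.5b",
        "prompt": "Given the gaze direction, hand trajectory, and hand shape, "
                  "which object is the person most likely about to pick up?",
        "stream": False,  # return one JSON object instead of a token stream
    },
)
print(response.json()["response"])
```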
Step 4 Run the system:
python main.py
If you like what you see, please star the repo.
To Do:
- Add object detection
- Add live action anticipation