You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
(not a feature/solution request, just a general query)
asking whether such a solution exists already or if I am not looking harder since I am not tech-oriented, I want there to be a script that on being invoked, takes a partial screenshot of my screen, performs OCR on it and makes an embedding(?) of the ocr result which is progressively added to each time I run the script.
The script can prompt me the option to choose from whether to:
embed the contents of the screenshot, or...
to embed and search for semantically similar notes, or...
to embed, search AND ask a question based on the semantically similar notes as context.
currently, I have a very crude implementation of this which involves scripts for taking screenshot and saving the OCR result as md files inside an obsidian vault where I can perform all the required functions, I am barely inconvenienced by it since it gets the job done very well, but at the same time prompting gemini to create me these neat little scripts which help with my workload tremendously, makes me believe that it should be possible to implement something like this given I know the logic of it.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
thanks for this wonderful software.
(not a feature/solution request, just a general query)
asking whether such a solution exists already or if I am not looking harder since I am not tech-oriented, I want there to be a script that on being invoked, takes a partial screenshot of my screen, performs OCR on it and makes an embedding(?) of the ocr result which is progressively added to each time I run the script.
The script can prompt me the option to choose from whether to:
currently, I have a very crude implementation of this which involves scripts for taking screenshot and saving the OCR result as md files inside an obsidian vault where I can perform all the required functions, I am barely inconvenienced by it since it gets the job done very well, but at the same time prompting gemini to create me these neat little scripts which help with my workload tremendously, makes me believe that it should be possible to implement something like this given I know the logic of it.
Beta Was this translation helpful? Give feedback.
All reactions