GSoC Project_8 : Refining Zero-Shot Object Segmentation by Combining Vision Foundation Models #29541
Replies: 3 comments 3 replies
-
Dear Daan , Klaas , Samet My name is Seung Chan Kwon, and I am currently a second-year master's student at Soongsil University, pursuing studies in computer vision and Python. I am very interested in the GSoC 2025 project with OpenVINO: "Refining Zero-Shot Object Segmentation by Combining Vision Foundation Models." I am familiar with using segmentation models, having received the Chairman’s Award in the University National Center of Excellence in Software Joint AI Competition for my project on Satellite Image Building Area Segmentation. Additionally, I have experience working with the SAM (Segment Anything Model) in my undergraduate capstone project. I also contributed to solving the SAM tutorial issue, as you can see here: Therefore, I am very eager to contribute to this project.
Thank you for your time and consideration. @adrianboguszewski — could you please connect me with the mentors? |
Beta Was this translation helpful? Give feedback.
-
Hope you're all doing well! I want to follow up on my previous message and let you know that I've been diving into more resources related to the “Refining Zero-Shot Object Segmentation by Combining Vision Foundation Models” project. Although I haven't received a response to my first message yet, I've been spending time exploring the topic and getting a better understanding. But after thinking more about how to approach that idea, I have a few questions right now🤔:
Also, I submitted my first PR a while ago. It addressed a Broadcast operation format issue(f32) after looking into some conformance test-related source code and running a few tests. But, the task wasn’t really related to the project I‘m focusing now. I want to take on something more relevant to the project, so It would be great if you could give me some guidance. Looking forward to hearing from you. |
Beta Was this translation helpful? Give feedback.
-
@TempestBirds729804 @kwonseungchan |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Dear Daan, Klaas, and Samet,
你好!My name is Yan Zhang, a third-year undergraduate student majoring in Computer Science at Hefei University of Technology. I'm planning to apply for CS master's programs at U.S. universities for 2026 Fall. I’ve been particularly interested in computer vision, especially object detection and segmentation tasks. I am familiar with Python and C++ development and have experience working with computer vision frameworks such as OpenCV and PyTorch. I’ve spent some time working with CLIP-based feature extraction in past projects.
Previously, I participated as the fourth author in a national invention patent project — “A contrastive micro-expression recognition method based on text-position attention”, where we implemented a CLIP-based micro-expression recognition pipeline. I’m also involved in a pipeline inspection robot project, working on the visual environment recognition module. This work is in collaboration with the Faculty of Mechanical Engineering at my university.
The GSoC project “Refining Zero-Shot Object Segmentation by Combining Vision Foundation Models” immediately caught my attention. I find the idea of combining models like DINOv2 and SAM to build a more general and robust segmentation pipeline both exciting and meaningful. I’d love to explore this further and contribute to the OpenVINO ecosystem. 🚀
My first PR is #29515 .
Looking forward to learning and collaborating with the community!
Best regards,
Yan Zhang
Beta Was this translation helpful? Give feedback.
All reactions