Skip to content

feat: add Bilibili toolkit and end-to-end video understanding workflow#117

Open
17m-debug wants to merge 2 commits intojd-opensource:mainfrom
17m-debug:feat/video-understanding-feature
Open

feat: add Bilibili toolkit and end-to-end video understanding workflow#117
17m-debug wants to merge 2 commits intojd-opensource:mainfrom
17m-debug:feat/video-understanding-feature

Conversation

@17m-debug
Copy link

This PR introduces a set of multimodal tools and a comprehensive workflow to achieve "Search-Download-Analyze" automation

for video content.

✨ Key Features:

New Tools (tools/):

bilibili_search: Allows searching for videos on Bilibili via keywords.

bilibili_download: Supports downloading videos to local storage using Bilibili URLs.

video_understanding: Integrates local VLM (Vision Language Model) to analyze video content and supports RAG (Retrieval-
Augmented Generation) for Q&A.

New Workflow (workflows/):

End-to-End Pipeline: Implements a search -> download -> understand loop.

Logic: The agent searches for a topic, downloads the top match, and uses the local VLM to answer user queries based on the

video content.

✅ Use Case:

User query: "Search for the latest gameplay of Black Myth: Wukong on Bilibili and explain the boss mechanics shown in the video."

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant