This folder contains three hands-on tasks to complete with Node.js, each using a different model modality (vision, audio, and text) from the GitHub Models Marketplace.
- Goal: Use a vision model to generate a caption for an image.
- Steps:
  - Write a Node.js script that sends an image to a vision model (e.g., GPT-4V).
- Input: Any image file (e.g., 'sample-image.jpg').
- Output: The generated caption.
- Reference: SDK Options
- Goal: Use an audio model to transcribe speech from an audio file.
- Steps:
  - Write a Node.js script that sends an audio file to a speech-to-text model (e.g., Whisper).
- Input: Any audio file (e.g., 'sample-audio.mp3').
- Output: The transcribed text.
- Reference: SDK Options
- Goal: Use an open-source model to generate text based on a prompt.
- Steps:
  - Write a Node.js script that calls an open-source model (e.g., Mistral) for text generation.
- Input: Any prompt string.
- Output: The generated text.
- Reference: SDK Options
Complete these tasks, upload your solutions to GitHub, and submit your repository link using the form in
task/README.md to claim your course certificate.