-
Task Type:
- Chat/conversation
- Text completion
- Code generation
- Embeddings
-
Model Characteristics:
- Context window size
- Training data recency
- Special capabilities (function calling, etc.)
-
Performance Needs:
- Response time
- Output quality
- Cost considerations
-
Rate Limits:
- Requests per minute/day
- Token limits
- Concurrent requests
- Define your use case
- Identify must-have features
- Compare models in playground
- Test with your specific prompts
- Evaluate performance
- GPT-4 variants
- Claude models
- Open-source alternatives
- Specialized models (code, etc.)
- Review the model card in the Marketplace for:
- Supported capabilities (chat, code, vision, embeddings)
- Rate limits and pricing
- Example code and SDK support
- Provider-specific features (function calling, streaming, etc.)