Release date: 2024-05-20
Version 1.4.0 focuses on API enhancements, with core highlights including comprehensive function call support, API caching capabilities, and code execution for Google Gemini models. This release also includes multiple performance optimizations and bug fixes.
- Comprehensive Function Call Support: All API endpoints now support function calls, including the general OpenAI-compatible chat completion API, Google Gemini API, and Azure OpenAI Response API
- API Caching Feature: New caching capability via the `/v1-cached` and `/v1-cached-createOnly` endpoints, with significantly optimized performance
- Google Gemini Code Execution: Google Gemini models now support code execution
- Performance Optimizations: Multiple API performance improvements, including async processing and reduced duplicate database calls
This version implements function call support across all API endpoints:
OpenAI Compatible API
- General chat completion API (`/v1/chat/completions`) fully supports tool calls
- New `ToolCallSegment` type for tool call fragments in streaming responses
- Optimized function call response logic to ensure correct first-response timing
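As a minimal sketch of what a tool-call request to the OpenAI-compatible endpoint looks like, the payload below follows the standard chat-completions tool-calling format; the model name and the `get_weather` function are illustrative placeholders, not part of this project:

```python
import json

# Illustrative tool-call request body for POST /v1/chat/completions.
# Model name and function definition are placeholders.
payload = {
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    # Streamed responses deliver tool calls as fragments that the
    # client (or the ToolCallSegment type server-side) reassembles.
    "stream": True,
}

body = json.dumps(payload)
```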
Google AI / Gemini API
- Google AI now supports tool calls
- Google Gemini models now support code execution functionality
- Both tool calls and code execution work seamlessly in chat completion flows
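For code execution, Google's generative-language REST format enables the capability with an empty `code_execution` tool entry; the sketch below assumes the gateway forwards this shape unchanged, and the prompt text is illustrative:

```python
import json

# Illustrative generateContent request body enabling Gemini code execution.
gemini_request = {
    "contents": [
        {"role": "user", "parts": [{"text": "Compute the sum of the first 50 primes."}]}
    ],
    # An empty code_execution object asks the model to write and run code itself.
    "tools": [{"code_execution": {}}],
}

body = json.dumps(gemini_request)
```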
Azure OpenAI Response API
- Response API now supports tool calls
- Verified through testing that the Response API works correctly
- Fixed issue where reasoning content was not displayed
To improve API performance and reduce costs, a new user-level API caching mechanism has been added:
Cache Endpoints
- `/v1-cached`: Chat completion endpoint with caching enabled
- `/v1-cached-createOnly`: Chat completion endpoint for cache creation only
Cache Features
- New `UserApiCache` entity for storing user API request caches
- Cache system supports tool call scenarios
- Added cache metrics for monitoring cache effectiveness
- Cache usage saved asynchronously to avoid blocking main flow
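The asynchronous usage-save pattern can be sketched as a fire-and-forget background task, so the response path never waits on the statistics write. The function names here are illustrative, not the project's actual API:

```python
import asyncio

usage_log: list[str] = []  # stands in for the cache-usage statistics store

async def record_cache_usage(cache_key: str) -> None:
    # Stands in for an async DB write of cache-usage statistics.
    usage_log.append(cache_key)

async def handle_request(cache_key: str) -> str:
    # Fire-and-forget: scheduling the write does not block the response.
    asyncio.create_task(record_cache_usage(cache_key))
    return f"cached-response-for-{cache_key}"

async def main() -> str:
    result = await handle_request("user42:prompt-hash")
    # Yield briefly so the background task finishes before the loop closes;
    # a long-lived server event loop would not need this step.
    await asyncio.sleep(0.05)
    return result

result = asyncio.run(main())
```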
Performance Improvements
- Eliminated duplicate authentication database calls
- Async client info fetching to optimize API response speed
- Multiple cache performance optimizations significantly improve API throughput
Logging Optimizations
- Ignore routine log output from OpenAI compatible controller
- Suppress error logs for o3/o4-mini models
- Added necessary warning prompts
Monitoring Enhancements
- Added cache metrics statistics
- Improved observability of tool call processes
Message Processing
- Added merge logic in full chat completion
- Fixed issue where reasoning content was not displayed
- Optimized first response timing
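The merge logic for streamed tool calls can be sketched as accumulating per-index fragments into complete calls, in the spirit of the `ToolCallSegment` type described above; the fragment field names mirror OpenAI-style streaming deltas and are an assumption, not the project's internal schema:

```python
def merge_tool_call_segments(segments: list[dict]) -> list[dict]:
    """Accumulate per-index tool-call fragments into full tool calls."""
    calls: dict[int, dict] = {}
    for seg in segments:
        call = calls.setdefault(seg["index"], {"name": "", "arguments": ""})
        if seg.get("name"):
            call["name"] = seg["name"]          # name arrives once, in one fragment
        call["arguments"] += seg.get("arguments", "")  # argument JSON arrives in chunks
    return [calls[i] for i in sorted(calls)]

# Example: three fragments for one streamed tool call.
fragments = [
    {"index": 0, "name": "get_weather", "arguments": ""},
    {"index": 0, "arguments": '{"city": '},
    {"index": 0, "arguments": '"Paris"}'},
]
merged = merge_tool_call_segments(fragments)
# merged[0] == {"name": "get_weather", "arguments": '{"city": "Paris"}'}
```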
Compatibility
- Improved compatibility with various model providers
- Ensured tool calls work correctly in different scenarios
- `UserApiCache`: User API cache entity
- `ToolCallSegment`: Tool call segment type
- Added: `/v1-cached` - Chat completion with caching enabled
- Added: `/v1-cached-createOnly` - Chat completion for cache creation only
- Added `UserApiCache`-related table structures
- Added cache usage statistics fields
- Caching Feature: To use API caching, call the new `/v1-cached` or `/v1-cached-createOnly` endpoints
- Tool Calls: All APIs now support function calls without special configuration
- Performance Improvements: This version includes multiple performance optimizations; API response speed will be noticeably improved after upgrade
- The caching feature is experimental; thorough testing is recommended before enabling it in production
- Google Gemini code execution functionality requires model support
View the complete commit history: 1.3.1.794...1.4.0.815
Key commits:
- suppress o3/o4-mini error log
- confirmed response api works
- google gemini supports code execution
- correct function call response logic
- correct first response tick
- google ai now supports tool call
- response api also supports tool call
- fix reasoning content not show issue
- save cache usage in async way
- avoid duplicated authentication db call for api
- add cache metrics
- optimize performance by async client info call
- cache also supports tool call
- confirmed cache working
- implement cache support
- initial commit of google ai tool
- Add merge logic in full chat completion
- add ToolCallSegment
- initial commit of UserApiCache