In terms of functionality, we have made several changes to the git repository. First, we addressed network issues with Hugging Face and improved inference speed by introducing local inference for embeddings. Second, given the limitations of the SQLAlchemy framework, we rewrote the module that interacts with relational databases, enabling more flexible database operations. Third, because LLM products in practice often need to serve multiple users and multiple models, we added multi-tenancy support to ModelCache, along with preliminary compatibility for system commands and multi-turn dialogue.

<html>
<head>
<style>
table, th, td {
  border-collapse: collapse;
  text-align: left;
  padding: 10px;
  margin-left: 20px;
  margin-right: 20px;
}
.checkmark {
  font-size: 24px;
}
</style>
</head>
<body>
<table>
  <tr>
    <th rowspan="2">Module</th>
    <th rowspan="2">Function</th>
  </tr>
  <tr>
    <th>ModelCache</th>
    <th>GPTCache</th>
  </tr>
  <tr>
    <td rowspan="2">Basic Interface</td>
    <td>Data query interface</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>Data writing interface</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td rowspan="3">Embedding</td>
    <td>Embedding model configuration</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>Large model embedding layer</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td>BERT model long text processing</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="2">Large model invocation</td>
    <td>Decoupling from large models</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td>Local loading of embedding model</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="2">Data isolation</td>
    <td>Model data isolation</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>Hyperparameter isolation</td>
    <td></td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="3">Databases</td>
    <td>MySQL</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>Milvus</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>OceanBase</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="3">Session management</td>
    <td>Single-turn dialogue</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>System commands</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td>Multi-turn dialogue</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="2">Data management</td>
    <td>Data persistence</td>
    <td class="checkmark">☑</td>
    <td class="checkmark">☑</td>
  </tr>
  <tr>
    <td>One-click cache clearance</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td rowspan="2">Tenant management</td>
    <td>Support for multi-tenancy</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td>Milvus multi-collection capability</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
  <tr>
    <td>Other</td>
    <td>Long-short dialogue distinction</td>
    <td class="checkmark">☑</td>
    <td></td>
  </tr>
</table>
</body>
</html>

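As a rough illustration of the multi-tenancy and model data isolation rows in the table above, the sketch below keeps each model's cache entries in a separate namespace. It is a toy in-memory example, not ModelCache's implementation: the class `TenantScopedCache`, its methods, and the model names are all hypothetical.

```python
from collections import defaultdict


class TenantScopedCache:
    """Toy in-memory cache (hypothetical) with one namespace per model."""

    def __init__(self):
        # One dict per model name, loosely analogous to one table or
        # vector collection per tenant.
        self._stores = defaultdict(dict)

    def insert(self, model, query, answer):
        # Entries are written only into the namespace of the given model.
        self._stores[model][query] = answer

    def query(self, model, query):
        # Lookups never cross model boundaries; a miss returns None.
        return self._stores.get(model, {}).get(query)

    def clear(self, model):
        # Clearing one model's cache leaves every other model untouched.
        self._stores.pop(model, None)


cache = TenantScopedCache()
cache.insert("model_a", "hello", "hi from A")
cache.insert("model_b", "hello", "hi from B")
```

Here the same query text maps to a different answer per model, and `cache.clear("model_a")` would remove only that model's entries.
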
## Core-Features
ModelCache adopts the main idea of GPTCache and includes four core modules: adapter, embedding, similarity, and data_manager. The adapter module handles the business logic of various tasks and connects the embedding, similarity, and data_manager modules. The embedding module converts text into semantic vector representations, transforming user queries into vector form. The similarity (rank) module sorts and evaluates the similarity of the recalled vectors. The data_manager module manages the databases. To better support industrial applications, we have made the following architectural and functional upgrades:
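To make the module roles described above concrete, here is a minimal sketch of how an adapter could tie embedding, similarity, and data management together. It assumes a toy bag-of-words embedding and an in-memory store; the names `embed`, `cosine`, `DataManager`, `Adapter`, and the 0.8 threshold are illustrative assumptions, not ModelCache's actual API.

```python
import math
from collections import Counter


def embed(text):
    """Stand-in embedding: a bag-of-words count vector (a real system uses a model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class DataManager:
    """Stand-in storage layer: keeps (vector, answer) pairs in memory."""

    def __init__(self):
        self.rows = []

    def save(self, vector, answer):
        self.rows.append((vector, answer))

    def search(self, vector, top_k=3):
        # Recall the top_k stored entries, ranked by similarity to the query.
        scored = [(cosine(vector, v), answer) for v, answer in self.rows]
        return sorted(scored, key=lambda s: s[0], reverse=True)[:top_k]


class Adapter:
    """Connects embedding, similarity ranking, and storage (hypothetical)."""

    def __init__(self, data_manager, threshold=0.8):
        self.data_manager = data_manager
        self.threshold = threshold  # assumed cutoff for a cache hit

    def insert(self, query, answer):
        self.data_manager.save(embed(query), answer)

    def query(self, query):
        candidates = self.data_manager.search(embed(query))
        # Return the best match only if it is similar enough; otherwise miss.
        if candidates and candidates[0][0] >= self.threshold:
            return candidates[0][1]
        return None


adapter = Adapter(DataManager())
adapter.insert("what is the capital of france", "Paris")
```

A repeated query such as `adapter.query("what is the capital of france")` is served from the cache, while an unrelated query falls below the threshold and returns `None`, at which point a caller would fall back to the LLM.
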