You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/modules/ROOT/pages/module-model-testing.adoc
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -62,7 +62,7 @@ There are a three things to take note of in this JSON file.
62
62
63
63
== The initial set of tests
64
64
65
-
We have included an initial set of tests which should pass when run against the *Parasol Chat* model.
65
+
We have included an initial set of tests which should pass when run against the *Parasol Chat* model.
66
66
67
67
For each of these tests we're provided an expected response, so for example, when we ask the question: *"In one sentence, who founded Parasol Insurance?"* we should get a response similar to: *"Parasol Insurance was founded in 1936 by James Falkner and James Labocki."*
68
68
@@ -107,7 +107,7 @@ The output should be similar to the following.
107
107
[.console-output]
108
108
[source,bash]
109
109
----
110
-
npm WARN skipping integrity check for git dependency ssh://[email protected]/kenlimmj/rouge.git
110
+
npm WARN skipping integrity check for git dependency ssh://[email protected]/kenlimmj/rouge.git
111
111
npm WARN deprecated [email protected]: Glob versions prior to v9 are no longer supported
112
112
npm WARN deprecated [email protected]: This module is not supported, and leaks memory. Do not use it. Check out lru-cache if you want a good and tested way to coalesce async requests by a key value, which is much more comprehensive and powerful.
113
113
@@ -142,7 +142,7 @@ When the test run is complete you should see.
Both of the tests should have passed successfully.
145
+
Both of the tests should have passed successfully.
146
146
147
147
== Failing the tests
148
148
@@ -152,7 +152,7 @@ As we discussed above, one of our tests is testing responses to the question: *"
152
152
153
153
To answer this question our model has been trained with knowledge about the history of Parasol Insurance.
154
154
155
-
Let's change the model our tests are performed against to an untrained model (*merlinite-7b-lab*). To do this, open the `.vscode/settings.json` file to `replace` the contents with:
155
+
Let's change the model our tests are performed against to an untrained model (*granite-30-8b-instruct*). To do this, open the `.vscode/settings.json` file to `replace` the contents with:
156
156
157
157
[.console-input]
158
158
[source,json,subs="+attributes,macros+"]
@@ -169,7 +169,7 @@ Let's change the model our tests are performed against to an untrained model (*m
169
169
}
170
170
----
171
171
172
-
Go back to the `Testing` panel. Then, click on the `Run Tests` to run the tests again.
172
+
Go back to the `Testing` panel. Then, click on the `Run Tests` to run the tests again.
Copy file name to clipboardExpand all lines: content/modules/ROOT/pages/module-private-docs.adoc
+6-2Lines changed: 6 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -149,7 +149,11 @@ You should notice that the chatbot can answer these questions based on the claim
149
149
150
150
image::private-docs/chat-policies-unknown.png[Chatbot questions about policies]
151
151
152
-
The answer you will get back might be different from the one in the screenshot above, but you'll likely observe that the chatbot struggles with this question, as it doesn't have access to Parasol's specific policies, *in particular the 6 month policy term limit*. It might even respond with an incorrect decision! If only there was a way to automatically retrieve this information and provide it to the chatbot.
152
+
The answer you will get back might be different from the one in the screenshot above, but you'll likely observe that the chatbot struggles with this question, as it doesn't have access to Parasol's specific policies, *in particular the 6 month policy term limit*. It might even respond with an incorrect decision, but sound very authoritative when doing so:
By doing this we have now registered the `NotificationService` file as class that contains one or more `@Tool` annotated methods. These methods become available to the LLM to be called.
321
+
By doing this we have now registered the `NotificationService` file as a class that contains one or more `@Tool` annotated methods. These methods become available to the LLM to be called.
0 commit comments