diff --git a/web/frontend-feedback-analytics/presentation/css/layout.scss b/web/frontend-feedback-analytics/presentation/css/layout.scss index 5d0536f94..6853f737c 100644 --- a/web/frontend-feedback-analytics/presentation/css/layout.scss +++ b/web/frontend-feedback-analytics/presentation/css/layout.scss @@ -68,3 +68,25 @@ .reveal .justify-start { justify-content: flex-start; } .reveal .justify-center { justify-content: center; } .reveal .justify-end { justify-content: flex-end; } + + +.reveal table { + font-size: 0.6em; + width: 100%; +} +.reveal th { + background-color: #4CAF50; + color: white; +} +.reveal tr:nth-child(even) { + background-color: #f2f2f2; +} + +.reveal .row { + display: flex; + width: 100%; +} +.reveal .col { + flex: 1; + padding: 0 10px; +} diff --git a/web/frontend-feedback-analytics/presentation/index.html b/web/frontend-feedback-analytics/presentation/index.html index e24fa8af4..5c4ddd7a9 100644 --- a/web/frontend-feedback-analytics/presentation/index.html +++ b/web/frontend-feedback-analytics/presentation/index.html @@ -1,389 +1,679 @@ -
- - + + + +
-
-
-
-
-
-
-
- | Data storage | -Neo4j Integration | -Advanced Querying | -
|---|---|---|
| We use a graph database to store the data. | -- Neo4J is leveraged for both the graph database and the - vector store. - | -- Neo4J enables advanced queries, such as clustering the data - and finding communities. - | -
| Vector store is utilized for storing embeddings. | -- | - These tasks are more efficient and easier to implement using - a graph database. - | -
High Performance Computing for pre-processing the data
-
-
-
-
-
-
-
-
-
- - To improve the AI model we need to annotate and format the data - properly. After the data is annotated we can use it to train the - different models. -
-- If the top sources are not accurate we can retrain the embedding - model based on human feedback. -
-
- - For improving our retrieval and enhancing our result we are using - an AI technique called NER (Named Entity Recognition) to annotate - the data. -
-- This can be done manually with tools such as Diffgram or Doccano - or can be automated using an AI model to pre-annotate. -
-
-
- Questions?
-- All of our presentation and diagrams can be found in our - - github repository. - -
-
+
+
+
+
+
+
+
+ | Data storage | +Neo4j Integration | +Advanced Querying | +
|---|---|---|
| We use a graph database to store the data. | ++ Neo4J is leveraged for both the graph database and the + vector store. + | ++ Neo4J enables advanced queries, such as clustering the data + and finding communities. + | +
| Vector store is utilized for storing embeddings. | ++ | + These tasks are more efficient and easier to implement using + a graph database. + | +
High Performance Computing for pre-processing the data
+
+
+
+
+
+
+
+
+
+ + To improve the AI model we need to annotate and format the data + properly. After the data is annotated we can use it to train the + different models. +
++ If the top sources are not accurate we can retrain the embedding + model based on human feedback. +
+
+ + For improving our retrieval and enhancing our result we are using + an AI technique called NER (Named Entity Recognition) to annotate + the data. +
++ This can be done manually with tools such as Diffgram or Doccano + or can be automated using an AI model to pre-annotate. +
+
+
+ + Consistent prompt design ensures standardized extraction across + models +
+| Feature | +MOLMO 7B | +Amazon Nova Pro | +Claude 3.5 Sonnet | +
|---|---|---|---|
| Deployment | +Self-hosted (OpenShift) | +AWS Bedrock | +AWS Bedrock | +
| Parameters | +7 Billion | +Proprietary | +Proprietary (175B+) | +
| Processing Time | +15-20 sec/image (~ 24 hours) |
+ 2-3 sec/image (~ 3-3.5 hours) |
+ 2-4 sec/image (~ 3-3.5 hours) |
+
| Input Cost | +Free (self-hosted) | +$0.0008/1K tokens | +$0.003/1K tokens | +
| Output Cost | +Free (self-hosted) | +$0.0032/1K tokens | +$0.015/1K tokens | +
| Aspect | +MOLMO 7B | +Amazon Nova Pro | +Claude 3.5 Sonnet | +
|---|---|---|---|
| Content Accuracy | +Moderate | +Good | +Excellent | +
| Prompt Following | +Inconsistent | +Variable | +Highly consistent | +
| Output Structure | +Often deviates | +Sometimes deviates | +Follows structure precisely | +
| Legal Domain | +Basic understanding | +Good understanding | +Strong contextual grasp | +
| Overall Quality | +Acceptable | +Good | +Superior | +
{
+ "Acts": {
+ "Election Act": {
+ "96106_greatseal.gif": "Image Type: Official seal...",
+ // More images...
+ },
+ // More acts...
+ },
+ "Regulations": {
+ "Health Act": {
+ "diagram.png": "Image Type: Technical diagram...",
+ // More images...
+ }
+ }
+}
+ Questions?
++ All of our presentation and diagrams can be found in our + + github repository. + +
+