Skip to content

Commit 6bd5327

Browse files
aashipandyakartikpersistentpraveshkumar1988prakriti-solankeykaranchellani
authored
Staging To Main (#190)
* added types for API response * type fixes * Wiki issue fixed * connect URL * Chatbot cross icon visibility * Update gemini_llm.py * Update gemini_llm.py * Update the library google-cloud-aiplatform * Update logging level from Info to DEBUG * sources req * Update gemini_llm.py * Update gemini_llm.py * Update Dockerfile * Update Dockerfile * Update Dockerfile * review comments solution * Update Dockerfile * Update Dockerfile * query changes * dockerfile changes * Update gemini_llm.py * dockerfile changes * Update gemini_llm.py * query changes for entity * format and lint fixes * fixed doc no issue * removed settings icon * gemini model changes using langserve * LLM Model Changes * llm model changes * response changes * update vector index * update logging to track embeddings * connection modal changes * table-query-check * MOve create source node try catch in score.py * create seprate function for each source URL * Gemini security remove and added sleep * Update requirements.txt * URL scan Code refactoring * query changes * new connect modal * Update Extract function * Update requirements.txt * Extract API code improvement * folder name correction for gcs bucket * Remove unused function and move make_relationship function to other class * Added sentence-transformer embeddings and update read file as temp from local * Gemini security remove and added sleep * Update requirements.txt * Revert "Update requirements.txt" This reverts commit 7237be5. * Merging issue fixed * combining 4 chunks * driver-changes * updated list name * made chunks to combine configurable * added multiple models and embeddings for chatbot * fixed configurable source issue * added model param for QNA API * lint fixes * Added Common function * modified prompt * fixed chat response issue * modal changes * merged Update modal * llm dropdown changes * session changes * prefilling the input field fix * modified prompt * Modified prompt * added env variable for llms * Modified prompt and added sources to response * Modified output response structure * path-change * added env values to docker file * format fixes * graph fix * Modified source extracting function * added source types * added sources links in the chat bot message * conditional source check * File uploaded into chunks * function name change * format fixes * added chunking frontend code * added new status * Modified prompt * added new status after upload completion * Modified prompt * Issue fixed relation to embeding model and large file upload * added progress bar * added progress staus in table * legends check * removed binary filesstate * backend processing source fix * alert fix * Modified commit * openai error resolve * added file name for all sources * Issues Fixed * Gemini versions and library realted changes updated * Update Model details * Updated prompt * Import graphDbClass on score.py * posix path error fix in pathlib * changed chat bot icon * added Gemini 1.5Pro * color changes * format fixes * backend url change --------- Co-authored-by: kartikpersistent <[email protected]> Co-authored-by: Pravesh Kumar <[email protected]> Co-authored-by: Prakriti Solankey <[email protected]> Co-authored-by: karanchellani <[email protected]> Co-authored-by: vasanthasaikalluri <[email protected]>
1 parent 2b787c4 commit 6bd5327

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

60 files changed

+2547
-1865
lines changed

.gitignore

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -162,4 +162,6 @@ cython_debug/
162162
temp.pdf
163163
google-cloud-sdk
164164
google-cloud-cli-469.0.0-linux-x86_64.tar.gz
165-
/data/llm-experiments-387609-c73d512ca3b1.json
165+
/data/llm-experiments-387609-c73d512ca3b1.json
166+
/backend/src/merged_files
167+
/backend/src/chunks

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -74,7 +74,7 @@ KNN_MIN_SCORE = ""\
7474
userName: Username to use for graph creation ( if None will use username from config file )
7575
password: Password to use for graph creation ( if None will use password from config file )
7676
file: File object containing the PDF file path to be used
77-
model: Type of model to use ('OpenAI GPT 3.5' or 'OpenAI GPT 4')
77+
model: Type of model to use ('Gemini Pro' or 'Diffbot')
7878
7979
Returns:
8080
Json response to API with fileName, nodeCount, relationshipCount, processingTime,
@@ -114,7 +114,7 @@ KNN_MIN_SCORE = ""\
114114
https://github.com/neo4j-labs/llm-graph-builder/assets/121786590/b725a503-6ade-46d2-9e70-61d57443c311
115115
116116
## Links
117-
The Public [ Google cloud Run URL](https://staging-frontend-dcavk67s4a-uc.a.run.app/).
117+
The Public [ Google cloud Run URL](https://devfrontend-dcavk67s4a-uc.a.run.app).
118118
[Workspace URL](https://workspace-preview.neo4j.io/workspace)
119119
120120

backend/Dockerfile

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,4 +9,6 @@ RUN apt-get update \
99
&& apt install -y tesseract-ocr \
1010
&& export LD_LIBRARY_PATH=/usr/lib/x86_64-linux-gnu:$LD_LIBRARY_PATH \
1111
&& pip install --no-cache-dir --upgrade -r /code/requirements.txt
12-
CMD ["uvicorn", "score:app", "--host", "0.0.0.0", "--port", "8000"]
12+
13+
CMD ["uvicorn", "score:app", "--host", "0.0.0.0", "--port", "8000"]
14+

backend/example.env

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,4 +12,5 @@ KNN_MIN_SCORE = ""
1212
LANGCHAIN_API_KEY = ""
1313
LANGCHAIN_PROJECT = ""
1414
LANGCHAIN_TRACING_V2 = ""
15-
LANGCHAIN_ENDPOINT = ""
15+
LANGCHAIN_ENDPOINT = ""
16+
NUMBER_OF_CHUNKS_TO_COMBINE = ""

backend/requirements.txt

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -28,8 +28,8 @@ docstring_parser==0.16
2828
effdet==0.4.1
2929
emoji==2.10.1
3030
exceptiongroup==1.2.0
31-
fastapi==0.110.0
32-
fastapi-health==0.4.0
31+
fastapi
32+
fastapi-health
3333
filelock==3.13.1
3434
filetype==1.2.0
3535
flatbuffers==23.5.26
@@ -38,7 +38,7 @@ frozenlist==1.4.1
3838
fsspec==2024.2.0
3939
google-api-core==2.18.0
4040
google-auth==2.29.0
41-
google-cloud-aiplatform==1.45.0
41+
google-cloud-aiplatform
4242
google-cloud-bigquery==3.19.0
4343
google-cloud-core==2.4.1
4444
google-cloud-resource-manager==1.12.3
@@ -78,6 +78,8 @@ langchain-text-splitters==0.0.1
7878
langdetect==1.0.9
7979
langsmith==0.1.31
8080
layoutparser==0.3.4
81+
langserve
82+
langchain-cli
8183
lxml==5.1.0
8284
MarkupSafe==2.1.5
8385
marshmallow==3.20.2
@@ -156,9 +158,10 @@ unstructured-client
156158
unstructured-inference
157159
unstructured.pytesseract
158160
urllib3
159-
uvicorn==0.29.0
161+
uvicorn
160162
wikipedia==1.4.0
161163
wrapt==1.16.0
162164
yarl==1.9.4
163165
youtube-transcript-api==0.6.2
164166
zipp==3.17.0
167+
sentence-transformers

0 commit comments

Comments
 (0)