
Commit 2f8e901

prakriti-solankey, vasanthasaikalluri, kartikpersistent, praveshkumar1988, and abhishekkumar-27 committed
Dev (#918)
* Add communities checkbox to graph viz (#739): DataScience icon, create_communities checkbox, GDS status in the connect call, conditional check for community chat modes, isGdsActive check, graph-query and filter-logic updates, checkbox show/hide handling
* Added time to process file in the extract API and its functions
* Add severity level in cloud logging; update log_struct method to add severity
* Changed name; modified the query; updated entities param; added document to details; format fixes
* Added global embedding; added database; modified is_entity; modified chunk entities
* Added Secweb to fix security issues; removed QA integration
* Created Neo4j vector index from existing index; modified script
* Integrate local search to chat details (#746): added the communities tab, enabled the top entities mode, rearranged tab order, added a loader to the sources tab for entity search + vector, fixed the chat mode prop
* Removal of unused code; removed entity label
* Added description to chat mode menu (#743): tooltips and descriptions for the chat mode options
* Community check; entity empty label fix and icon; Update Utils.ts
* Retry processing: node and relationship count update condition for start-from-beginning (#737), carrying the squashed history of Remove TotalPages when saving a file locally (#684), file_name reference and verify_ssl fix (#683), user-flow changes for recreating the supported vector index (#682), concurrent processing of files (#665), hybrid chat modes (#670), status source and type filtering (#664), label and checkbox placement changes (#675), node id check (#663), and the Dockerfile VITE-label revert/reapply commits
* Env changes; format fixes; set retry status; retry processing backend
* Retry processing UI: retry icon on rows, retry dialog, integrated the retry processing and extract APIs, NDL toast component, replaced forEach with a plain for loop for better performance, types improvements, auto-close the retry popup on API success, reset the node and relationship count on retry, enter-key events on the popups, fixed the Wikipedia icon on the large-file alert popup, start-from-last-processed-chunk logic, renamed the status from Retry to Reprocess, tooltip, wording, and size changes, updated node count for start-from-beginning
* Vite changes in docker compose
* Uncommented the retry processing; removed __Entity__ labels; spell and lint fixes
* Fixed post-processing method invocation issue for an odd number of files
* Added file source and name in chunks; removed preload=True from HSTS
* Graph communities (#748): UI changes, mode enable/disable, separated sources, entities, chunks, and communities into components, added filename and source to chunk info
* Aria label addition; used the URL class for the host URL check; host-level check
* Update security header; encryption of localStorage values; mode-selection changes
* Added local chat history; added Neo4j from existing index to entity vector mode; label changes; commented security header
* Communities (#721): community creation, removed tqdm and __Entity__ labels, removed the graph object, modified queries, added properties and entity labels
* Post-processing call after all files complete (#716), carrying the squashed Dev/staging/main history (#532, #535, #537, #697, #495, #375, #461, #462, …)

Co-authored-by: vasanthasaikalluri, kartikpersistent, Pravesh Kumar, Prakriti Solankey, abhishekkumar-27, aashipandya, Jayanth T, Jerome Choo, jeromechoo, Ajay Meena, Morgan Senechal, karanchellani
1 parent 7e06073 commit 2f8e901

File tree

10 files changed: +174 -106 lines changed

backend/score.py

Lines changed: 33 additions & 2 deletions
```diff
@@ -36,6 +36,7 @@
 from src.ragas_eval import *
 from starlette.types import ASGIApp, Message, Receive, Scope, Send
 import gzip
+from langchain_neo4j import Neo4jGraph
 
 logger = CustomLogger()
 CHUNK_DIR = os.path.join(os.path.dirname(__file__), "chunks")
@@ -81,10 +82,9 @@ async def __call__(self, scope: Scope, receive: Receive, send: Send):
         await gzip_middleware(scope, receive, send)
 app = FastAPI()
 # SecWeb(app=app, Option={'referrer': False, 'xframe': False})
-app.add_middleware(ContentSecurityPolicy, Option={'default-src': ["'self'"], 'base-uri': ["'self'"], 'block-all-mixed-content': []}, script_nonce=False, style_nonce=False, report_only=False)
+# app.add_middleware(ContentSecurityPolicy, Option={'default-src': ["'self'"], 'base-uri': ["'self'"], 'block-all-mixed-content': []}, script_nonce=False, style_nonce=False, report_only=False)
 app.add_middleware(XContentTypeOptions)
 app.add_middleware(XFrame, Option={'X-Frame-Options': 'DENY'})
-#app.add_middleware(GZipMiddleware, minimum_size=1000, compresslevel=5)
 app.add_middleware(CustomGZipMiddleware, minimum_size=1000, compresslevel=5,paths=["/sources_list","/url/scan","/extract","/chat_bot","/chunk_entities","/get_neighbours","/graph_query","/schema","/populate_graph_schema","/get_unconnected_nodes_list","/get_duplicate_nodes","/fetch_chunktext"])
 app.add_middleware(
     CORSMiddleware,
@@ -955,6 +955,8 @@ async def fetch_chunktext(
         json_obj = {
             'api_name': 'fetch_chunktext',
             'db_url': uri,
+            'userName': userName,
+            'database': database,
             'document_name': document_name,
             'page_no': page_no,
             'logging_time': formatted_time(datetime.now(timezone.utc)),
@@ -972,5 +974,34 @@
         gc.collect()
 
 
+@app.post("/backend_connection_configuation")
+async def backend_connection_configuation():
+    try:
+        graph = Neo4jGraph()
+        logging.info(f'login connection status of object: {graph}')
+        if graph is not None:
+            graph_connection = True
+            isURI = os.getenv('NEO4J_URI')
+            isUsername= os.getenv('NEO4J_USERNAME')
+            isDatabase= os.getenv('NEO4J_DATABASE')
+            isPassword= os.getenv('NEO4J_PASSWORD')
+            encoded_password = encode_password(isPassword)
+            graphDb_data_Access = graphDBdataAccess(graph)
+            gds_status = graphDb_data_Access.check_gds_version()
+            write_access = graphDb_data_Access.check_account_access(database=isDatabase)
+            return create_api_response('Success',message=f"Backend connection successful",data={'graph_connection':graph_connection,'uri':isURI,'user_name':isUsername,'database':isDatabase,'password':encoded_password,'gds_status':gds_status,'write_access':write_access})
+        else:
+            graph_connection = False
+            return create_api_response('Success',message=f"Backend connection is not successful",data=graph_connection)
+    except Exception as e:
+        graph_connection = False
+        job_status = "Failed"
+        message="Unable to connect backend DB"
+        error_message = str(e)
+        logging.exception(f'{error_message}')
+        return create_api_response(job_status, message=message, error=error_message + ' or fill from the login dialog', data=graph_connection)
+    finally:
+        gc.collect()
+
 if __name__ == "__main__":
     uvicorn.run(app)
```
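
The new `/backend_connection_configuation` route (spelling follows the code) lets the frontend pick up a connection that is already configured through the backend's `NEO4J_*` environment variables, along with the GDS status and write access of that account. Below is a minimal client sketch, not part of this commit: the local URL and port, and the `status`/`message`/`data` response shape, are assumptions inferred from the `create_api_response` calls above.

```python
# Hypothetical client sketch: probe the env-var-based connection endpoint added above.
# Assumes the backend is served locally by uvicorn on its default port 8000.
import requests

resp = requests.post("http://localhost:8000/backend_connection_configuation", timeout=30)
payload = resp.json()

if payload.get("status") == "Success" and isinstance(payload.get("data"), dict):
    data = payload["data"]
    # The password comes back encoded (see encode_password in the handler),
    # so a client would decode it before opening its own driver session.
    print("URI:", data.get("uri"), "| database:", data.get("database"))
    print("GDS available:", data.get("gds_status"), "| write access:", data.get("write_access"))
else:
    # Either the env vars are missing or the database is unreachable.
    print("Backend-side connection not available:", payload.get("message"))
```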

backend/src/shared/common_fn.py

Lines changed: 2 additions & 2 deletions
```diff
@@ -11,8 +11,8 @@
 import os
 from pathlib import Path
 
-def check_url_source(source_type, yt_url:str=None, queries_list:List[str]=None):
-    languages=[]
+def check_url_source(source_type, yt_url:str=None, wiki_query:str=None):
+    language=''
     try:
         logging.info(f"incoming URL: {yt_url}")
         if source_type == 'youtube':
```

backend/src/shared/constants.py

Lines changed: 84 additions & 0 deletions
```diff
@@ -164,6 +164,90 @@
 LIMIT $limit
 """
 
+NODEREL_COUNT_QUERY_WITH_COMMUNITY = """
+MATCH (d:Document)
+WHERE d.fileName IS NOT NULL
+OPTIONAL MATCH (d)<-[po:PART_OF]-(c:Chunk)
+OPTIONAL MATCH (c)-[he:HAS_ENTITY]->(e:__Entity__)
+OPTIONAL MATCH (c)-[sim:SIMILAR]->(c2:Chunk)
+OPTIONAL MATCH (c)-[nc:NEXT_CHUNK]->(c3:Chunk)
+OPTIONAL MATCH (e)-[ic:IN_COMMUNITY]->(comm:__Community__)
+OPTIONAL MATCH (comm)-[pc1:PARENT_COMMUNITY]->(first_level:__Community__)
+OPTIONAL MATCH (first_level)-[pc2:PARENT_COMMUNITY]->(second_level:__Community__)
+OPTIONAL MATCH (second_level)-[pc3:PARENT_COMMUNITY]->(third_level:__Community__)
+WITH
+d.fileName AS filename,
+count(DISTINCT c) AS chunkNodeCount,
+count(DISTINCT po) AS partOfRelCount,
+count(DISTINCT he) AS hasEntityRelCount,
+count(DISTINCT sim) AS similarRelCount,
+count(DISTINCT nc) AS nextChunkRelCount,
+count(DISTINCT e) AS entityNodeCount,
+collect(DISTINCT e) AS entities,
+count(DISTINCT comm) AS baseCommunityCount,
+count(DISTINCT first_level) AS firstlevelcommCount,
+count(DISTINCT second_level) AS secondlevelcommCount,
+count(DISTINCT third_level) AS thirdlevelcommCount,
+count(DISTINCT ic) AS inCommunityCount,
+count(DISTINCT pc1) AS parentCommunityRelCount1,
+count(DISTINCT pc2) AS parentCommunityRelCount2,
+count(DISTINCT pc3) AS parentCommunityRelCount3
+WITH
+filename,
+chunkNodeCount,
+partOfRelCount + hasEntityRelCount + similarRelCount + nextChunkRelCount AS chunkRelCount,
+entityNodeCount,
+entities,
+baseCommunityCount + firstlevelcommCount + secondlevelcommCount + thirdlevelcommCount AS commCount,
+inCommunityCount + parentCommunityRelCount1 + parentCommunityRelCount2 + parentCommunityRelCount3 AS communityRelCount
+CALL (entities) {
+UNWIND entities AS e
+RETURN sum(COUNT { (e)-->(e2:__Entity__) WHERE e2 in entities }) AS entityEntityRelCount
+}
+RETURN
+filename,
+COALESCE(chunkNodeCount, 0) AS chunkNodeCount,
+COALESCE(chunkRelCount, 0) AS chunkRelCount,
+COALESCE(entityNodeCount, 0) AS entityNodeCount,
+COALESCE(entityEntityRelCount, 0) AS entityEntityRelCount,
+COALESCE(commCount, 0) AS communityNodeCount,
+COALESCE(communityRelCount, 0) AS communityRelCount
+"""
+NODEREL_COUNT_QUERY_WITHOUT_COMMUNITY = """
+MATCH (d:Document)
+WHERE d.fileName = $document_name
+OPTIONAL MATCH (d)<-[po:PART_OF]-(c:Chunk)
+OPTIONAL MATCH (c)-[he:HAS_ENTITY]->(e:__Entity__)
+OPTIONAL MATCH (c)-[sim:SIMILAR]->(c2:Chunk)
+OPTIONAL MATCH (c)-[nc:NEXT_CHUNK]->(c3:Chunk)
+WITH
+d.fileName AS filename,
+count(DISTINCT c) AS chunkNodeCount,
+count(DISTINCT po) AS partOfRelCount,
+count(DISTINCT he) AS hasEntityRelCount,
+count(DISTINCT sim) AS similarRelCount,
+count(DISTINCT nc) AS nextChunkRelCount,
+count(DISTINCT e) AS entityNodeCount,
+collect(DISTINCT e) AS entities
+WITH
+filename,
+chunkNodeCount,
+partOfRelCount + hasEntityRelCount + similarRelCount + nextChunkRelCount AS chunkRelCount,
+entityNodeCount,
+entities
+CALL (entities) {
+UNWIND entities AS e
+RETURN sum(COUNT { (e)-->(e2:__Entity__) WHERE e2 in entities }) AS entityEntityRelCount
+}
+RETURN
+filename,
+COALESCE(chunkNodeCount, 0) AS chunkNodeCount,
+COALESCE(chunkRelCount, 0) AS chunkRelCount,
+COALESCE(entityNodeCount, 0) AS entityNodeCount,
+COALESCE(entityEntityRelCount, 0) AS entityEntityRelCount
+"""
+
+
 ## CHAT SETUP
 CHAT_MAX_TOKENS = 1000
 CHAT_SEARCH_KWARG_SCORE_THRESHOLD = 0.5
```
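
These two Cypher constants back the per-document node and relationship count refresh; the "with community" variant also walks up to three levels of `PARENT_COMMUNITY`, and both use the parenthesised `CALL (vars) { ... }` subquery form, which appears to assume a reasonably recent Neo4j 5.x server. A minimal sketch of running the per-document variant follows; it is not part of this commit, and the import path, connection setup, and example file name are assumptions. `Neo4jGraph` is the langchain-neo4j class that `score.py` now imports.

```python
# Illustrative only: execute the per-document count query against the configured database.
import os
from langchain_neo4j import Neo4jGraph
from src.shared.constants import NODEREL_COUNT_QUERY_WITHOUT_COMMUNITY  # import path assumed from the repo layout

graph = Neo4jGraph(
    url=os.getenv("NEO4J_URI"),
    username=os.getenv("NEO4J_USERNAME"),
    password=os.getenv("NEO4J_PASSWORD"),
    database=os.getenv("NEO4J_DATABASE"),
)

# The "without community" variant is parameterised on a single file name ($document_name).
rows = graph.query(NODEREL_COUNT_QUERY_WITHOUT_COMMUNITY, {"document_name": "example.pdf"})
for row in rows:
    print(
        row["filename"],
        "chunks:", row["chunkNodeCount"],
        "chunk rels:", row["chunkRelCount"],
        "entities:", row["entityNodeCount"],
        "entity rels:", row["entityEntityRelCount"],
    )
```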

frontend/src/components/Content.tsx

Lines changed: 12 additions & 76 deletions
```diff
@@ -4,16 +4,7 @@ import { Button, Typography, Flex, StatusIndicator, useMediaQuery } from '@neo4j
 import { useCredentials } from '../context/UserCredentials';
 import { useFileContext } from '../context/UsersFiles';
 import { extractAPI } from '../utils/FileAPI';
-import {
-  BannerAlertProps,
-  ChildRef,
-  ContentProps,
-  CustomFile,
-  OptionType,
-  UserCredentials,
-  chunkdata,
-  connectionState,
-} from '../types';
+import { BannerAlertProps, ChildRef, ContentProps, CustomFile, OptionType, UserCredentials, chunkdata } from '../types';
 import deleteAPI from '../services/DeleteFiles';
 import { postProcessing } from '../services/PostProcessing';
 import { triggerStatusUpdateAPI } from '../services/ServerSideStatusUpdateAPI';
@@ -66,16 +57,7 @@ const Content: React.FC<ContentProps> = ({
   const [openGraphView, setOpenGraphView] = useState<boolean>(false);
   const [inspectedName, setInspectedName] = useState<string>('');
   const [documentName, setDocumentName] = useState<string>('');
-  const {
-    setUserCredentials,
-    userCredentials,
-    connectionStatus,
-    setConnectionStatus,
-    isGdsActive,
-    setGdsActive,
-    setIsReadOnlyUser,
-    isReadOnlyUser,
-  } = useCredentials();
+  const { setUserCredentials, userCredentials, setConnectionStatus, isGdsActive, isReadOnlyUser } = useCredentials();
   const [showConfirmationModal, setshowConfirmationModal] = useState<boolean>(false);
   const [extractLoading, setextractLoading] = useState<boolean>(false);
   const [retryFile, setRetryFile] = useState<string>('');
@@ -108,7 +90,9 @@ const Content: React.FC<ContentProps> = ({
     setchatModes,
     model,
   } = useFileContext();
-  const [viewPoint, setViewPoint] = useState<'tableView' | 'showGraphView' | 'chatInfoView'|'neighborView'>('tableView');
+  const [viewPoint, setViewPoint] = useState<'tableView' | 'showGraphView' | 'chatInfoView' | 'neighborView'>(
+    'tableView'
+  );
   const [showDeletePopUp, setshowDeletePopUp] = useState<boolean>(false);
   const [deleteLoading, setdeleteLoading] = useState<boolean>(false);
 
@@ -123,55 +107,15 @@ const Content: React.FC<ContentProps> = ({
     }
   );
   const childRef = useRef<ChildRef>(null);
-  const incrementPage = () => {
+
+  const incrementPage = async () => {
     setCurrentPage((prev) => prev + 1);
+    await getChunks(documentName, currentPage + 1);
   };
-  const decrementPage = () => {
+  const decrementPage = async () => {
     setCurrentPage((prev) => prev - 1);
+    await getChunks(documentName, currentPage - 1);
   };
-  useEffect(() => {
-    if (!init && !searchParams.has('connectURL')) {
-      let session = localStorage.getItem('neo4j.connection');
-      if (session) {
-        let neo4jConnection = JSON.parse(session);
-        setUserCredentials({
-          uri: neo4jConnection.uri,
-          userName: neo4jConnection.user,
-          password: atob(neo4jConnection.password),
-          database: neo4jConnection.database,
-          port: neo4jConnection.uri.split(':')[2],
-        });
-        if (neo4jConnection.isgdsActive !== undefined) {
-          setGdsActive(neo4jConnection.isgdsActive);
-        }
-        if (neo4jConnection.isReadOnlyUser !== undefined) {
-          setIsReadOnlyUser(neo4jConnection.isReadOnlyUser);
-        }
-      } else {
-        setOpenConnection((prev) => ({ ...prev, openPopUp: true }));
-      }
-      setInit(true);
-    } else {
-      setOpenConnection((prev) => ({ ...prev, openPopUp: true }));
-    }
-  }, []);
-  useEffect(() => {
-    if (currentPage >= 1) {
-      (async () => {
-        await getChunks(documentName, currentPage);
-      })();
-    }
-  }, [currentPage, documentName]);
-  useEffect(() => {
-    setFilesData((prevfiles) => {
-      return prevfiles.map((curfile) => {
-        return {
-          ...curfile,
-          model: curfile.status === 'New' || curfile.status === 'Reprocess' ? model : curfile.model,
-        };
-      });
-    });
-  }, [model]);
 
   useEffect(() => {
     if (afterFirstRender) {
@@ -264,15 +208,7 @@ const Content: React.FC<ContentProps> = ({
     }
     toggleChunksLoading();
   };
-  const getChunks = async (name: string, pageNo: number) => {
-    toggleChunksLoading();
-    const response = await getChunkText(userCredentials as UserCredentials, name, pageNo);
-    setTextChunks(response.data.data.pageitems);
-    if (!totalPageCount) {
-      setTotalPageCount(response.data.data.total_pages);
-    }
-    toggleChunksLoading();
-  };
+
   const extractData = async (uid: string, isselectedRows = false, filesTobeProcess: CustomFile[]) => {
     if (!isselectedRows) {
       const fileItem = filesData.find((f) => f.id == uid);
@@ -915,7 +851,7 @@ const Content: React.FC<ContentProps> = ({
             setTotalPageCount(null);
           }
           setCurrentPage(1);
-          // await getChunks(name, 1);
+          await getChunks(name, 1);
         }
       }}
       ref={childRef}
```

frontend/src/components/FileTable.tsx

Lines changed: 3 additions & 3 deletions
```diff
@@ -585,13 +585,13 @@ const FileTable = forwardRef<ChildRef, FileTableProps>((props, ref) => {
             label='chunktextaction'
             text='View Chunks'
             size='large'
-            disabled={info.getValue() === 'Uploading'}
+            disabled={info.getValue() === 'Uploading' || info.getValue() === 'New'}
           >
-            <DocumentTextIconSolid />
+            <DocumentTextIconSolid className='n-size-token-7' />
           </IconButtonWithToolTip>
         </>
       ),
-      size: 300,
+      maxSize: 300,
       minSize: 180,
       header: () => <span>Actions</span>,
       footer: (info) => info.column.id,
```

frontend/src/components/Layout/PageLayout.tsx

Lines changed: 1 addition & 0 deletions
```diff
@@ -35,6 +35,7 @@ const PageLayout: React.FC = () => {
   const [shows3Modal, toggleS3Modal] = useReducer((s) => !s, false);
   const [showGCSModal, toggleGCSModal] = useReducer((s) => !s, false);
   const [showGenericModal, toggleGenericModal] = useReducer((s) => !s, false);
+  const navigate = useNavigate();
   const toggleLeftDrawer = () => {
     if (largedesktops) {
       setIsLeftExpanded(!isLeftExpanded);
```

frontend/src/components/Popups/ChunkPopUp/index.tsx

Lines changed: 32 additions & 8 deletions
```diff
@@ -1,4 +1,5 @@
-import { Dialog, Typography, Flex, IconButton } from '@neo4j-ndl/react';
+
+import { Dialog, Typography, Flex, IconButton, useMediaQuery } from '@neo4j-ndl/react';
 import { ArrowLeftIconOutline, ArrowRightIconOutline } from '@neo4j-ndl/react/icons';
 import { chunkdata } from '../../../types';
 import Loader from '../../../utils/Loader';
@@ -23,20 +24,43 @@ const ChunkPopUp = ({
   currentPage: number | null;
   totalPageCount: number | null;
 }) => {
+
+  const { breakpoints } = tokens;
+  const isTablet = useMediaQuery(`(min-width:${breakpoints.xs}) and (max-width: ${breakpoints.lg})`);
   const sortedChunksData = useMemo(() => {
     return chunks.sort((a, b) => a.position - b.position);
   }, [chunks]);
   return (
-    <Dialog open={showChunkPopup} onClose={onClose}>
-      <Dialog.Header>Text Chunks</Dialog.Header>
+    <Dialog isOpen={showChunkPopup} onClose={onClose}>
+      <Dialog.Header>
+        <div className='flex flex-row items-center mb-2'>
+          <img
+            src={chunklogo}
+            style={{ width: isTablet ? 100 : 140, height: isTablet ? 100 : 140, marginRight: 10 }}
+            loading='lazy'
+          />
+          <div className='flex flex-col'>
+            <Typography variant='h2'>Text Chunks</Typography>
+            <Typography variant='body-medium' className='mb-2'>
+              These text chunks are extracted to build a knowledge graph and enable accurate information retrieval using
+              a different retrival strategies
+            </Typography>
+          </div>
+        </div>
+        {!chunksLoading && totalPageCount != null && totalPageCount > 0 && (
+          <div className='flex flex-row justify-end'>
+            <Typography variant='subheading-small'>Total Pages: {totalPageCount}</Typography>
+          </div>
+        )}
+      </Dialog.Header>
     <Dialog.Content>
       {chunksLoading ? (
         <Loader title='loading...'></Loader>
       ) : (
-        <ol className='max-h-80 overflow-y-auto'>
+        <ol className='max-h-80 overflow-y-auto flex flex-col gap-4'>
          {sortedChunksData.map((c, idx) => (
-            <li key={`${idx}${c.position}`} className='flex flex-row gap-2'>
-              <Flex flexDirection='column' gap='1'>
+            <li key={`${idx}${c.position}`} className='flex flex-row gap-1'>
+              <Flex flexDirection='column' gap='2'>
                 <Flex flexDirection='row'>
                   <Typography variant='label'>Position :</Typography>
                   <Typography variant='subheading-medium'>{c.position}</Typography>
@@ -57,10 +81,10 @@ const ChunkPopUp = ({
       {totalPageCount != null && totalPageCount > 1 && (
         <Dialog.Actions className='flex !justify-center items-center'>
           <Flex flexDirection='row'>
-            <IconButton disabled={currentPage === 1} onClick={decrementPage}>
+            <IconButton ariaLabel='decrementButton' isDisabled={currentPage === 1} onClick={decrementPage}>
              <ArrowLeftIconOutline />
            </IconButton>
-            <IconButton disabled={currentPage === totalPageCount} onClick={incrementPage}>
+            <IconButton ariaLabel='incrementButton' isDisabled={currentPage === totalPageCount} onClick={incrementPage}>
              <ArrowRightIconOutline />
            </IconButton>
          </Flex>
```

0 commit comments
