Skip to content

Commit 14edc65

Browse files
committed
remove bad images
1 parent e556200 commit 14edc65

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

41 files changed

+40
-440
lines changed

ai-vector-search-and-rag/README.md

Lines changed: 40 additions & 40 deletions
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ ONNX, or Open Neural Network Exchange, is an open-source format for representing
4646
![][2]
4747

4848

49-
[2]: images/1715344779151.png
49+
[2]: https://media.licdn.com/dms/image/D5612AQGjt1gi9FuYPg/article-inline_image-shrink_1500_2232/0/1715344779151?e=1727308800&v=beta&t=pPi-iwpUl9LwOo96eE3qEPrwijOJCT1iQjaWe6urh74
5050

5151
Open Neural Network Exchange or ONNX is an open standard format of machine learning models. By using the Oracle Machine Learning Services REST API, you can deploy and score your ONNX format models. The REST API is exposed as part of the Oracle Machine Learning Services on Oracle Autonomous Database cloud service.
5252

@@ -71,7 +71,7 @@ The Oracle Machine Learning Services REST API supports ONNX format model deploym
7171
![][4]
7272

7373

74-
[4]: images/1715417391489.png
74+
[4]: https://media.licdn.com/dms/image/D4D12AQG9jlosfL7gOQ/article-inline_image-shrink_1500_2232/0/1715417391489?e=1727308800&v=beta&t=9ybWKvRfSmRYrX-hsouSTm_r36y2CoxwUAk4kLSzdI8
7575

7676
To prepare large unstructured textual data (for example, a PDF or Word document), you first transform the data into plain text and then pass the resulting text through _Chunker_. The Chunker then splits the text into smaller chunks using a process known as **chunking**. A single document may be split into many chunks, each transformed into a vector.
7777

@@ -80,7 +80,7 @@ To prepare large unstructured textual data (for example, a PDF or Word document)
8080
![][5]
8181

8282

83-
[5]: images/1715417730257.png
83+
[5]: https://media.licdn.com/dms/image/D4D12AQHP_p8faXa74g/article-inline_image-shrink_1500_2232/0/1715417730257?e=1727308800&v=beta&t=PJ2vFQGJVWb1iZrli3nzuz18SjPMfCO0i9T_YyVq0Pk
8484

8585
### 3. Short Demo Video for this article
8686

@@ -118,12 +118,12 @@ To download and extract the OML4Py client installation file, do the following:
118118
![][8]
119119

120120

121-
[8]: images/1715346700078.png
121+
[8]: https://media.licdn.com/dms/image/D4D12AQE-A8sdRrobYg/article-inline_image-shrink_1500_2232/0/1715346700078?e=1727308800&v=beta&t=R7o25gEs2RSs4yH6zhCHBwbccOl4667HMvYeW7sY_B0
122122

123123
![][9]
124124

125125

126-
[9]: images/1715347038280.png
126+
[9]: https://media.licdn.com/dms/image/D4D12AQE9pgPgaiGd7Q/article-inline_image-shrink_1500_2232/0/1715347038280?e=1727308800&v=beta&t=X51cMaKnl8nyRIE5tX801m5DrfXcB29FNu-7-xPyfTk
127127

128128
You will have to download this locally and then upload it to cloud machine using SCP, [please refer my previous article][10] on SCP if required.
129129

@@ -225,7 +225,7 @@ Load the packages below to ensure they have been installed successfully. Start P
225225
![][12]
226226

227227

228-
[12]: images/1715349973247.png
228+
[12]: https://media.licdn.com/dms/image/D4D12AQFCmIYAzN8Ihg/article-inline_image-shrink_1500_2232/0/1715349973247?e=1727308800&v=beta&t=MsnWtc202V_qDJvZQEyFZykn9n18qF7Hv_BzA8agTJo
229229

230230
**Run the OML4Py Client Installation Script**
231231

@@ -256,7 +256,7 @@ Load the packages below to ensure they have been installed successfully. Start P
256256
![][13]
257257

258258

259-
[13]: images/1715499456328.png
259+
[13]: https://media.licdn.com/dms/image/D4D12AQGfPdTOscqcTA/article-inline_image-shrink_1500_2232/0/1715499456328?e=1727308800&v=beta&t=y9MGRpAvKhBABv2FZwDVC1syKKqnk2NP_u3ih42_vWk
260260

261261
### 5. Convert Pretrained Models to ONNX Format
262262

@@ -325,7 +325,7 @@ I would like to download 2 PDFs, one related to [Breast cancer facts and figures
325325
![][18]
326326

327327

328-
[18]: images/1715498322584.png
328+
[18]: https://media.licdn.com/dms/image/D4D12AQFjxiAlhDumHg/article-inline_image-shrink_1000_1488/0/1715498322584?e=1727308800&v=beta&t=LoWczJCqNE_HSK6beW9JC11UAfY-CSZfPvBldbyzIn0
329329

330330
and the other related to [COVID-19][19]
331331

@@ -335,7 +335,7 @@ and the other related to [COVID-19][19]
335335
![][20]
336336

337337

338-
[20]: images/1715498408860.png
338+
[20]: https://media.licdn.com/dms/image/D4D12AQGZ39I6onR1uw/article-inline_image-shrink_1000_1488/0/1715498408860?e=1727308800&v=beta&t=11A6jH2TSb1wjqFAg-b6bX5euYhVZMDguqRJufKJouc
339339

340340
_Please Note: You can use any other PDFs as required to train. And I don't own any of these PDF files or content in it. This is just for demonstration purposes only._
341341

@@ -413,7 +413,7 @@ Connect as DOCUSER
413413
![][21]
414414

415415

416-
[21]: images/1715361732305.png
416+
[21]: https://media.licdn.com/dms/image/D4D12AQH54ezC7LpIsw/article-inline_image-shrink_1500_2232/0/1715361732305?e=1727308800&v=beta&t=uuZ2sASmOJSV3ihugiUMlU7seHgdBoZOgBbw7eq86zQ
417417

418418
Run UTL_TO_TEXT to convert the PDF document into text format:
419419

@@ -425,7 +425,7 @@ Run UTL_TO_TEXT to convert the PDF document into text format:
425425
![][22]
426426

427427

428-
[22]: images/1715358256114.png
428+
[22]: https://media.licdn.com/dms/image/D4D12AQH4zl3AYw5A2A/article-inline_image-shrink_1500_2232/0/1715358256114?e=1727308800&v=beta&t=GySuEUQvXtwlVCTwYVFp2-Ax4I0GNN3ZnrnmSX1_M10
429429

430430
JSON parameters to use normalization and some of the custom chunking specifications:
431431

@@ -446,7 +446,7 @@ JSON parameters to use normalization and some of the custom chunking specificati
446446
![][23]
447447

448448

449-
[23]: images/1715358954759.png
449+
[23]: https://media.licdn.com/dms/image/D4D12AQHNBN4rJoKO9Q/article-inline_image-shrink_1000_1488/0/1715358954759?e=1727308800&v=beta&t=WqFq1_gy4QlLrL8nnYUUXi8_uwMidxB0FXgnyMKZgqw
450450

451451
Convert chunks to embeddings:
452452

@@ -495,15 +495,15 @@ This creates a doc_chunks table lets check this table:
495495
![][24]
496496

497497

498-
[24]: images/1715359808238.png
498+
[24]: https://media.licdn.com/dms/image/D4D12AQH9AjCJjVJ8eg/article-inline_image-shrink_1500_2232/0/1715359808238?e=1727308800&v=beta&t=ZvYmynKj2adk7jb29D1F3kFlJe6CcMzCmIniVn_YVJM
499499

500500

501501
Select EMBED_DATA, EMBED_VECTOR from doc_chunks where rownum < 4;
502502

503503
![][25]
504504

505505

506-
[25]: images/1715359583739.png
506+
[25]: https://media.licdn.com/dms/image/D4D12AQE8MFu9qwXarw/article-inline_image-shrink_1500_2232/0/1715359583739?e=1727308800&v=beta&t=rh2wFxQ05qmUJ8CbH1RCsc6iccyd625evCvsjvOe2xI
507507

508508

509509
select count(*) from doc_chunks;
@@ -519,21 +519,21 @@ Login to Oracle APEX INTERNAL workspace as ADMIN user and create a new workspace
519519
![][26]
520520

521521

522-
[26]: images/1715360729372.png
522+
[26]: https://media.licdn.com/dms/image/D4D12AQFxrLtVZSWRRg/article-inline_image-shrink_1000_1488/0/1715360729372?e=1727308800&v=beta&t=vg6yBQyEkwH03kq-f8dzdrewsXE2HEEJOiCKTZtyQeY
523523

524524
Re-use existing schema -
525525

526526
![][27]
527527

528528

529-
[27]: images/1715360772535.png
529+
[27]: https://media.licdn.com/dms/image/D4D12AQFCIQ-GwU6PUw/article-inline_image-shrink_1000_1488/0/1715360772535?e=1727308800&v=beta&t=qOCydSwjfhFdA8uqIZ0_pFMnEV4BjS5yAygUR4m3fsQ
530530

531531
Click on Create Workspace -
532532

533533
![][28]
534534

535535

536-
[28]: images/1715360813529.png
536+
[28]: https://media.licdn.com/dms/image/D4D12AQFWRFrAfaRBQQ/article-inline_image-shrink_1000_1488/0/1715360813529?e=1727308800&v=beta&t=UIrjYMSjKjiBGNkc0VTn257cq_4GvFb9cqpi8tx3UyY
537537

538538
### 10. Create Vector AI Search Application to Search PDF documents
539539

@@ -542,7 +542,7 @@ Access Oracle APEX Navigation > SQL Commands > Create a function with following
542542
![][29]
543543

544544

545-
[29]: images/1715361311977.png
545+
[29]: https://media.licdn.com/dms/image/D4D12AQEqp3TtQ5VsJg/article-inline_image-shrink_1000_1488/0/1715361311977?e=1727308800&v=beta&t=QClmk8-OVUhYYdYijf6IRFeUdsRpTHheZY0y3qiXaFM
546546

547547
[Download code from my git repo][30]
548548

@@ -554,7 +554,7 @@ Now, lets create a simple Oracle APEX page in any application have an input text
554554
![][31]
555555

556556

557-
[31]: images/1715362104559.png
557+
[31]: https://media.licdn.com/dms/image/D4D12AQH9usGHWfpzDg/article-inline_image-shrink_1500_2232/0/1715362104559?e=1727308800&v=beta&t=9PqH4RQ_ptOD15IT-CDRYtcxteNy4MLUx-xF12LhZso
558558

559559
Where the PL/SQL Response Block will call rag_function as shown below.
560560

@@ -571,34 +571,34 @@ Just to ensure that we dont call the function without any data, lets add server
571571
![][32]
572572

573573

574-
[32]: images/1715362253715.png
574+
[32]: https://media.licdn.com/dms/image/D4D12AQGxtxI5-OO9qg/article-inline_image-shrink_1500_2232/0/1715362253715?e=1727308800&v=beta&t=m9713Ws0ziCcUfQ2C9enIKQ61l3cbX4qYs8oXuHr-cQ
575575

576576
Run the page with various inputs - this will basically query the uploaded PDF files using Oracle AI Vector Search
577577

578578
![][33]
579579

580580

581-
[33]: images/1715360071118.png
581+
[33]: https://media.licdn.com/dms/image/D4D12AQF9Td5YWuAJCw/article-inline_image-shrink_1000_1488/0/1715360071118?e=1727308800&v=beta&t=Echaerv72jfcg3I6STIgb-RT-_3a5Bu2W55mZzWOvKc
582582

583583
![][34]
584584

585585

586-
[34]: images/1715360143219.png
586+
[34]: https://media.licdn.com/dms/image/D4D12AQE0ETF9_CQxPg/article-inline_image-shrink_1000_1488/0/1715360143219?e=1727308800&v=beta&t=Dvz8WDZvTrHsz4X1GdJEBzecyX9Y1F8k_J5NyqkaYQo
587587

588588
![][35]
589589

590590

591-
[35]: images/1715360333435.png
591+
[35]: https://media.licdn.com/dms/image/D4D12AQEg7q6lLf6e7Q/article-inline_image-shrink_1500_2232/0/1715360333435?e=1727308800&v=beta&t=x53_HR1io2oe3SKKDe6uVgiOopJsx1j9hYR7pVbN7i8
592592

593593
![][36]
594594

595595

596-
[36]: images/1715360411760.png
596+
[36]: https://media.licdn.com/dms/image/D4D12AQGKIGnz4ezqeA/article-inline_image-shrink_1000_1488/0/1715360411760?e=1727308800&v=beta&t=wPfRFosm0eMc08qQtB5aSGPRlJ7H7adS4l6KovsOe08
597597

598598
![][37]
599599

600600

601-
[37]: images/1715360531314.png
601+
[37]: https://media.licdn.com/dms/image/D4D12AQEPJ5FRHdJx3A/article-inline_image-shrink_1000_1488/0/1715360531314?e=1727308800&v=beta&t=jc3nQ8ZmzNtlmoPumEAbzhliE16WSK5fUYHvWf_UxAg
602602

603603
Let us extend this application. What we would like to do is not only search internal PDFs (specific to an Organisation) but also search over external LLMs using Oracle Generative AI,
604604

@@ -620,21 +620,21 @@ Login to [cloud.oracle.com][39] we would need few parameters, to authenticate wi
620620
![][40]
621621

622622

623-
[40]: images/1715492621349.png
623+
[40]: https://media.licdn.com/dms/image/D4D12AQFHmDfdo7lB7w/article-inline_image-shrink_1500_2232/0/1715492621349?e=1727308800&v=beta&t=vlBK2lulwJiYtAuyqJY9Sr2FjlObtl8LD8YPlroWYaA
624624

625625
Add a new API Key or re-use an API Key if it has already been created
626626

627627
![][41]
628628

629629

630-
[41]: images/1715408798709.png
630+
[41]: https://media.licdn.com/dms/image/D4D12AQGCTaTCoBfsMA/article-inline_image-shrink_1500_2232/0/1715408798709?e=1727308800&v=beta&t=QYJQdHwIOk6KviPmIobOSRTlUidj_rSVfGEQNvg6KXo
631631

632632
Next to API Key you can find **configuration** details as shown below.
633633

634634
![][42]
635635

636636

637-
[42]: images/1715408842001.png
637+
[42]: https://media.licdn.com/dms/image/D4D12AQE12ElYsR3cXw/article-inline_image-shrink_1000_1488/0/1715408842001?e=1727308800&v=beta&t=y4c1nL0fK6qgSRgESEZS53zq-BioJKFmlWhZMv28V08
638638

639639
We would need to copy the following and save it in a notepad for later use:
640640

@@ -703,21 +703,21 @@ Create functional declaration with any required variables -
703703
![][43]
704704

705705

706-
[43]: images/1715409746505.png
706+
[43]: https://media.licdn.com/dms/image/D4D12AQFblG7oAOt4Ow/article-inline_image-shrink_1500_2232/0/1715409746505?e=1727308800&v=beta&t=sK8kahq76qE3t-B4unITkzNEE1h-oRg4bvvYHXh1PUc
707707

708708
Search Internal PDFs
709709

710710
![][44]
711711

712712

713-
[44]: images/1715409803724.png
713+
[44]: https://media.licdn.com/dms/image/D4D12AQFsDTA_Bz2Ajw/article-inline_image-shrink_1500_2232/0/1715409803724?e=1727308800&v=beta&t=6qCeo_2fuYtIAgFwJxhJmqBbKRixG3ds_fDpMxkmiTs
714714

715715
Create Search Parameters and Search Oracle Generative AI, Append both the search results and return the output back to calling function.
716716

717717
![][45]
718718

719719

720-
[45]: images/1715409940189.png
720+
[45]: https://media.licdn.com/dms/image/D4D12AQF7pxws9_S3Uw/article-inline_image-shrink_1500_2232/0/1715409940189?e=1727308800&v=beta&t=5XSFZBgpD2IqlvQzIIZPsorm-39Nr2n5Y9ztwczJO1w
721721

722722
You can [download this complete code from my GitHub repository][46]
723723

@@ -731,33 +731,33 @@ Our Oracle APEX page will have Input text box, Submit button and an Output rich
731731
![][47]
732732

733733

734-
[47]: images/1715410305544.png
734+
[47]: https://media.licdn.com/dms/image/D4D12AQHARzAiCHJQ1A/article-inline_image-shrink_1500_2232/0/1715410305544?e=1727308800&v=beta&t=CvROFvz8dP61V0tv2-LNKjbrCTSevzV6Sa2omMBjmos
735735

736736
We will need a process handle submit button
737737

738738
![][48]
739739

740740

741-
[48]: images/1715410408885.png
741+
[48]: https://media.licdn.com/dms/image/D4D12AQHV2z-eOxUyUA/article-inline_image-shrink_1000_1488/0/1715410408885?e=1727308800&v=beta&t=vyRMA0FVhRaOnsRdZANJ1wF5LbcWQ67jyko8waOvR_M
742742

743743
The PL/SQL code to call the function will be as shown below -
744744

745745
![][49]
746746

747747

748-
[49]: images/1715410489617.png
748+
[49]: https://media.licdn.com/dms/image/D4D12AQEroHUN7QsuGw/article-inline_image-shrink_1500_2232/0/1715410489617?e=1727308800&v=beta&t=1WnSMTpRyrtr4-Rvypz3g9A3jFcV7Dg4oHm77_vAX1I
749749

750750
Thats all we would need, just run the page now
751751

752752
![][50]
753753

754754

755-
[50]: images/1715410651913.png
755+
[50]: https://media.licdn.com/dms/image/D4D12AQGhL_64zTMncA/article-inline_image-shrink_1500_2232/0/1715410651913?e=1727308800&v=beta&t=LUZ0UBtLt_awE70XyVVr5eUuOVJwwcr_WEnOn5ZcfBE
756756

757757
![][51]
758758

759759

760-
[51]: images/1715410924914.png
760+
[51]: https://media.licdn.com/dms/image/D4D12AQF1SisOxZGQfg/article-inline_image-shrink_1500_2232/0/1715410924914?e=1727308800&v=beta&t=bRRhuBqlKFUgxNaBDr3ien9__ctzu-rVod5bka2L35M
761761

762762

763763
Input How is breast cancer staged?
@@ -773,7 +773,7 @@ Thats all we would need, just run the page now
773773
![][52]
774774

775775

776-
[52]: images/1715411555406.png
776+
[52]: https://media.licdn.com/dms/image/D4D12AQEc-o7F0gGS7g/article-inline_image-shrink_1500_2232/0/1715411555406?e=1727308800&v=beta&t=xAmPsUgl1OebdZaqkRCqyuSpOHqbHBYVVujCdrSTgLU
777777

778778

779779
Input: What is the risk of being diagnosed with breast cancer?
@@ -789,7 +789,7 @@ Thats all we would need, just run the page now
789789
![][53]
790790

791791

792-
[53]: images/1715412314340.png
792+
[53]: https://media.licdn.com/dms/image/D4D12AQHduxo1i5uv5g/article-inline_image-shrink_1500_2232/0/1715412314340?e=1727308800&v=beta&t=xtfNBZVFflJQbDtFKwf_M1TaKDFYp6y50lyk65Rg6ec
793793

794794

795795
Input: what are the symptoms of covid-19
@@ -807,12 +807,12 @@ Thats all we would need, just run the page now
807807
![][54]
808808

809809

810-
[54]: images/1715584271093.png
810+
[54]: https://media.licdn.com/dms/image/D4D12AQGg1TQ-sCW1Jw/article-inline_image-shrink_1500_2232/0/1715584271093?e=1727308800&v=beta&t=yfzxCU52bNREsJ4rfWmm8gzoIVmTRX07RpV1XKh6y88
811811

812812
![][55]
813813

814814

815-
[55]: images/1715583595019.png
815+
[55]: https://media.licdn.com/dms/image/D4D12AQGybVu9OmrX6Q/article-inline_image-shrink_1500_2232/0/1715583595019?e=1727308800&v=beta&t=XbHZihTJvLY2o641OIOVUWMiPFbwIt7dsgsyt9eIFLA
816816

817817
Reference Architecture
818818

ai-vector-search-and-rag/images/1715344779151.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

ai-vector-search-and-rag/images/1715346700078.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

ai-vector-search-and-rag/images/1715347038280.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

ai-vector-search-and-rag/images/1715349973247.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

ai-vector-search-and-rag/images/1715358256114.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

ai-vector-search-and-rag/images/1715358954759.png

Lines changed: 0 additions & 10 deletions
This file was deleted.

0 commit comments

Comments
 (0)