You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+72Lines changed: 72 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,6 +12,24 @@ For each PR made, an entry should be added to this changelog. It should contain
12
12
- etc.
13
13
14
14
## Changelog
15
+
16
+
- 1209-bug-fix-document-type-creator-form
17
+
- Description: The dropdown on the pattern creation form needs to be set as multi as the default option since this is why the doc type creator form is used for the majority of multi-URL pattern creations. This should be applied to doc types, division types, and titles as well.
18
+
- Changes:
19
+
- Set the default value for `match_pattern_type` in `BaseMatchPattern` class is set to `2`
20
+
- Changed `test_create_simple_exclude_pattern` test within `TestDeltaExcludePatternBasics`
21
+
- Changed `test_create_division_pattern` and `test_create_document_type_pattern_single` within `TestFieldModifierPatternBasics`
- Description: The original automation set up to generate the scrapers and indexers automatically based on a collection workflow status change needed to be updated to more accurately reflect the curation workflow. It would also be good to generate the jobs during this process to streamline the same.
25
+
- Changes:
26
+
- Updated function nomenclature. Scrapers are Sinequa connector configurations that are used to scrape all the URLs prior to curation. Indexers are Sienqua connector configurations that are used to scrape the URLs post to curation, which would be used to index content on production. Jobs are used to trigger the connectors which are included as parts of joblists.
27
+
- Parameterized the convert_template_to_job method to include the job_source to streamline the value added to the `<Collection>` tag in the job XML.
28
+
- Updated the fields that are pertinenet to transfer from a scraper to an indexer. Also added a third level of XML processing to facilitate the same.
29
+
- scraper_template.xml and indexer_template.xml now contains the templates used for the respective configuration generation.
30
+
- Deleted the redundant webcrawler_initial_crawl.xml file.
31
+
- Added and updated tests on workflow status triggers.
32
+
15
33
- 2889-serialize-the-tdamm-tags
16
34
- Description: Have TDAMM serialzed in a specific way and exposed via the Curated URLs API to be consumed into SDE Test/Prod
17
35
- Changes:
@@ -36,13 +54,38 @@ For each PR made, an entry should be added to this changelog. It should contain
36
54
- Used regex to catch any HTML content comming in as an input to form fields
37
55
- Called this class within the serializer for necessary fields
- Description: When URLs of a given collection are imported into COSMOS, a Slack notification is sent. This notification includes the name of the collection imported,count of the existing curated URLs, total URLs count as per the server, URLs successfully imported from the server, delta URLs identified and delta URLs marked for deletion.
64
+
- Changes:
65
+
- The get_full_texts() function in sde_collections/sinequa_api.py is updated to yeild total_count along with rows.
66
+
- fetch_and_replace_full_text() function in sde_collections/tasks.py captures the total_server_count and triggers send_detailed_import_notification().
67
+
- Added a function send_detailed_import_notification() in sde_collections/utils/slack_utils.py to structure the notification to be sent.
68
+
- Updated the associated tests effected due to inclusion of this functionality.
- Description: Upon selecting a document type on any individual URL, the page refreshes and returns to the top. This is not necessarily a bug but an inconvenience, especially when working at the bottom of the page. Fix the JS code.
72
+
- Changes:
73
+
- Added a constant `scrollPosition` within `postDocumentTypePatterns` to store the y coordinate postion on the page
74
+
- Modified the ajax relaod to navigate to this position upon posting/saving the document type changes.
- Description: When selecting options from the match pattern type filter, the system does not filter the results as expected. Instead of displaying only the chosen variety of patterns, it continues to show all patterns.
41
78
- Changes:
42
79
- In `title_patterns_table` definition, corrected the column reference
43
80
- Made `match_pattern_type` searchable
44
81
- Corrected the column references and made code consistent on all the other tables, i.e., `exclude_patterns_table`, `include_patterns_table`, `division_patterns_table` and `document_type_patterns_table`
45
82
83
+
- 1190-add-tests-for-job-generation-pipeline
84
+
- Description: Tests have been added to enhance coverage for the config and job creation pipeline, alongside comprehensive tests for XML processing.
85
+
- Changes:
86
+
- Added config_generation/tests/test_config_generation_pipeline.py which tests the config and job generation pipeline, ensuring all components interact correctly
87
+
- config_generation/tests/test_db_to_xml.py is updated to include comprehensive tests for XML Processing
88
+
46
89
- 1001-tests-for-critical-functionalities
47
90
- Description: Critical functionalities have been identified and listed, and critical areas lacking tests listed
48
91
- Changes:
@@ -65,3 +108,32 @@ For each PR made, an entry should be added to this changelog. It should contain
65
108
- Added universal search functionality tests
66
109
- Created search pane filter tests
67
110
- Added pattern application form tests with validation checks
111
+
112
+
- 1101-bug-fix-quotes-not-escaped-in-titles
113
+
- Description: Title rules that include single quotes show up correctly in the sinequa frontend (and the COSMOS api) but not in the delta urls page.
114
+
- Changes:
115
+
- Added `escapeHtml` function in the `delta_url_list.js` file to handle special character escaping correctly.
116
+
- Called this function while retrieving the titles in `getGeneratedTitleColumn()` and `getCuratedGeneratedTitleColumn()` functions.
- Description: In the Document Type Pattern Form, if the user does not select a Document Type while filling out the form, an appropriate error message is displayed.
132
+
- Changes:
133
+
- Added a JavaScript validation check on form submission to ensure the document type (stored in a hidden input) is not empty.
134
+
- Display an error message and prevent form submission if the field is empty.
0 commit comments