Implementation 805 by J007X · Pull Request #905 · asyml/forte

J007X · 2022-11-18T07:54:14Z

This PR fixes [https://github.com//issues/805].

Description of changes

According to discussion and requirements in the ticket and , a new test NLP pipeline using Forte is created in a new test class "NLP_Pipeline_Performance_Test" to cover typical scenarios relating to entry/token, such as serialization, POS tagging and NER as mentioned in the ticket. Also to make sure 0.2.0 and current code base are run in exactly some way/code, some code in copied from some referenced (but changed over the version) packages, to make sure we perform an apple to apple test

Possible influences of this PR.

This can be used in several iterations of profiling-improvement efforts, that adding profiling coverage on key/typical scenarios.

Test Conducted

Profiling test on serialization, POS tagging and NER as requested in [https://github.com/J007X/forte/tree/implementation_805].

…nltk or fortex.nltk to prevent version conflicts initally and use all-in-one approach to make sure the test code will be run the same way with 0.2.0 and new version > 0.3.0.

…ameter)

… neck related to nested generator/sortedlist area in DataPack.

…ne) : please provide conll data directory to the commented out "input_dir" parameter in code.

…ovide output local dir name in serialization test case.

codecov · 2022-11-18T08:42:37Z

Codecov Report

Merging #905 (8068e13) into master (6e2d6ea) will decrease coverage by 0.07%.
The diff coverage is 72.48%.

@@            Coverage Diff             @@
##           master     #905      +/-   ##
==========================================
- Coverage   81.05%   80.99%   -0.07%     
==========================================
  Files         256      257       +1     
  Lines       19851    19999     +148     
==========================================
+ Hits        16091    16198     +107     
- Misses       3760     3801      +41

Impacted Files	Coverage Δ
tests/forte/data/data_pack_profiling_test.py	`72.29% <72.29%> (ø)`
forte/data/ontology/ontology_code_generator.py	`89.75% <100.00%> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

hunterhector

Maybe we also want to produce some numbers for the test?
How to run this test on actual data (like ontonotes), we shoud provide instructions.
The first line "PR fixes" message is not correctly formatted. Github won't associate the PR to the issue like this.

tests/forte/data/data_pack_profiling_test.py

…ntation_805

J007X added 6 commits October 17, 2022 17:12

Inital commit for the profiling test -- not using imports from forte.…

96df5f5

…nltk or fortex.nltk to prevent version conflicts initally and use all-in-one approach to make sure the test code will be run the same way with 0.2.0 and new version > 0.3.0.

Fixed a few parameter issues (input_path need to be supplied from par…

413bcaf

…ameter)

Added NER and serialization test

5e2da8e

PR submission for the current version of testing (that detects bottle…

8d12c4e

… neck related to nested generator/sortedlist area in DataPack.

Fixed related testing directory issue (remove dir name on local machi…

ce8a1d2

…ne) : please provide conll data directory to the commented out "input_dir" parameter in code.

Fix output dir issue in test (removed local dir name): please also pr…

d5e714a

…ovide output local dir name in serialization test case.

J007X requested a review from hunterhector November 24, 2022 11:19

J007X and others added 2 commits November 30, 2022 11:03

Merge branch 'master' into implementation_805

5c2ad55

Merge branch 'master' into implementation_805

a3b8214

hunterhector reviewed Dec 21, 2022

View reviewed changes

hunterhector requested a review from mylibrar December 21, 2022 06:12

hunterhector and others added 6 commits December 21, 2022 13:30

Merge branch 'master' into implementation_805

e034541

Merge branch 'asyml:master' into implementation_805

90ab2f1

Fixed multiple comments for this PR.

3b08be5

Merge remote-tracking branch 'origin/implementation_805' into impleme…

9530628

…ntation_805

Merge branch 'master' into implementation_805

881d62d

Merge branch 'master' into implementation_805

8068e13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation 805#905

Implementation 805#905
J007X wants to merge 14 commits intoasyml:masterfrom
J007X:implementation_805

J007X commented Nov 18, 2022 •

edited

Loading

Uh oh!

codecov bot commented Nov 18, 2022 •

edited

Loading

Uh oh!

hunterhector left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

J007X commented Nov 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of changes

Possible influences of this PR.

Test Conducted

Uh oh!

codecov bot commented Nov 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

hunterhector left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

J007X commented Nov 18, 2022 •

edited

Loading

codecov bot commented Nov 18, 2022 •

edited

Loading