Adding BERT for MS-MARCO Passage re-ranking #277

atif93 wants to merge 12 commits into asyml:master
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##           master     #277      +/-   ##
==========================================
- Coverage   83.07%   83.01%   -0.06%
==========================================
  Files         195      195
  Lines       15323    15338      +15
==========================================
+ Hits        12729    12733       +4
- Misses       2594     2605      +11
```
Continue to review full report at Codecov.
Wanted to get an idea of what you guys think of this design for loading a pre-trained `BERTClassifier` config.
```python
super().__init__(hparams=hparams)
```

```python
self.load_pretrained_config(pretrained_model_name, cache_dir)
```
Will `load_pretrained_config` and `init_pretrained_weights` be called twice (once in `BERTClassifier`, and once in `BERTEncoder`)?
If that is the case, we probably should not load the pre-trained weights in `self._encoder` (`BERTEncoder`).
Discussed offline.
We can pass `pretrained_model_name` as `None` while instantiating the encoder in `BERTClassifier`.
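A minimal runnable sketch of the fix discussed above, using mock classes rather than the real Texar API (`MockBERTEncoder`, `MockBERTClassifier`, and `load_calls` are illustrative names): by passing `pretrained_model_name=None` to the inner encoder, the pre-trained weights are loaded only once, at the classifier level.

```python
# Record of every simulated weight-loading call, so we can verify
# that loading happens exactly once.
load_calls = []

class MockBERTEncoder:
    def __init__(self, pretrained_model_name=None):
        # The encoder only loads weights when given a model name.
        if pretrained_model_name is not None:
            load_calls.append(("encoder", pretrained_model_name))

class MockBERTClassifier:
    def __init__(self, pretrained_model_name=None):
        # Pass None so the inner encoder skips its own weight loading.
        self._encoder = MockBERTEncoder(pretrained_model_name=None)
        # The classifier itself loads the pre-trained weights exactly once.
        if pretrained_model_name is not None:
            load_calls.append(("classifier", pretrained_model_name))

MockBERTClassifier(pretrained_model_name="bert-base-uncased")
# load_calls now holds a single entry, recorded by the classifier only.
```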
If you set both the `pretrained_model_name` argument and `pretrained_model_name` in `hparams` to `None`, `BERTEncoder` won't load the pre-trained weights.
Made both changes.
```python
# BERT for MS-MARCO
'bert-msmarco-base': 512,
'bert-msmarco-large': 512,
```
This won't be the last/best BERT model for MS-MARCO, so we should probably come up with more specific names, say `bert-msmarco-nguyen2019`.
Sure, let me change that.
Adding BERT fine-tuned on MS-MARCO for the passage re-ranking task (https://arxiv.org/abs/1901.04085).
Since this is a pre-trained classifier, we had to add the final linear layer parameters in
`PretrainedBERTMixin`. Based on the `pretrained_model_name`, the weights of the final classifier layer will be loaded if they are present.

Resolves #254
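To illustrate the "loaded if present" behavior, here is a hedged sketch of how a mixin might decide which tensors to expect in a checkpoint, keyed on the model name. The set and function names below (`CLASSIFIER_WEIGHT_MODELS`, `checkpoint_tensors`) are hypothetical, not the actual `PretrainedBERTMixin` implementation.

```python
# Model names whose checkpoints ship a fine-tuned final classifier layer.
CLASSIFIER_WEIGHT_MODELS = {'bert-msmarco-base', 'bert-msmarco-large'}

def checkpoint_tensors(pretrained_model_name):
    """Return the tensor names expected in the downloaded checkpoint."""
    # Every BERT checkpoint carries the encoder and pooler weights.
    tensors = ['bert/encoder', 'bert/pooler']
    if pretrained_model_name in CLASSIFIER_WEIGHT_MODELS:
        # MS-MARCO checkpoints additionally include the output layer
        # fine-tuned for passage re-ranking.
        tensors += ['output_weights', 'output_bias']
    return tensors
```

With this shape, a vanilla model like `bert-base-uncased` loads only the encoder weights, while the MS-MARCO variants also restore the final classifier layer.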