Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models #32

YeWang1576 · 2020-08-14T00:09:58Z

Add the one-dimensional and two-dimensional negative binomial model, as well as the two-dimensional Poisson model

kbenoit

This is a welcome addition to the existing function that does two things:

adds SEs for the other parameters, since the existing version only provided them for theta.
adds an option to return second dimension estimates of the beta and theta parameters.

A few things that stand out, in addition to what I've noted in specific code-related comments:

Won't we need a second direction constraint for the 2D model? This could be a 3rd and 4th element, or a list of two length2 vectors.
The prior_values argument seems really important but it's not documented. Since I suspect this is how the 2D model is identified, we need to make very clear how the 2D model must set this. Please document this and provide examples of how it is used. What is the setting for non-constrained parameters? NA?
Related to priors, one possibility would be to consider the vector of betas and thetas for dimension two to be constrained at zero by default, and for these to be NA for them to be estimated. Please explain more how these are implemented and used by the model.
The helper functions such as predict() and coef() will need more code for handling the 2D case.
We should add a textmodel_scale2d() as well for this case (and it can be used for textmodel_ca() results). @koheiw and I can work on that one.

My preference then is to keep things as close as possible to the original function but extend it.

To things that @koheiw and I will also think about:

Should this be part of the original function, or should we extend it to a new function? If we can unify the framework by making this work through the constraints, or using a dims = 1 argument that can take a 2 for the 2D case, then we can use one function. But then the dir will need four, not two values.
I think we should define a new return object class called textmodel_wordfish2d for two-dimensional objects, then we can write separate predict() and coef() etc methods for this variant (and solving one of the issues above). It also means we could write a textmodel_scale2d.textmodel_wordfish2d() function for this - plus a method for textmodel_ca objects.
This is written only for dense dfms, but @koheiw do you think the code could be adapted for the sparse version too?

kbenoit · 2020-08-28T16:12:33Z

R/textmodel_wordfish.R

+#'   binomial model
+#' @param dim2 a boolean variable that specifies whether to estimate the second
+#'   dimension of theta and beta
+#' @param prior_values NEEDS DOCUMENTING


Please add a description here.

kbenoit · 2020-08-28T16:12:51Z

R/textmodel_wordfish.R

-#' @references Slapin, J. & Proksch, S.O. (2008).
-#'   [A Scaling Model
-#'   for Estimating Time-Series Party Positions from Texts](https://doi.org/10.1111/j.1540-5907.2008.00338.x). *American
+#' @references


Is there a reference we can add for the 2D model?

kbenoit · 2020-08-28T16:13:28Z

R/textmodel_wordfish.R

 #' @note In the rare situation where a warning message of "The algorithm did not
 #'   converge." shows up, removing some documents may work.
 #' @seealso [predict.textmodel_wordfish()]
-#' @references Slapin, J. & Proksch, S.O. (2008).


We should add a description of the new return objects containing the word-level SEs and new parameters.

kbenoit · 2020-08-28T16:14:05Z

R/textmodel_wordfish.R

+#'   Benchmark](http://doi.org/10.1093/pan/mpt002). *Political Analysis*, 21(3),
+#'   298--313.
 #' @author Benjamin Lauderdale, Haiyan Wang, and Kenneth Benoit
 #' @examples


Please add some examples for the 2D case.

kbenoit · 2020-08-28T16:15:18Z

R/textmodel_wordfish.R

+        "estimated.feature.scores" = as.coefficients_textmodel(head(coef(object)$features, n))
    )
    return(as.summary.textmodel(result))
 }


predict will need modification for the 2D case.

kbenoit · 2020-08-28T16:17:17Z

R/textmodel_wordfish.R

        phi = as.numeric(result$phi),
-        se.theta = as.numeric(result$thetaSE) ,
+        zeta = as.numeric(result$zeta),
+        theta2 = as.numeric(result$theta2),


It probably makes more sense for the 2D case to return theta as an ndoc x 2 matrix instead of as two vectors of length ndoc. Same for the other parameters and SEs.

YeWang1576 · 2020-12-01T04:31:33Z

@kbenoit Thanks for the comments! I will work on the sparse version of the two-dimensional model and modify the dense version according to the suggestions as well.

codecov · 2021-02-01T17:56:02Z

Codecov Report

Merging #32 (639f6b8) into master (34017dc) will decrease coverage by 5.55%.
The diff coverage is 28.02%.

❗ Current head 639f6b8 differs from pull request most recent head 989ef98. Consider uploading reports for the commit 989ef98 to get more accurate results

@@            Coverage Diff             @@
##           master      #32      +/-   ##
==========================================
- Coverage   64.27%   58.71%   -5.56%     
==========================================
  Files          19       19              
  Lines        2536     2873     +337     
==========================================
+ Hits         1630     1687      +57     
- Misses        906     1186     +280

Impacted Files	Coverage Δ
src/wordfish_dense.cpp	`40.91% <25.00%> (-59.09%)`	⬇️
R/textmodel_wordfish.R	`79.41% <65.51%> (-4.66%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 17f1c84...989ef98. Read the comment docs.

YeWang1576 and others added 5 commits August 12, 2020 23:31

merge

5b30f51

Merge branch 'master' into RLW

26b9ae0

Merge branch 'master' into RLW

0b21e62

minor linting

bf7908c

Fix documentation issues

e7ab644

kbenoit requested changes Aug 28, 2020

View reviewed changes

Merge branch 'master' into RLW

d281621

Merge branch 'master' into RLW

639f6b8

kbenoit changed the title ~~merge~~ Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models Mar 20, 2022

Merge branch 'master' into RLW

989ef98

kbenoit mentioned this pull request Aug 28, 2025

Interested in adding this to quanteda.textmodels? naiveError/wordkrill#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models #32

Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models #32

Uh oh!

YeWang1576 commented Aug 14, 2020

Uh oh!

kbenoit left a comment

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

kbenoit Aug 28, 2020

Uh oh!

YeWang1576 commented Dec 1, 2020

Uh oh!

codecov bot commented Feb 1, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models #32

Are you sure you want to change the base?

Add 2-dimensional wordfish model and 1- and 2-dimensional NB wordfish models #32

Uh oh!

Conversation

YeWang1576 commented Aug 14, 2020

Uh oh!

kbenoit left a comment

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

kbenoit Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

YeWang1576 commented Dec 1, 2020

Uh oh!

codecov bot commented Feb 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Feb 1, 2021 •

edited

Loading