Merge pull request #176 from matteoceriscioli/mlingam_docs

ikeuchi-screen · web-flow · commit 967c1e819e70 · 2025-10-29T10:46:33.000+09:00
Update m-LiNGAM docs and add reference
diff --git a/README.md b/README.md
@@ -163,4 +163,8 @@ Should you use this package for performing **LiM algorithm**, we kindly request
 
 * Y. Zeng, S. Shimizu, H. Matsui, F. Sun. **Causal discovery for linear mixed data**. In Proc. First Conference on Causal Learning and Reasoning (CLeaR2022). PMLR 177, pp. 994-1009, 2022. [[PDF]](https://proceedings.mlr.press/v177/zeng22a.html)
 
+### Missing data
 
+Should you use this package for performing the **Missingness-LiNGAM algorithm**, we kindly request you to cite the following paper:
+
+* M. Ceriscioli, S. Shimizu, and K. Mohan. **Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract)**. The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026. [[PDF]](https://raw.githubusercontent.com/matteoceriscioli/matteoceriscioli.github.io/master/files/Discovering_Linear_NonGaussian_Models_for_All_Categories_of_Missing_Data_(Student_Abstract).pdf)
diff --git a/docs/tutorial/missingness_lingam.rst b/docs/tutorial/missingness_lingam.rst
@@ -4,10 +4,10 @@ m-LiNGAM
 Model
 -------------------
 
-Missingness-LiNGAM (m-LiNGAM) extends the basic LiNGAM [1]_ model to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.  
+Missingness-LiNGAM (m-LiNGAM) [1]_ extends the basic LiNGAM model [2]_ to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.  
 It enables the identification of the true underlying causal structure and provides unbiased parameter estimates even when data are not fully observed.
 
-The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [2]_.  
+The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [3]_.  
 In this framework, variables can be fully observed or partially observed, and each partially observed variable is associated with a missingness mechanism and a proxy variable.  
 
 Let the set of variables be:
@@ -32,7 +32,7 @@ The induced subgraph :math:`G[V_o \cup V_m]` follows a LiNGAM model, meaning tha
 
 where :math:`i\in\{1,\dots,n\}\mapsto k(i)` denotes a causal order, and the non-Gaussian error terms are independent.
 
-The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [3]_:
+The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [4]_:
 
 .. math::
     x_i = \mathbf 1\llbracket\sum_{k(j)<k(i)} b_{ij} x_j + e_i > 0\rrbracket, \qquad e_i \sim \text{Logistic}(0,1)
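
The logistic mechanism above can be illustrated with a minimal simulation sketch. It is not taken from the ``lingam`` package: the helper names, the single parent, and the coefficient ``2.0`` are hypothetical, and which indicator value encodes "missing" is not stated in this excerpt. Because :math:`e_i \sim \text{Logistic}(0,1)`, the probability that the indicator equals 1 is the sigmoid of the linear term.

```python
import math
import random


def sample_logistic(rng):
    """Draw e ~ Logistic(0, 1) by inverting the CDF: e = log(u / (1 - u))."""
    u = rng.random()
    # random() may return exactly 0.0, which log cannot handle; redraw in that case.
    while u == 0.0:
        u = rng.random()
    return math.log(u / (1.0 - u))


def sample_missingness_indicator(parent_values, coefficients, rng):
    """Return 1 if sum_j b_ij * x_j + e_i > 0, else 0."""
    linear_term = sum(b * x for b, x in zip(coefficients, parent_values))
    return 1 if linear_term + sample_logistic(rng) > 0 else 0


rng = random.Random(0)

# Hypothetical setup: one parent x_j with coefficient b_ij = 2.0.
b = [2.0]
draws_high = [sample_missingness_indicator([1.5], b, rng) for _ in range(10000)]
draws_low = [sample_missingness_indicator([-1.5], b, rng) for _ in range(10000)]

# Empirical frequencies of the indicator being 1 for each parent value.
rate_high = sum(draws_high) / len(draws_high)
rate_low = sum(draws_low) / len(draws_low)

# Theoretical values: sigmoid(b * x) = 1 / (1 + exp(-b * x)).
expected_high = 1.0 / (1.0 + math.exp(-3.0))
expected_low = 1.0 / (1.0 + math.exp(3.0))
```

With enough draws, ``rate_high`` and ``rate_low`` should be close to ``expected_high`` and ``expected_low``, which shows how the parent value shifts the probability of the indicator.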
@@ -45,24 +45,28 @@ The following assumptions are made to ensure identifiability:
 
 #. No latent confounders (:math:`U = \emptyset`).
 #. No causal interactions between missingness mechanisms (:math:`R_i \notin Pa(R_j)` for all :math:`i \neq j`).
-#. No direct self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
+#. No self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
 
-Note that even if direct self-masking is not allowed, a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism (indirect self-masking).
+Note that although self-masking is not allowed, indirect self-masking is permitted: a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism.
 Under these assumptions, m-LiNGAM guarantees identifiability of both the causal structure and parameters from observational data in the large-sample limit.
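
The two graphical assumptions on missingness mechanisms can be checked mechanically. The sketch below is only an illustration: the function ``check_mgraph_assumptions``, the parent-set dictionary encoding, and the node names are hypothetical and not part of the ``lingam`` API. The absence of latent confounders cannot be verified from the graph alone and is not checked.

```python
def check_mgraph_assumptions(parents, mechanism_of):
    """Check two m-graph assumptions and return a list of violation messages.

    parents: dict mapping each node to the set of its direct parents.
    mechanism_of: dict mapping each partially observed variable X_i
        to the name of its missingness mechanism R_i.
    """
    violations = []
    mechanisms = set(mechanism_of.values())

    # No causal interactions between mechanisms: R_i not in Pa(R_j) for i != j.
    for r_j in sorted(mechanisms):
        for p in sorted(parents.get(r_j, set())):
            if p in mechanisms and p != r_j:
                violations.append(f"{p} is a parent of {r_j}")

    # No self-masking: X_i not in Pa(R_i).
    for x_i, r_i in sorted(mechanism_of.items()):
        if x_i in parents.get(r_i, set()):
            violations.append(f"{x_i} directly causes its own mechanism {r_i}")

    return violations


# Indirect self-masking (allowed): X1 -> X2 -> R1, where R1 is X1's mechanism.
allowed_graph = {"X1": set(), "X2": {"X1"}, "R1": {"X2"}}
allowed_result = check_mgraph_assumptions(allowed_graph, {"X1": "R1"})

# Self-masking (not allowed): X1 -> R1.
masked_graph = {"X1": set(), "R1": {"X1"}}
masked_result = check_mgraph_assumptions(masked_graph, {"X1": "R1"})
```

Here ``allowed_result`` is empty because ``X1`` is only an ancestor of ``R1``, while ``masked_result`` contains one message because ``X1`` is a direct parent of ``R1``.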
 
 An example Python notebook demonstrating m-LiNGAM is available `here <https://github.com/cdt15/lingam/blob/master/examples/MissingnessLiNGAM.ipynb>`__.
 
 References
 -------------------
 
-.. [1] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.  
+.. [1] M. Ceriscioli, S. Shimizu, and K. Mohan.  
+       *Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract).*  
+       The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026.
+
+.. [2] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.  
        *A Linear Non-Gaussian Acyclic Model for Causal Discovery.*  
        Journal of Machine Learning Research, 7:2003–2030, 2006.
 
-.. [2] K. Mohan, J. Pearl, and J. Tian.  
+.. [3] K. Mohan, J. Pearl, and J. Tian.  
        *Graphical Models for Inference with Missing Data.*  
        Advances in Neural Information Processing Systems (NeurIPS), 2013.
 
-.. [3] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.  
+.. [4] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.  
        *Causal Discovery for Linear Mixed Data.*  
        In Proceedings of the First Conference on Causal Learning and Reasoning (CLeaR 2022), PMLR 177, pp. 994–1009, 2022.