Skip to content

Commit 967c1e8

Browse files
Merge pull request #176 from matteoceriscioli/mlingam_docs
Update m-LiNGAM docs and add reference
2 parents 36ac16f + 47561b3 commit 967c1e8

File tree

2 files changed

+16
-8
lines changed

2 files changed

+16
-8
lines changed

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -163,4 +163,8 @@ Should you use this package for performing **LiM algorithm**, we kindly request
163163

164164
* Y. Zeng, S. Shimizu, H. Matsui, F. Sun. **Causal discovery for linear mixed data**. In Proc. First Conference on Causal Learning and Reasoning (CLeaR2022). PMLR 177, pp. 994-1009, 2022. [[PDF]](https://proceedings.mlr.press/v177/zeng22a.html)
165165

166+
### Missing data
166167

168+
Should you use this package for performing the **Missingness-LiNGAM algorithm**, we kindly request you to cite the following paper:
169+
170+
* M. Ceriscioli, S. Shimizu, and K. Mohan. **Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract)**. The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026. [[PDF]](https://raw.githubusercontent.com/matteoceriscioli/matteoceriscioli.github.io/master/files/Discovering_Linear_NonGaussian_Models_for_All_Categories_of_Missing_Data_(Student_Abstract).pdf)

docs/tutorial/missingness_lingam.rst

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -4,10 +4,10 @@ m-LiNGAM
44
Model
55
-------------------
66

7-
Missingness-LiNGAM (m-LiNGAM) extends the basic LiNGAM [1]_ model to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.
7+
Missingness-LiNGAM (m-LiNGAM) [1]_ extends the basic LiNGAM model [2]_ to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.
88
It enables the identification of the true underlying causal structure and provides unbiased parameter estimates even when data are not fully observed.
99

10-
The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [2]_.
10+
The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [3]_.
1111
In this framework, variables can be fully observed or partially observed, and each partially observed variable is associated with a missingness mechanism and a proxy variable.
1212

1313
Let the set of variables be:
@@ -32,7 +32,7 @@ The induced subgraph :math:`G[V_o \cup V_m]` follows a LiNGAM model, meaning tha
3232
3333
where :math:`i\in\{1,\dots,n\}\mapsto k(i)` denotes a causal order, and the non-gaussian error terms are independent.
3434

35-
The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [3]_:
35+
The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [4]_:
3636

3737
.. math::
3838
x_i = \mathbf 1\llbracket\sum_{k(j)<k(i)} b_{ij} x_j + e_i > 0\rrbracket, \qquad e_i \sim \text{Logistic}(0,1)
@@ -45,24 +45,28 @@ The following assumptions are made to ensure identifiability:
4545

4646
#. No latent confounders (:math:`U = \emptyset`).
4747
#. No causal interactions between missingness mechanisms (:math:`R_i \notin Pa(R_j)` for all :math:`i \neq j`).
48-
#. No direct self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
48+
#. No self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
4949

50-
Note that even if direct self-masking is not allowed, a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism (indirect self-masking).
50+
Note that even if self-masking is not allowed, indirect self-masking is: a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism.
5151
Under these assumptions, m-LiNGAM guarantees identifiability of both the causal structure and parameters from observational data in the large-sample limit.
5252

5353
An example Python notebook demonstrating m-LiNGAM is available `here <https://github.com/cdt15/lingam/blob/master/examples/MissingnessLiNGAM.ipynb>`__.
5454

5555
References
5656
-------------------
5757

58-
.. [1] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.
58+
.. [1] M. Ceriscioli, S. Shimizu, and K. Mohan.
59+
*Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract).*
60+
The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026.
61+
62+
.. [2] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.
5963
*A Linear Non-Gaussian Acyclic Model for Causal Discovery.*
6064
Journal of Machine Learning Research, 7:2003–2030, 2006.
6165
62-
.. [2] K. Mohan, J. Pearl, and J. Tian.
66+
.. [3] K. Mohan, J. Pearl, and J. Tian.
6367
*Graphical Models for Inference with Missing Data.*
6468
Advances in Neural Information Processing Systems (NeurIPS), 2013.
6569
66-
.. [3] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.
70+
.. [4] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.
6771
*Causal Discovery for Linear Mixed Data.*
6872
In Proceedings of the First Conference on Causal Learning and Reasoning (CLeaR 2022), PMLR 177, pp. 994–1009, 2022.

0 commit comments

Comments
 (0)