You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -163,4 +163,8 @@ Should you use this package for performing **LiM algorithm**, we kindly request
163
163
164
164
* Y. Zeng, S. Shimizu, H. Matsui, F. Sun. **Causal discovery for linear mixed data**. In Proc. First Conference on Causal Learning and Reasoning (CLeaR2022). PMLR 177, pp. 994-1009, 2022. [[PDF]](https://proceedings.mlr.press/v177/zeng22a.html)
165
165
166
+
### Missing data
166
167
168
+
Should you use this package for performing the **Missingness-LiNGAM algorithm**, we kindly request you to cite the following paper:
169
+
170
+
* M. Ceriscioli, S. Shimizu, and K. Mohan. **Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract)**. The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026. [[PDF]](https://raw.githubusercontent.com/matteoceriscioli/matteoceriscioli.github.io/master/files/Discovering_Linear_NonGaussian_Models_for_All_Categories_of_Missing_Data_(Student_Abstract).pdf)
Copy file name to clipboardExpand all lines: docs/tutorial/missingness_lingam.rst
+12-8Lines changed: 12 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,10 +4,10 @@ m-LiNGAM
4
4
Model
5
5
-------------------
6
6
7
-
Missingness-LiNGAM (m-LiNGAM) extends the basic LiNGAM [1]_ model to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.
7
+
Missingness-LiNGAM (m-LiNGAM) [1]_ extends the basic LiNGAM model [2]_ to handle datasets affected by missing values, including Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) cases.
8
8
It enables the identification of the true underlying causal structure and provides unbiased parameter estimates even when data are not fully observed.
9
9
10
-
The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [2]_.
10
+
The model combines the principles of LiNGAM and the graphical representation of missingness mechanisms using *missingness graphs* (m-graphs) [3]_.
11
11
In this framework, variables can be fully observed or partially observed, and each partially observed variable is associated with a missingness mechanism and a proxy variable.
12
12
13
13
Let the set of variables be:
@@ -32,7 +32,7 @@ The induced subgraph :math:`G[V_o \cup V_m]` follows a LiNGAM model, meaning tha
32
32
33
33
where :math:`i\in\{1,\dots,n\}\mapsto k(i)` denotes a causal order, and the non-gaussian error terms are independent.
34
34
35
-
The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [3]_:
35
+
The induced subgraph :math:`G[V_o \cup V_m \cup R]` follows a LiM model. The missingness mechanisms :math:`R_i \in R` follow a logistic model as for binary variables in LiM [4]_:
@@ -45,24 +45,28 @@ The following assumptions are made to ensure identifiability:
45
45
46
46
#. No latent confounders (:math:`U = \emptyset`).
47
47
#. No causal interactions between missingness mechanisms (:math:`R_i \notin Pa(R_j)` for all :math:`i \neq j`).
48
-
#. No direct self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
48
+
#. No self-masking (:math:`X_i \notin Pa(R_i)` for any :math:`X_i \in V_m`).
49
49
50
-
Note that even if direct self-masking is not allowed, a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism (indirect self-masking).
50
+
Note that even if self-masking is not allowed, indirect self-masking is: a partially observed variable can be an indirect cause (an ancestor) of its own missingness mechanism.
51
51
Under these assumptions, m-LiNGAM guarantees identifiability of both the causal structure and parameters from observational data in the large-sample limit.
52
52
53
53
An example Python notebook demonstrating m-LiNGAM is available `here <https://github.com/cdt15/lingam/blob/master/examples/MissingnessLiNGAM.ipynb>`__.
54
54
55
55
References
56
56
-------------------
57
57
58
-
.. [1] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.
58
+
.. [1] M. Ceriscioli, S. Shimizu, and K. Mohan.
59
+
*Discovering Linear Non-Gaussian Models for All Categories of Missing Data (Student Abstract).*
60
+
The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26) Student Abstract and Poster Program, 2026.
61
+
62
+
.. [2] S. Shimizu, P. O. Hoyer, A. Hyvärinen, and A. J. Kerminen.
59
63
*A Linear Non-Gaussian Acyclic Model for Causal Discovery.*
60
64
Journal of Machine Learning Research, 7:2003–2030, 2006.
61
65
62
-
.. [2] K. Mohan, J. Pearl, and J. Tian.
66
+
.. [3] K. Mohan, J. Pearl, and J. Tian.
63
67
*Graphical Models for Inference with Missing Data.*
64
68
Advances in Neural Information Processing Systems (NeurIPS), 2013.
65
69
66
-
.. [3] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.
70
+
.. [4] Y. Zeng, S. Shimizu, H. Matsui, and F. Sun.
67
71
*Causal Discovery for Linear Mixed Data.*
68
72
In Proceedings of the First Conference on Causal Learning and Reasoning (CLeaR 2022), PMLR 177, pp. 994–1009, 2022.
0 commit comments