## Causal Discovery Algorithms

*23 Sep 2021 09:45*

Yet Another Inadequate Placeholder, because I should really split Graphical Causal Models into (at least) two notebooks, this being one.

*Things I need to think/learn more about*:

- How important is faithfulness? How unreasonable is it, and what strictly weaker assumptions will do (most of) the same work?
- Confidence sets for the causal graph. The simplest approach is to go over every possible graph, test it, and return the ones which pass the test. But this is computationally infeasible (the number of graphs grows crazily fast with the number of variables) and hard to interpret. Are there more comprehensible representations possible, and/or ones which would make this more computationally tractable?

- Recommended, big picture:
- Peter Spirtes, Clark Glymour and Richard Scheines, Causation, Prediction, and Search [Comments]

- Recommended, close-ups:
- Séverine Affeldt and Hervé Isambert, "Robust reconstruction of causal graphical models based on conditional 2-point and 3-point information", UAI 2015
- Wenyu Chen, Mathias Drton, Ali Shojaie, "Causal Structural Learning via Local Graphs", arxiv:2107.03597
- Diego Colombo, Marloes H. Maathuis, Markus Kalisch, and Thomas S. Richardson, "Learning High-Dimensional Directed Acyclic Graphs with Latent and Selection Variables", Annals of Statistics
**40**(2012): 294--321, arxiv:1104.5617 - Nicola Gnecco, Nicolai Meinshausen, Jonas Peters, Sebastian Engelke, "Causal discovery in heavy-tailed models", arxiv:1908.05097
- Dominik Janzing and Daniel J. L. Herrmann, "Reliable and Efficient Inference of Bayesian Networks from Sparse Data by Statistical Learning Theory", cs.LG/0309015
- Markus Kalisch and Peter Bühlmnann, "Estimating
High-Dimensional Directed Acyclic Graphs with the
PC-Algorithm", Journal
of Machine Learning Research
**8**(2007): 616--636 - Marloes H. Maathuis, Markus Kalisch, Peter Bühlmann, "Estimating high-dimensional intervention effects from observational data", Annals
of Statistics
**37**(2009): 3133--31654, arxiv:0810.4214 - Katerina Marazopoulou, Marc Maier and David Jensen, "Learning the Structure of Causal Models with Relational and Temporal Dependence", UAI 2015
- Garvesh Raskutti, Caroline Uhler, "Learning directed acyclic graphs based on sparsest permutations", arxiv:1307.0366
- Thomas Richardson
- "A Discovery Algorithm for Directed Cyclic Graphs", UAI 1996 [Short version without details]
- "A Discovery Algorithm for Directed Cyclic Graphs", Technical Report 68, CMU department of philosophy (postscript) [Long version with details]

- Arjun Sondhi, Ali Shojaie, "The Reduced PC-Algorithm: Improved Causal Structure Learning in Large Random Networks", arxiv:1806.06209
- Eric V. Strobl, Kun Zhang, Shyam Visweswaran, "Approximate Kernel-based Conditional Independence Tests for Fast Non-Parametric Causal Discovery",
Journal of Causal Inference
**7**(2019): 20180017, arxiv:1702.03877 - Robert E. Tillman, Arthur Gretton and Peter Spirtes, "Nonlinear Directed Acyclic Structure Learning with Weakly Additive Noise Models", NIPS 2009 [Thanks to Prof. Spirtes for a preprint]
- Pawel Wocjan, Dominik Janzing, and Thomas Beth, "Required sample size for learning sparse Bayesian networks with many variables," cs.LG/0204052

- Modesty forbids me to recommend:
- CRS, Advanced Data Analysis from an Elementary Point of View, Part III (chapters on causal discovery for statistics students)
- Octavio Mesner, Alex Davis, Elizabeth Casman, Hyagriv Simhan, CRS, Lauren Keenan-Devlin, Ann Borders and Tamar Krishnamurt, "Using graph learning to understand adverse pregnancy outcomes and stress pathways", PLoS One
**14**(2019): e0223319

- To read:
- Constantin F. Aliferis, Alexander Statnikov, Ioannis Tsamardinos, Subramani Mani, Xenofon D. Koutsoukos, "Local Causal and Markov Blanket Induction for Causal Discovery and Feature Selection for Classification"
- "Part I: Algorithms and Empirical Evaluation", Journal of Machine Learning Research
**11**(2010): 171--234 - "Part II: Analysis and Extensions", Journal of Machine Learning Research
**11**(2010): 235--284

- "Part I: Algorithms and Empirical Evaluation", Journal of Machine Learning Research
- Holly Andersen, "When to expect violations of causal faithfulness and why it matters", phil-sci/9204
- Facuno Bromberg and Dimitris Margaritis, "Improving the Reliability
of Causal Discovery from Small Data Sets Using Argumentation",
Journal of Machine Learning Research
**10**(2009): 301--340 - Peter Bühlmann, Jonas Peters, Jan Ernest, "CAM: Causal additive models, high-dimensional order search and penalized regression",
Annals of Statistics
**42**(2014): 2526--2556, arxiv:1310.1533 - Wenyu Chen, Mathias Drton, Y Samuel Wang, "On causal discovery with an equal-variance assumption ", Biometrika
**106**(2019): 973--980 - David Maxwell Chickering, "Optimal Structure Identification
With Greedy Search," Journal of Machine Learning Research
**3**(2002): 507--554 - Luis M. de Campos, "A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests",
Journal of
Machine Learning Research
**7**(2006): 2149--2187 - Mathias Drton and Marloes H. Maathuis, "Structure Learning in Graphical Modeling" Annual Review of Statistics and Its Application
**4**(2017): 365--393 - Robin J. Evans, "Model selection and local geometry", Annals of Statistics
**48**(2020): 3513--3544, arxiv:1801.08364 - Ming Gao, Yi Ding, Bryon Aragam, "A polynomial-time algorithm for learning nonparametric causal graphs", arxiv:2006.11970
- Glymour and Cooper (eds.), Computation, Causation and Discovery
- Naftali Harris, Mathias Drton, "PC Algorithm for Nonparanormal Graphical Models", Journal of Machine Learning Research
**14**(2013): 3365--3383 - Alain Hauser, Peter Bühlmann, "Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs",
Journal of Machine Learning Research
**13**(2012): 2409--2464, arxiv:1104.2808 - Yang-Bo He and Zhi Geng, "Active Learning of Causal
Networks with Intervention Experiments and Optimal Designs",
Journal of
Machine Learning Research
**9**(2008): 2523--2547 - Biwei Huang, Kun Zhang, Mingming Gong, Clark Glymour, "Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models", arxiv:1905.10857
- Biwei Huang, Kun Zhang, Jiji Zhang, Joseph Ramsey, Ruben Sanchez-Romero, Clark Glymour, Bernhard Schölkopf, "Causal Discovery from Heterogeneous/Nonstationary Data with Independent Changes", arxiv:1903.01672
- Antti Hyttinen, Frederick Eberhardt, Patrik O. Hoyer, "Experiment Selection for Causal Discovery", Journal of Machine Learning Research
**14**(2013): 3041--3071 - Markus Kalisch and Peter Bühlmnann, "Robustification of the PC-Algorithm for Directed Acyclic Graphs", Journal of Computational and
Graphical Statistics
**17**(2008): 773--789 - Xiaohan Kang, Bruce Hajek, "Lower Bounds on Information Requirements for Causal Network Inference", arxiv:2102.00055
- Manuele Leonelli, Gherardo Varando, "Context-Specific Causal Discovery for Categorical Data Using Staged Trees", arxiv:2106.04416
- Junning Li, Z. Jane Wang, "Controlling the False Discovery Rate of the Association/Causality Structure Learned with the PC Algorithm", Journal of Machine
Learning Research
**10**(2009): 475--514 - Svante Linusson, Petter Restadh, Liam Solus, "Greedy Causal Discovery is Geometric", arxiv:2103.03771
- Po-Ling Loh, Peter Bühlmann, "High-dimensional learning of linear causal networks via inverse covariance estimation", arxiv:1311.3492
- Dhafer Malouche and Sylvie Sevestre-Ghalila, "Estimating High dimensional faithful Gaussian graphical Models : uPC-algorithm", arxiv:0705.1613
- Jean-Philippe Pellett and Andre Elisseeff, "Using Markov Blankets for Causal Structure Learning", Journal of Machine Learning
Research
**9**(2008): 1295--1342 - Sergey Plis, David Danks and Jianyu Yang, "Mesochronal Structure Learning", UAI 2015
- Anant Raj, Luigi Gresele, Michel Besserve, Bernhard Schölkopf, Stefan Bauer, "Orthogonal Structure Search for Efficient Causal Discovery from Observational Data", arxiv:1903.02456
- Kayvan Sadeghi, Terry Soo, "Conditions and Assumptions for Constraint-based Causal Structure Learning", arxiv:2103.13521
- Silva, Scheines, Glymour and Spirtes, "Learning the Structure
of Linear Latent Variable Models", Journal of Machine Learning Research
**7**(2006): 191--246 - Peter Spirtes, Jiji Zhang, "A Uniformly Consistent Estimator of Causal Effects under the $k$-Triangle-Faithfulness Assumption", Statistical Science
**29**(2014): 662--678 - David Strieder, Tobias Freidling, Stefan Haffner, Mathias Drton, "Confidence in Causal Discovery with Linear Causal Models", arxiv:2106.05694
- Sara van de Geer, Peter Bühlmann, "$\ell_0$-penalized maximum likelihood for sparse directed acyclic graphs", arxiv:1205.5473
- Xianchao Xie, Zhi Geng, "A Recursive Method for Structural Learning
of Directed Acyclic Graphs", Journal of
Machine Learning Research
**9**(2008): 459--483 - Raanan Yehezkel, Boaz Lerner, "Bayesian Network Structure Learning by Recursive Autonomy Identification", Journal of
Machine Learning Research
**10**(2009): 1527--1570 - Xun Zheng, Chen Dan, Bryon Aragam, Pradeep Ravikumar, Eric P. Xing, "Learning Sparse Nonparametric DAGs", arxiv:1909.13189

- To write:
- CRS, "An Apology for Causal Discovery", expanding on two talks, "Lessons (?) for causal discovery from Markov models" and "Commentary on Kun Zhang"