Table of Contents

Teaching

BIOINF 2119 Probabilistic Methods in Artificial Intelligence (Onsite course)
Taught every spring, 2010 - 2017

This course introduces fundamental concepts and methods in artificial intelligence that are applicable to problems in biomedicine. This course is designed for students who do not necessarily have a background in computer science. The course provides the foundations in artificial intelligence methods including search (breadth-first search, depth-first search, greedy search, etc), probabilistic knowledge representation and reasoning (Bayesian networks: model, independencies, semantics, parameter estimation and inference), decision theory, and machine learning (regression, neural networks, classification trees, support vector machines, Markov models and hidden Markov models).

Resources and pointers

Writing

Four steps to writing papers in 6 weeks:

Resources and pointers

Software

The Informative Bayesian Model Selection (IBMS) is a computationally efficient method that can be applied to genome-wide data to detect both SNP-SNP interactions and interactions between two groups of SNPs (e.g., a group may consist of SNPs that map to a gene). The software for IBMS is available for download. The paper describing this method is Informative Bayesian Model Selection: A method for identifying interactions in genome-wide data and is published in Molecular BioSystems.

The backward elimination algorithm uses a kernel-based conditional dependence measure to identify the Markov blanket in a fully multivariate fashion. The software for the backward elimination algorithm is available for download at the open source code repository GitHub. The paper describing this algorithm is Markov blanket ranking using kernel-based conditional dependence measures and is published in the proceedings of NIPS 2013 Workshop on Causality: Large-scale Experiment Design and Inference of Causal Mechanisms.

The deep multiple kernel learning algorithm tunes a deep multiple kernel net by alternating optimization with the span bound. It is an attempt to extend deep learning to small sample sizes. The software for the deep multiple kernel learning algorithm is available for download at the open source code repository GitHub. The paper describing this algorithm is Deep multiple kernel learning and is published in the proceedings of the IEEE 12th International Conference on Machine Learning and Applications (ICMLA 2013).

The Modular Relief Framework (MoRF) can be used to develop novel variations of the Relief algorithm for application to genome-wide data for ranking of single nucleotide variants (SNVs). The software for MoRF is available for download at the open source code repository GitHub. The paper describing this algorithm is Application of a spatially-weighed Relief algorithm for ranking genetic predictors of disease and is published in BioData Mining.

The Model Averaged Naive Bayes (MANB) is an algorithm that predicts patient outcomes from genome-wide data by efficiently model averaging over an exponential number of naive Bayes models. The software for MANB is available for download. The paper describing this algorithm is The application of naive Bayes model averaging to predict Alzheimer's disease from genome-wide data and is published in JAMIA.

Presentations

Artificial Intelligence in Medicine - slides are here

Artificial Intelligence in Medicine MSTP Workshop, 8 March 2017. The slides are here and the citations are below:

Courses in Statistics and Data Science

Undergraduate calculus sequence:
Textbook: James Stewart, Essential Calculus, Early Transcendentals, 2nd Edition

Undergraduate linear algebra:

Introductory probability and statistics:
Textbook: I. Miller and M. Miller, John E. Freund’s Mathematical Statistics with Applications, 8th Edition

Intermediate probability and statistics:
Textbook: Morris H. DeGroot and Mark J. Schervish, Probability and Statistics, 4th Edition

Advanced probability:

Advanced statistics:

Data science:

Other topics:

Syllabi for Courses in Statistics and Data Science

MATH 0220 Analytic Geometry and Calculus 1
Textbook: James Stewart, Essential Calculus, Early Transcendentals, 2nd edition

MATH 0230 Analytic Geometry and Calculus 2
Textbook: James Stewart, Essential Calculus, Early Transcendentals, 2nd edition

MATH 0240 Analytic Geometry and Calculus 3
Textbook: James Stewart, Essential Calculus, Early Transcendentals, 2nd edition

MATH 1180 Linear Algebra 1
Textbook: D. Poole, Linear Algebra: a Modern Introduction, 4th edition

STAT 1151 Introduction to Probability
Textbook: I. Miller and M. Miller, John E. Freund’s Mathematical Statistics with Applications, 8th Edition

STAT 1152 Introduction to Mathematical Statistics
Textbook: I. Miller and M. Miller, John E. Freund’s Mathematical Statistics with Applications, 8th Edition