Publications

(2024). Towards Verifiable Text Generation with Symbolic References. COLM 2024.

PDF Code

(2024). Learning to Decode Collaboratively with Multiple Language Models. ACL 2024.

PDF Code

(2024). Prediction-powered Generalization of Causal Inferences. ICML 2024.

PDF Code

(2024). A Data-Centric Approach to Generate Faithful and High Quality Patient Summaries with Large Language Models. Conference on Health, Inference, and Learning (CHIL) 2024.

PDF

(2024). Benchmarking observational studies with experimental data under right-censoring. AISTATS 2024.

PDF Code

(2024). Machine learning to predict notes for chart review in the oncology setting: a proof of concept strategy for improving clinician note-writing. Journal of the American Medical Informatics Association (JAMIA).

PDF

(2024). Joint AI-driven event prediction and longitudinal modeling in newly diagnosed and relapsed multiple myeloma. NPJ Digital Medicine.

(2023). Effective Human-AI Teams via Learned Natural Language Rules and Onboarding. Advances in neural information processing systems (NeurIPS).

Code

(2023). Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes. Machine Learning for Healthcare Conference (MLHC), 2023.

PDF

(2023). Large-Scale Study of Temporal Shift in Health Insurance Claims. Conference on Health, Inference, and Learning.

PDF Code

(2023). Who Should Predict? Exact Algorithms For Learning to Defer to Humans. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF Code

(2023). TabLLM: Few-shot Classification of Tabular Data with Large Language Models. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF Code

(2023). Falsification of Internal and External Validity in Observational Studies via Conditional Moment Restrictions. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2023). Conformalized Unconditional Quantile Regression. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

(2023). A Deep Dive into Single-Cell RNA Sequencing Foundation Models. bioRxiv.

DOI URL

(2022). Training Subset Selection for Weak Supervision. Advances in Neural Information Processing Systems.

PDF

(2022). Falsification before Extrapolation in Causal Effect Estimation. Advances in Neural Information Processing Systems (NeurIPS).

PDF

(2022). Evaluating Robustness to Dataset Shift via Parametric Robustness Sets. Advances in Neural Information Processing Systems (NeurIPS).

PDF Code

(2022). Sample Efficient Learning of Predictors that Complement Humans. Proceedings of the Thirty-Eighth International Conference on Machine Learning (ICML).

PDF Code

(2022). Co-training Improves Prompt-based Learning for Large Language Models. Proceedings of the Thirty-Eighth International Conference on Machine Learning (ICML).

PDF

(2022). Bias-robust Integration of Observational and Experimental Estimators. arXiv preprint (2205.10467).

PDF

(2022). Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2022). The Potential For Bias In Machine Learning And Opportunities For Health Insurers To Address It. Health Affairs.

PDF

(2022). Teaching Humans When To Defer to a Classifier via Exemplars. Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI).

PDF Code

(2022). Single cell characterization of myeloma and its precursor conditions reveals transcriptional signatures of early tumorigenesis. Nature Communications.

PDF Code

(2022). Leveraging Time Irreversibility with Order-Contrastive Pre-training. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2022). Large Language Models are Few-Shot Clinical Information Extractors. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF

(2022). ETAB: A Benchmark Suite for Visual Representation Learning in Echocardiography. Advances in Neural Information Processing Systems Datasets and Benchmarks Track.

PDF

(2022). Clustering Interval-Censored Time-Series for Disease Phenotyping. Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI).

PDF

(2021). MedKnowts: Unified Documentation and Information Retrieval for Electronic Health Records. UIST ‘21: The 34th Annual ACM Symposium on User Interface Software and Technology.

PDF

(2021). Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance. Proceedings of the 35th Conference on Neural Information Processing Systems.

PDF Code

(2021). Regularizing towards Causal Invariance: Linear Models with Proxies. Proceedings of the Thirty-Eighth International Conference on Machine Learning (ICML).

PDF Code

(2021). Neural Pharmacodynamic State Space Modeling. Proceedings of the Thirty-Eighth International Conference on Machine Learning (ICML).

PDF Code

(2021). Graph cuts always find a global optimum for Potts models (with a catch). Proceedings of the Thirty-Eighth International Conference on Machine Learning (ICML).

PDF

(2021). Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies. AMIA 2021 Virtual Informatics Summit.

PDF Code

(2021). Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer. JCO Clinical Cancer Informatics.

PDF DOI

(2021). Assessing the Impact of Automated Suggestions on Decision Making: Domain Experts Mediate Model Errors but Take Less Initiative. CHI Conference on Human Factors in Computing Systems.

PDF

(2021). PClean: Bayesian Data Cleaning at Scale with Domain-Specific Probabilistic Programming. Proceedings of International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2021). Beyond perturbation stability: LP recovery guarantees for MAP inference on noisy stable instances. Proceedings of the Twenty-Fourth International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2021). Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media. Journal of Biomedical Informatics.

PDF DOI

(2021). Directing Human Attention in Event Localization for Clinical Timeline Creation. Proceedings of the Machine Learning for Healthcare Conference.

PDF

(2021). Deep Contextual Clinical Prediction with Reverse Distillation. Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence.

PDF

(2020). Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes. Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).

PDF Code

(2020). Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph. Proceedings of the Pacific Symposium on Biocomputing (PSB).

PDF

(2020). Robust Benchmarking for Machine Learning of Clinical Entity Extraction. Proceedings of the Machine Learning for Healthcare Conference.

PDF

(2020). Predicting Remission Among Patients With Rheumatoid Arthritis Starting Tocilizumab Monotherapy: Model Derivation and Remission Score Development. ACR Open Rheumatology.

PDF DOI

(2020). Predicting human health from biofluid-based metabolomics using machine learning. Scientific Reports.

PDF DOI

(2020). Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects. arXiv preprint arXiv:2001.07426.

PDF

(2020). Fast, Structured Clinical Documentation via Contextual Autocomplete. Proceedings of the Machine Learning for Healthcare Conference.

PDF

(2020). Estimation of Bounds on Potential Outcomes For Decision Making. Proceedings of the Thirty-Seventh International Conference on Machine Learning (ICML).

PDF

(2020). Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models. Proceedings of the Thirty-Seventh International Conference on Machine Learning (ICML).

PDF Code

(2020). Consistent Estimators for Learning to Defer to an Expert. Proceedings of the Thirty-Seventh International Conference on Machine Learning (ICML).

PDF

(2020). Characterization of Overlap in Observational Studies. Proceedings of the Twenty-Third International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF Code

(2020). A decision algorithm to promote outpatient antimicrobial stewardship for uncomplicated urinary tract infection. Science Translational Medicine.

PDF Code DOI

(2019). Derivation and validation of a machine learning record linkage algorithm between emergency medical services and the emergency department. Journal of the American Medical Informatics Association.

PDF DOI

(2019). Train and Test Tightness of LP Relaxations in Structured Prediction. Journal of Machine Learning Research.

PDF

(2019). Support and Invertibility in Domain-Invariant Representations. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2019). Overcomplete Independent Component Analysis via SDP. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2019). MEng Thesis: Fine-tuning Generative Models.

PDF

(2019). Improving documentation of presenting problems in the emergency department using a domain-specific ontology and machine learning-driven user interfaces. International Journal of Medical Informatics.

PDF DOI

(2019). Guidelines for reinforcement learning in healthcare.. Nature medicine.

PDF

(2019). Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models. Proceedings of the Thirty-Sixth International Conference on Machine Learning (ICML).

PDF Code

(2019). Block Stability for MAP Inference. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF

(2018). Why Is My Classifier Discriminatory?. Proceedings of the 32nd International Conference on Neural Information Processing Systems.

PDF

(2018). Semi-Amortized Variational Autoencoders. Proceedings of the 35th International Conference on Machine Learning (ICML).

PDF

(2018). Recurrent Neural Networks for Multivariate Time Series with Missing Values. Nature Scientific Reports.

PDF

(2018). Optimality of Approximate Inference Algorithms on Stable Instances. Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics (AI-STATS).

PDF

(2018). Max-margin learning with the Bayes Factor. Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI).

PDF

(2018). Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. arXiv preprint arXiv:1811.05975.

PDF

(2018). Learning Weighted Representations for Generalization Across Designs. ArXiv e-prints arXiv:1802.08598.

PDF

(2018). Learning Topic Models - Provably and Efficiently. Communications of the ACM.

PDF

(2018). Evaluating Reinforcement Learning Algorithms in Observational Health Settings.

PDF

(2018). Cell-specific prediction and application of drug-induced gene expression profiles. Proceedings of the Pacific Symposium on Biocomputing (PSB).

PDF

(2017). Using Machine Learning to Recommend Oncology Clinical Trials. Machine Learning for Health Care (Clinical abstract).

PDF

(2017). Structured Inference Networks for Nonlinear State Space Models. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence.

PDF

(2017). Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation. Proceedings of the 34th International Conference on Machine Learning.

PDF

(2017). Objective Assessment of Depressive Symptoms with Machine Learning and Wearable Sensors Data. Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

PDF

(2017). Learning a Health Knowledge Graph from Electronic Medical Records. Nature Scientific Reports.

PDF

(2017). Grounded Recurrent Neural Networks. ArXiv e-prints arXiv:1705.08557.

PDF

(2017). Estimating individual treatment effect: generalization bounds and algorithms. Proceedings of the 34th International Conference on Machine Learning.

PDF

(2017). Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network. Proceedings of the AMIA Summit on Clinical Research Informatics (CRI).

PDF

(2017). Early Identification of Patients with Acute Decompensated Heart Failure. Journal of Cardiac Failure.

PDF DOI

(2017). Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning. arXiv preprint arXiv:1705.00557.

PDF

(2017). Creating an Automated Trigger for Sepsis Clinical Decision Support at Emergency Department Triage using Machine Learning. PLoS ONE.

PDF

(2017). Contextual Autocomplete: A Novel User Interface Using Machine Learning to Improve Ontology Usage and Structured Data Capture for Presenting Problems in the Emergency Department. bioRxiv.

PDF

(2017). Causal Effect Inference with Deep Latent-Variable Models. Proceedings of the 31st International Conference on Neural Information Processing Systems.

PDF

(2016). Train and Test Tightness of LP Relaxations in Structured Prediction. Proceedings of The 33rd International Conference on Machine Learning.

PDF

(2016). Tightness of LP Relaxations for Almost Balanced Models. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics.

PDF

(2016). Population-Level Prediction of Type 2 Diabetes using Claims Data and Analysis of Risk Factors. Big Data.

PDF

(2016). Multi-task Prediction of Disease Onsets from Longitudinal Laboratory Tests. Proceedings of the 1st Machine Learning for Healthcare Conference.

PDF

(2016). Learning Representations for Counterfactual Inference. Proceedings of The 33rd International Conference on Machine Learning.

PDF

(2016). Learning Low-Dimensional Representations of Medical Concepts. Proceedings of the AMIA Summit on Clinical Research Informatics (CRI).

PDF

(2016). Identifiable Phenotyping using Constrained Non-Negative Matrix Factorization. Proceedings of the 1st Machine Learning for Healthcare Conference.

PDF

(2016). Electronic Medical Record Phenotyping using the Anchor & Learn Framework. Journal of the American Medical Informatics Association (JAMIA).

PDF

(2016). Comparison of approaches for heart failure case identification from electronic health record data. JAMA Cardiology.

PDF

(2016). Clinical Tagging with Joint Probabilistic Models. Proceedings of the 1st Machine Learning for Healthcare Conference.

PDF

(2016). Character-Aware Neural Language Models. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence.

PDF

(2015). Visual Exploration of Temporal Data in Electronic Medical Records. Proceedings of the American Medical Informatics Association (AMIA) Annual Symposium (Abstract).

PDF

(2015). Temporal Convolutional Neural Networks for Diagnosis from Lab Tests. arXiv preprint arXiv:1511.07938.

PDF

(2015). Predicting chronic comorbid conditions of type 2 diabetes in Newly-Diagnosed Diabetic Patients. Value in Health (Abstract).

PDF

(2015). How Hard is Inference for Structured Prediction?. Proceedings of the 32nd International Conference on Machine Learning (ICML).

PDF

(2015). Deep Kalman Filters. arXiv preprint arXiv:1511.05121.

PDF

(2015). Barrier Frank-Wolfe for Marginal Inference. Proceedings of the 28th International Conference on Neural Information Processing Systems.

PDF

(2015). Anchored Discrete Factor Analysis. arXiv preprint arXiv:1511.03299.

PDF

(2015). A Fast Variational Approach for Learning Markov Random Field Language Models. Proceedings of the 32nd International Conference on Machine Learning (ICML).

PDF

(2014). Using Anchors to Estimate Clinical State without Labeled Data. Proceedings of the American Medical Informatics Association (AMIA) Annual Symposium.

PDF

(2014). Unsupervised Learning of Disease Progression Models. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.

PDF

(2014). Understanding the Bethe Approximation: When and How can it go Wrong?. Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence (UAI-14).

PDF

(2014). Lifted Tree-Reweighted Variational Inference. Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence (UAI-14).

PDF

(2014). Instance Segmentation of Indoor Scenes using a Coverage Loss. Proceedings of the 13th European Conference on Computer Vision (ECCV).

PDF

(2013). Unsupervised Learning of Noisy-Or Bayesian Networks. Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI-13).

PDF

(2013). SparsityBoost: A New Scoring Function for Learning Bayesian Network Structure. Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI-13).

PDF

(2013). Predicting Chief Complaints at Triage Time in the Emergency Department. NeurIPS Workshop on Machine Learning for Clinical Data Analysis and Healthcare.

PDF

(2013). Discovering Hidden Variables in Noisy-Or Networks using Quartet Tests. Advances in Neural Information Processing Systems 26.

PDF

(2013). A Practical Algorithm for Topic Modeling with Provable Guarantees. Proceedings of the International Conference on Machine Learning (ICML).

PDF

(2012). Probabilistic models for personalizing web search. Proceedings of the Fifth ACM International Conference on Web Search and Data Mining.

PDF DOI

(2012). Introduction to Dual Decomposition for Inference. Optimization for Machine Learning.

PDF

(2012). Efficiently Searching for Frustrated Cycles in MAP Inference. Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI-12).

PDF

(2012). A Comparison of Dimensionality Reduction Techniques for Unstructured Clinical Text. ICML 2012 Workshop on Clinical Data Analysis.

PDF

(2011). Personalizing web search results by reading level. Proceedings of the 20th ACM International Conference on Information and Knowledge Management.

PDF DOI

(2011). Complexity of Inference in Latent Dirichlet Allocation. Advances in Neural Information Processing Systems 24.

PDF

(2010). On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP).

PDF

(2010). More data means less inference: A pseudo-max approach to structured learning. Advances in Neural Information Processing Systems 23.

PDF

(2010). Learning Efficiently with Approximate Inference via Dual Losses. Proceedings of the 27th International Conference on Machine Learning (ICML).

PDF

(2010). Learning Bayesian Network Structure using LP Relaxations. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AI-STATS).

PDF

(2010). Dual Decomposition for Parsing with Non-Projective Head Automata. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP).

PDF

(2009). Tree Block Coordinate Descent for MAP in Graphical Models. Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AI-STATS).

PDF

(2009). Scaling All-Pairs Overlay Routing. CoNEXT ‘09: Proceedings of the 5th international conference on Emerging networking experiments and technologies.

PDF DOI

(2009). Clusters and Coarse Partitions in LP Relaxations. Advances in Neural Information Processing Systems 21.

PDF

(2008). Tightening LP Relaxations for MAP using Message-Passing. 24th Conference on Uncertainty in Artificial Intelligence.

PDF

(2008). New Outer Bounds on the Marginal Polytope. Advances in Neural Information Processing Systems 20.

PDF

(2007). Probabilistic Modeling of Systematic Errors in Two-Hybrid Experiments. Pacific Symposium on Biocomputing.

PDF

(2005). BLOG: probabilistic models with unknown objects. IJCAI'05: Proceedings of the 19th international joint conference on Artificial intelligence.

PDF

(2005). Approximate Inference for Infinite Contingent Bayesian Networks. Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics.

PDF