Robust Benchmarking for Machine Learning of Clinical Entity Extraction

Abstract: Clinical studies often require understanding elements of a patient’s narrative that exist only in free text clinical notes. To transform notes into structured data for downstream use, these elements are commonly extracted and normalized to medical vocabularies. In this work, we audit the performance of and indicate areas of improvement for state-of-the-art systems. We find that high task accuracies for clinical entity normalization systems on the 2019 n2c2 Shared Task are misleading, and underlying performance is still brittle. Normalization accuracy is high for common concepts (95.3%), but much lower for concepts unseen in training data (69.3%). We demonstrate that current approaches are hindered in part by inconsistencies in medical vocabularies, limitations of existing labeling schemas, and narrow evaluation techniques. We reformulate the annotation framework for clinical entity extraction to factor in these issues to allow for robust end-to-end system benchmarking. We evaluate concordance of annotations from our new framework between two annotators and achieve a Jaccard similarity of 0.73 for entity recognition and an agreement of 0.83 for entity normalization. We propose a path forward to address the demonstrated need for the creation of a reference standard to spur method development in entity recognition and normalization.

Robust Benchmarking for Machine Learning of Clinical Entity Extraction

Monica Agrawal, Chloe O’Connell, Yasmin Fatemi, Ariel Levy, David Sontag

Comments

Similar Papers

MIMIC-Extract: a data extraction, preprocessing, and representation pipeline for MIMIC-III

Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan and Marzyeh Ghassemi, Michael C. Hughes, Tristan Naumann

Keywords Abstract Paper

Applied computing, Life and medical sciences, Health care information systems, Health informatics

Time-Aware Transformer-based Network for Clinical Notes Series Prediction

Dongyu Zhang, Jidapa Thadajarassiri, Cansu Sen, Elke Rundensteiner

Keywords Abstract Paper

Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression

Zhaozhi Qian, William Zame, Lucas Fleuren and Paul Elbers, Mihaela van der Schaar

Keywords Abstract Paper

deep learning, machine learning

Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset

Xiang Yue, Bernal Jimenez Gutierrez, Huan Sun

Keywords Abstract Paper

Clinical Comprehension, Machine comprehension, annotation, question answering

Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity

Yuxia Wang, Fei Liu, Karin Verspoor, Timothy Baldwin

Keywords Abstract Paper

Self-Supervised Adversarial Distribution Regularization for Medication Recommendation

Yanda Wang, Weitong Chen, Dechang PI and Lin Yue, Sen Wang, Miao Xu

Keywords Abstract Paper

Machine Learning, Deep Learning, AI for Life Science, Bio/Medicine

Explainable Clinical Decision Support from Text

Jinyue Feng, Chantal Shaib, Frank Rudzicz

Keywords Abstract Paper

sepsis prediction, clinical models, hierarchical model, multi-task model

GRASP: Generic Framework for Health Status Representation Learning Based on Incorporating Knowledge from Similar Patients

Chaohe Zhang, Xin Gao, Liantao Ma and Yasha Wang, Jiangtao Wang, Wen Tang

Keywords Abstract Paper

INPREM: An interpretable and trustworthy predictive model for healthcare

Xianli Zhang, Buyue Qian, Shilei Cao and Yang Li, Hang Chen, Yefeng Zheng, Ian Davidson

Keywords Abstract Paper

attention mechanism, healthcare informatics, model interpretability, model uncertainty

An adversarial approach for the robust classification of pneumonia from chest radiographs

Joseph D. Janizek, Gabriel Erion, Alex J. DeGrave, Su-In Lee

Keywords Abstract Paper

Applied computing, Life and medical sciences, Computing methodologies, Machine learning, Machine learning approaches, Learning latent representations, Neural networks

Self-Guided Multiple Instance Learning forWeakly Supervised Thoracic DiseaseClassification and Localizationin Chest Radiographs

Constantin Seibold, Jens Kleesiek, Heinz-Peter Schlemmer, Rainer Stiefelhagen

Keywords Abstract Paper

Hierarchical attention propagation for healthcare representation learning

Muhan Zhang, Christopher R. King, Michael Avidan, Yixin Chen

Keywords Abstract Paper

network embedding, attention mechanism, medical ontology

Dynamically Extracting Outcome-Specific Problem Lists from Clinical Notes with Guided Multi-Headed Attention

Justin Lovelace, Nathan C. Hurley, Adrian D. Haimovich, Bobak J. Mortazavi

Keywords Abstract Paper

A Novel Sequence-to-Subgraph Framework for Diagnosis Classification

Jun Chen, Quan Yuan, Chao Lu, Haifeng Huang

Keywords Abstract Paper

Multidisciplinary Topics and Applications, Biology and Medicine, Bio/Medicine, NLP Applications and Tools

Uncertainty-Aware Training of Neural Networks for Selective Medical Image Segmentation

Yukun Ding, Jinglan Liu, Xiaowei Xu and Meiping Huang, Jian Zhuang, Jinjun Xiong, Yiyu Shi

Keywords Abstract Paper

Latent-optimization based Disease-aware Image Editing for Medical Image Augmentation

Aakash saboo, Prashnna K Gyawali, Ankit Shukla and Manoj Sharma, Neeraj Jain, Linwei Wang

Keywords Abstract Paper

Latent optimization, StyleGAN, Image Editing, Chest X-ray, Image manipulation, constrained optimization, Disease progression, Disease quantification, Manifold, Latent space traversal

Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports

Yuhao Zhang, Derek Merck, Emily Tsai and Christopher D. Manning, Curtis Langlotz

Keywords Abstract Paper

Summarizing Reports, real-world applications, summarization reports, Neural models

SOS: Selective Objective Switch for Rapid Immunofluorescence Whole Slide Image Classification

Sam Maksoud, Kun Zhao, Peter Hobson and Anthony Jennings, Brian C. Lovell

Keywords Abstract Paper

whole-slide imaging, image classification, neural networks, multi-scale networks, patch-based classification, gigapixel image analysis, digital pathology

SafeDrug: Dual Molecular Graph Encoders for Recommending Effective and Safe Drug Combinations

Chaoqi Yang, Cao Xiao, Fenglong Ma and Lucas Glass, Jimeng Sun

Keywords Abstract Paper

Multidisciplinary Topics and Applications, Biology and Medicine, Applications of Supervised Learning, Bio/Medicine

Deidentification of free-text medical records using pre-trained bidirectional transformers

Alistair E. W. Johnson, Lucas Bulgarelli, Tom J. Pollard

Keywords Abstract Paper

Applied computing, Document management and text processing, Document preparation, Annotation

Representation learning for improved interpretability and classification accuracy of clinical factors from EEG

Garrett Honke, Irina Higgins, Nina Thigpen and Vladimir Miskovic, Katie Link, Sunny Duan, Pramod Gupta, Julia Klawohn, Greg Hajcak

Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan and
Marzyeh Ghassemi, Michael C. Hughes, Tristan Naumann

Keywords Paper

Keywords Paper

Zhaozhi Qian, William Zame, Lucas Fleuren and
Paul Elbers, Mihaela van der Schaar

Keywords Paper

Keywords Paper

Keywords Paper

Yanda Wang, Weitong Chen, Dechang PI and
Lin Yue, Sen Wang, Miao Xu

Keywords Paper

Keywords Paper

Chaohe Zhang, Xin Gao, Liantao Ma and
Yasha Wang, Jiangtao Wang, Wen Tang

Keywords Paper

Xianli Zhang, Buyue Qian, Shilei Cao and
Yang Li, Hang Chen, Yefeng Zheng, Ian Davidson

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yukun Ding, Jinglan Liu, Xiaowei Xu and
Meiping Huang, Jian Zhuang, Jinjun Xiong, Yiyu Shi

Keywords Paper

Aakash saboo, Prashnna K Gyawali, Ankit Shukla and
Manoj Sharma, Neeraj Jain, Linwei Wang

Keywords Paper

Yuhao Zhang, Derek Merck, Emily Tsai and
Christopher D. Manning, Curtis Langlotz

Keywords Paper

Sam Maksoud, Kun Zhao, Peter Hobson and
Anthony Jennings, Brian C. Lovell

Keywords Paper

Chaoqi Yang, Cao Xiao, Fenglong Ma and
Lucas Glass, Jimeng Sun

Keywords Paper

Keywords Paper

Garrett Honke, Irina Higgins, Nina Thigpen and
Vladimir Miskovic, Katie Link, Sunny Duan, Pramod Gupta, Julia Klawohn, Greg Hajcak

Keywords Paper

Maya Zohar, Omri Bar, Daniel Neimark and
Gregory D. Hager, Dotan Asselmann

Keywords Paper

Keywords Paper

Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort and
Andreas Kirsch, Uri Shalit, Yarin Gal

Keywords Paper

Rohan Kodialam, Rebecca Boiarsky, Justin Lim and
Aditya Sai, Neil Dixit, David Sontag

Keywords Paper

Zhuoning Yuan, Zhishuai Guo, Yi Xu and
Yiming Ying, Tianbao Yang

Keywords Paper

Xingtong Liu, Yiping Zheng, Benjamin Killeen and
Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath

Keywords Paper

Keywords Paper

Lida Zhang, Nathan C. Hurley, Bassem Ibrahim and
Erica Spatz, Harlan M. Krumholz, Roozbeh Jafari, Mortazavi J. Bobak

Keywords Paper

Keywords Paper

Keywords Paper

Qiao Jin, Chuanqi Tan, Mosha Chen and
Xiaozhong Liu, Songfang Huang

Keywords Paper

Keywords Paper

Xuan Gong, Xin Xia, Wentao Zhu and
Baochang Zhang, David Doermann, Li'an Zhuo

Keywords Paper

Sahil Chelaramani, Manish Gupta, Vipul Agarwal and
Prashant Gupta, Ranya Habash

Keywords Paper

Tung-Che Liang, Jin Zhou, Yun-Sheng Chan and
Tsung-Yi Ho, Krishnendu Chakrabarty, Cy Lee

Keywords Paper