Saahil Jain

I am currently working on You.com, a new search engine.

I'm broadly excited by artificial intelligence (AI), deep learning, and natural language processing (NLP). At Stanford, I researched these topics under Professor Andrew Ng in the Stanford Machine Learning Group. Much of my research focused on developing better methods of learning from limited/noisy labels as well as building multimodal datasets and models in resource-constrained domains such as healthcare.

I’ve worn various hats, working across engineering, product, and research. I received my MS in Computer Science at Stanford University. Prior to Stanford, I worked as a product manager at Microsoft. I received my BS at Columbia University, where I studied Computer Science and Economics.

To get in touch, feel free to email me at saahil.jain@cs.stanford.edu.

Email  /  Google Scholar  /  LinkedIn  /  Twitter  /  Github

profile photo
Experiences
Engineer, You.com [2020-Present]
Researcher, Stanford Machine Learning Group (Andrew Ng's group)
Fellow, 2021 Accel Innovation Scholar
Engineer, Datavant
Product Manager, Microsoft
Researcher, Tatonetti Lab, Columbia Medical Center
Researcher, Wireless & Mobile Networking Lab, Columbia University
Research
RadGraph: Extracting Clinical Entities and Relations from Radiology Reports
Saahil Jain*, Ashwin Agrawal*, Adriel Saporta*, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Yuhao Zhang, Matthew P. Lungren, Andrew Y. Ng, Curtis P. Langlotz, Pranav Rajpurkar
Proceedings of the Neural Information Processing Systems (NeurIPS) Track on Datasets and Benchmarks, 2021, (Oral)
paper / dataset

VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels
Saahil Jain*, Akshay Smit*, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar
Proceedings of the Conference on Health, Inference, and Learning (ACM-CHIL), 2021
paper / code

Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation
Saahil Jain*, Akshay Smit*, Andrew Y. Ng, Pranav Rajpurkar
Neural Information Processing Systems (NeurIPS) Workshop on Data-Centric AI (DCAI), 2021
paper

On the Opportunities and Risks of Foundation Models: §3.1 Healthcare and Biomedicine
Michihiro Yasunaga, Jing Huang, Camilo Ruiz, Yuhui Zhang, Giray Ogut, Saahil Jain, William Wang, Yusuf Roohani, Hongyu Ren, Antoine Bosselut, Ehsan Adeli, Jure Leskovec, Russ Altman
Whitepaper, 2021
paper

Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management
Cécile Logé*, Emily Ross*, David Yaw Amoah Dadey, Saahil Jain, Adriel Saporta, Andrew Y. Ng, Pranav Rajpurkar
Proceedings of the Neural Information Processing Systems (NeurIPS) Track on Datasets and Benchmarks, 2021, (Oral)
paper / dataset

CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT
Akshay Smit*, Saahil Jain*, Pranav Rajpurkar*, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
paper / code

Power-Aware Neighbor Discovery for Energy Harvesting Things: Demo Abstract
Tingjun Chen, Gregory Chen, Saahil Jain, Robert Margolies, Guy Grebla, Dan Rubenstein, Gil Zussman
Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems (SenSys), 2016
paper

Teaching
Teaching Assistant: Stanford CS 372 [2020, 2021], AI for Disease Diagnosis and Information Recommendations with Edward Chang
Teaching Assistant: Stanford CS 229 [2020], Machine Learning with Andrew Ng, Chris Ré, and Tengyu Ma

Website Template