Downloads
Immunomics – JL-GloVe
March 2025
We employ GloVe and random projection theory to infer immunologically meaningful T-cell receptor embeddings from adaptive immune repertoires. This repository contains the Pytorch code to replicate experiments in our paper “Scalable Universal T-Cell Receptor Embeddings from Adaptive Immune Repertoires” accepted…
RAD-DINO model
November 2024
RAD-DINO is a vision transformer model trained to encode chest X-rays using the self-supervised learning method DINOv2. RAD-DINO is described in detail in RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision (F. Pérez-García, H. Sharma, S. Bond-Taylor, et al., 2024).
MAIRA-2 model
November 2024
MAIRA-2 is a multimodal transformer designed for the generation of grounded or non-grounded radiology reports from chest X-rays. It is described in more detail in MAIRA-2: Grounded Radiology Report Generation (S. Bannur, K. Bouzid et al., 2024). MAIRA-2 has been built…
RadFact: An LLM-based Evaluation Metric for AI-generated Radiology Reporting
November 2024
RadFact is a framework for the evaluation of model-generated radiology reports given a ground-truth report, with or without grounding. Leveraging the logical inference capabilities of large language models, RadFact is not a single number but a suite of metrics, capturing aspects of precision…
PadChest-GR dataset
November 2024
PadChest-GR is a manually annotated, bilingual chest X-ray dataset designed to train and evaluate models for grounded radiology report generation. It includes bounding boxes and comprehensive annotations of all clinically relevant findings.
ProtNote: a multimodal method for protein-function annotation
October 2024
ProtNote is a multimodal deep learning model that leverages free-form text to enable both supervised and zero-shot protein function prediction.
RadFact
August 2024
RadFact is a framework for the evaluation of model-generated radiology reports given a ground-truth report, with or without grounding. Leveraging the logical inference capabilities of large language models, RadFact is not a single number but a suite of metrics, capturing…
Vaccine Search Study
September 2023
This repository contains code and data for “Accurate Measures of Vaccination and Concerns of Vaccine Holdouts from Web Search Logs” (2023) by Serina Chang, Adam Fourney, and Eric Horvitz.
HI-ML Multimodal Toolbox
May 2023
HI-ML toolbox for deep learning for medical imaging and Azure integration. The Microsoft Health Intelligence Machine Learning Toolbox aims at providing low-level and high-level building blocks for Machine Learning / AI researchers and practitioners. It helps to simplify and streamline…