Projects

A collection of research, software, and other projects.

Software

Open source software projects for physics and machine learning.

cabinetry active

cabinetry

Design and steer profile likelihood fits with pyhf.

python statistics HEP pyhf
2020 – present
MadMiner active

MadMiner

Machine learning-based inference for particle physics with neural ratio estimation.

python simulation-based-inference HEP
2018 – present
pyhf active

pyhf

Pure-Python implementation of HistFactory statistical models with auto-differentiation.

python statistics HEP
2018 – present
REANA active

REANA

Reproducible research data analysis platform for containerized computational workflows.

python reproducibility workflows containers
2017 – present
Excursion completed

Excursion

Active learning for excursion set estimation with Gaussian processes.

python active-learning Gaussian-processes
2018
TreeNiN completed

TreeNiN

Network-in-Network architecture for jet physics with recursive neural networks.

python neural-networks jets HEP
2017
carl completed

carl

Toolbox for likelihood-free inference using calibrated classifiers and ratio estimation.

python likelihood-free-inference machine-learning
2016
completed

HistFactory

A tool for building statistical models for measurements based on template histograms.

C++ statistics HEP
2012
completed

RooFit & RooStats

Statistical data analysis toolkit integrated with ROOT.

C++ statistics CERN
2003

Collaborations

Large-scale research collaborations and institutes.

IRIS-HEP active

IRIS-HEP

Institute for Research and Innovation in Software for High Energy Physics.

software HEP NSF
2018 – present
Scikit-HEP active

Scikit-HEP

Community-driven and community-oriented Python software for High Energy Physics.

python software HEP
2016 – present
ATLAS Experiment active

ATLAS Experiment

One of the two general-purpose particle physics experiments at the Large Hadron Collider.

HEP CERN LHC
2007 – present

Research

Research initiatives and projects.

Featured active

AI for Amplitudes

Using machine learning and transformers to bootstrap scattering amplitudes and explore the space of consistent quantum field theories.

machine-learning amplitudes QFT
2024 – present
Precision EFT Measurements Featured active

Precision EFT Measurements

Constraining Effective Field Theories with machine learning for precision measurements at the LHC.

machine-learning EFT HEP
2018 – present
Ginkgo-RL completed

Ginkgo-RL

Reinforcement learning for clustering in the Ginkgo particle shower simulator.

reinforcement-learning HEP jets
2020
Featured completed

AI for Lattice QCD

Machine learning approaches to sampling in lattice QCD and other field theories, including equivariant normalizing flows.

machine-learning lattice-QCD normalizing-flows
2019
Crayfis completed

Crayfis

Citizen science project using smartphone cameras to detect cosmic rays.

citizen-science mobile cosmic-rays
2014 – 2020
Discovery of the Higgs Boson Featured completed

Discovery of the Higgs Boson

Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC.

HEP Higgs ATLAS discovery
2012
completed

APEX Experiment

Search for a new dark photon (A' boson) in the sub-GeV mass range at Jefferson Lab, looking for narrow bumps in the QED e+e- spectrum.

HEP dark-photon Jefferson-Lab
2010 – 2011

Methodology

Statistical and machine learning methodology.

Featured active

Simulation-Based Inference

Machine learning methods for likelihood-free inference with complex simulators.

machine-learning statistics simulation
2016 – present
Backdrop completed

Backdrop

Implementation and demonstration of backdrop in PyTorch with a GP dataset generator.

machine-learning PyTorch Gaussian-processes
2019
Active Sciencing Featured completed

Active Sciencing

Active learning + reusable workflows + likelihood-free inference.

active-learning workflows simulation-based-inference
2018
Neural Unfolding completed

Neural Unfolding

Neural network approaches to detector unfolding in particle physics.

machine-learning unfolding HEP
2018
Ambient Fisher completed

Ambient Fisher

Ambient Fisher Information in the infinite spherical geometry of the space of all distributions.

statistics information-geometry
2017
GP MC Template Smoother completed

GP MC Template Smoother

Gaussian process Monte Carlo template smoother for histogram smoothing.

Gaussian-processes statistics HEP
2016
Expected Contours completed

Expected Contours

Exploring a subtle issue for expected contours in statistical inference.

statistics visualization
2015
Decouple completed

Decouple

Tools for decoupling and recoupling statistical models.

statistics HEP
2014
Decoupled Demo completed

Decoupled Demo

Demo of recoupling a decoupled project with effective likelihoods and template parametrizations.

statistics HEP reinterpretation
2014
Featured completed

Profile Likelihood Ratio

Asymptotic formulae for likelihood-based tests of new physics. Over 10,000 citations.

statistics HEP hypothesis-testing
2011
KEYS Historical completed

KEYS Historical

Ancient kernel estimation code for PAW (Physics Analysis Workstation).

statistics kernel-estimation PAW
2000

Teaching

Educational materials, courses, and tutorials.

SBI Tutorial active

SBI Tutorial

Tutorial on simulation-based inference methods.

simulation-based-inference machine-learning tutorial
2020 – present
completed

Machine Learning Chapter in PDG

Chapter on machine learning methods in the Particle Data Group Review of Particle Physics.

machine-learning HEP PDG
2021
Practical Statistics for LHC completed

Practical Statistics for LHC

A living document on practical statistics for LHC physics analyses.

statistics HEP education
2015
Intro Experimental Physics II completed

Intro Experimental Physics II

Companion materials for NYU Introduction to Experimental Physics II course.

physics education NYU
2014
completed

Statistics and Data Science for Physicists

Graduate course on statistical methods and data science for particle physics.

statistics data-science education
2007

Leadership

Leadership roles and institutional initiatives.

active

Open Source Program Office

UW-Madison's Open Source Program Office, promoting open source practices across campus.

open-source UW-Madison
2024 – present
Featured active

UW-Madison Data Science Institute

David R. Anderson Director of the Data Science Institute at UW-Madison.

2022 – present
DS3 Needs Assessment Survey completed

DS3 Needs Assessment Survey

Survey to assess data science needs and priorities among NYU faculty.

data-science NYU survey
2016
Moore-Sloan Data Science Environment completed

Moore-Sloan Data Science Environment

NYU's Data Science Environment, part of the Moore-Sloan Data Science Environments initiative.

data-science NYU Moore-Sloan
2014 – 2019

Community

Community building and collaborative efforts.

active

Hammers & Nails

Workshop series bridging machine learning and physical sciences.

machine-learning physics workshop
2017 – present
ML4PS Workshop Featured active

ML4PS Workshop

Machine Learning for Physical Sciences workshop series at NeurIPS.

machine-learning physics NeurIPS
2017 – present
completed

P5

Particle Physics Project Prioritization Panel - shaping the future of U.S. particle physics.

HEP policy planning
2023
completed

ML4Jets

Workshop series on machine learning for jet physics.

machine-learning jets HEP workshop
2017

Publishing

Publications, journals, and editorial work.

Featured active

Machine Learning: Science and Technology

Editor-in-Chief of IOP Publishing's journal on ML for science.

journal machine-learning science
2022 – present
Featured active

Publishing Statistical Models

Initiative to publish full statistical models from ATLAS analyses for reuse and reinterpretation.

open-science HEP statistics ATLAS
2019 – present
active

GitHub → Zenodo DOIs

Helped establish the GitHub-Zenodo integration for automatic DOI minting of software releases.

software-citation DOI open-science
2014 – present
INSPIRE active

INSPIRE

High Energy Physics literature database and information system.

HEP literature database
2010 – present
RECAST Featured active

RECAST

A framework for reinterpreting LHC analyses for new physics models.

reinterpretation HEP analysis-preservation
2010 – present

Web

Web projects, tools, and visualizations.

Theory and Practice active

Theory and Practice

Personal website and blog on physics, statistics, and machine learning.

blog pelican python
2012 – present

Fun

Side projects and experiments.

Cassville Checkers active

Cassville Checkers

Claude Code project implementing a checkers game variant.

game claude-code python
2024 – present
Vectorscope active

Vectorscope

Interactive visualization tool for exploring high-dimensional vector embeddings.

python visualization embeddings
2024 – present
Travel Map active

Travel Map

Interactive map visualization of travel history and destinations.

visualization python maps
2023 – present
Play active

Play

Fun little experimental projects and sandbox.

experiments sandbox
2020 – present
Claude Code First Attempt completed

Claude Code First Attempt

A fun project to automatically align images using Claude Code.

claude-code image-processing
2024
LIGO Binder completed

LIGO Binder

Binderized version of LIGO GW150914 gravitational wave tutorial.

gravitational-waves LIGO Binder tutorial
2016
Standard Model Graphic completed

Standard Model Graphic

TikZ version of the Standard Model graphic from the Particle Fever documentary.

visualization LaTeX TikZ physics
2014
UnicodeIt completed

UnicodeIt

Convert LaTeX markup to Unicode characters for use anywhere - email, Twitter, Slack, etc.

LaTeX Unicode web-tool
2010