Machine Learning | Zachary Izzo

Zachary Izzo

Researcher

Machine Learning

Projects

Physics Informed Machine Learning

Overview: Since 2019, the parameter counts of large deep learning models have grown by more than 300 times every 18 months. However, future ML progress cannot continue simply by using more data or building larger models, because the growing gap between model demand and resource supply is not sustainable.

Trustworthy Generative AI

Overview: The Trustworthy Generative AI Project is focused on developing advanced multimodal generative models that can create and reason with content across text, images, reports, and 3D videos. These models are designed for applications in advertisement, entertainment, law enforcement, and healthcare.

Publications

To Err Is Human: Systematic Quantification of Errors in Published AI Papers via LLM Analysis

December 4, 2025/arXiv

How many mistakes do published AI papers contain? Peer-reviewed publications form the foundation upon which new research and knowledge are built. Errors that persist in the literature can propagate unnoticed, creating confusion in follow-up studies and complicating reproducibility. The accelerating pace

Quantitative Bounds for Length Generalization in Transformers

November 10, 2025/arXiv

We study the problem of length generalization (LG) in transformers: the ability of a model trained on shorter sequences to maintain performance when evaluated on much longer, previously unseen inputs. Prior work by Huang et al. (2025) established that transformers eventually achieve length generalization

Group Relative Augmentation for Data Efficient Action Detection

July 30, 2025/arXiv

Adapting large Video-Language Models (VLMs) for action detection using only a few examples poses challenges like overfitting and the granularity mismatch between scene-level pre-training and required person-centric understanding. We propose an efficient adaptation strategy combining parameter-efficient

Quantitative Bounds for Length Generalization in Transformers

July 19, 2025/3rd Workshop on High-dimensional Learning Dynamics (HiLD), San Diego, CA

We provide quantitative bounds on the length of sequences required to be observed during training for a transformer to length generalize, i.e., to continue to perform well on sequences unseen during training. Our results improve on Huang et al. [8], who show that there is a finite training length beyond

Solving Inverse Problems via a Score-Based Prior: An Approximation-Free Posterior Sampling Approach

June 5, 2025/arXiv

Diffusion models (DMs) have proven to be effective in modeling high-dimensional distributions, leading to their widespread adoption for representing complex priors in Bayesian inverse problems (BIPs). However, current DM-based posterior sampling methods proposed for solving common BIPs rely on heuristic

Domain-Guided Weight Modulation for Semi-Supervised Domain Generalization

March 3, 2025/WACV 2025

Unarguably, deep learning models capable of generalizing to unseen domain data while leveraging a few labels are of great practical significance due to low developmental costs. Toward this end, we study the challenging problem of semi-supervised domain generalization (SSDG), where the goal is

Subgroup Discovery with the Cox Model

December 15, 2024/NeurIPS 2024 Interpretable AI workshop

We study the problem of subgroup discovery with Cox regression models and introduce a method for finding an interpretable subset of the data on which a Cox model is highly accurate. Our method relies on two technical innovations: the expected prediction entropy, a novel metric

NEC Labs America Team Attending NeurIPS24 in Vancouver

December 3, 2024

NEC Labs America is proud to attend NeurIPS 2024 in Vancouver, Canada from December 10-15. Zachary Izzo will present Subgroup Discovery with the Cox Model, Shaobo Han will present VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks and Jonathan Warrell will present Discrete-Continuous

Matching Confidences and Softened Target Occurrences for Calibration

November 27, 2024/Digital Image Computing: Techniques & Applications (DICTA 2024)

The problem of calibrating deep neural networks (DNNs) is gaining attention, as these networks are becoming central to many real-world applications. Different attempts have been made to counter the poor calibration of DNNs. Amongst others, train-time calibration methods have unfolded as an effective

Introducing the Trustworthy Generative AI Project: Pioneering the Future of Compositional Generation and Reasoning

August 19, 2024

We are thrilled to announce the launch of our latest research initiative, the Trustworthy Generative AI Project. This ambitious project is set to revolutionize how we interact with multimodal content by developing cutting-edge generative models capable of compositional generation and reasoning across

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

July 21, 2024/The 41st International Conference on Machine Learning (ICML 2024)

We present an approach for estimating the fraction of text in a large corpus which is likely to be substantially modified or produced by a large language model (LLM). Our maximum likelihood model leverages expert-written and AI-generated reference texts to accurately and efficiently examine real-world

Provable Membership Inference Privacy

April 9, 2024/Transactions on Machine Learning Research

In applications involving sensitive data, such as finance and healthcare, the necessity for preserving data privacy can be a significant barrier to machine learning model development. Differential privacy (DP) has emerged as one canonical standard for provable privacy. However, DP's strong theoretical

NEC Labs America Team Heading to NeurIPS23 in New Orleans

December 7, 2023

NEC Labs America is proud to be a Silver Sponsor for NeurIPS 2023 in New Orleans from December 10-16. Visit our booth to meet our team and learn about our intern opportunities in machine learning, data science, media analytics and integrated systems. Also, our Vijay Kumar.B.G, Samuel Schulter & Manmohan
