NEC Labs America Attends ICML 2026 Seoul, South Korea July 6-11, 2026

NEC Laboratories America researchers are heading to Seoul this July for ICML 2026, the Forty-Third International Conference on Machine Learning. One of the most prestigious gatherings in the field, ICML draws academic and industry researchers from around the world to share work spanning machine learning, artificial intelligence, data science, and their many applications.

ICML 2026 SMM

This year, three members of the NECLA team will be presenting accepted papers at the conference, held July 6 through 11 at the COEX Convention and Exhibition Center.

Presentations

The presentations span agentic AI and coding documentation, survival analysis and subgroup discovery, and compositional control of diffusion models. Together, they reflect the depth and range of research underway at our company.

Escaping Whack-a-Mole: Optimizing Documentation as Repo-Specific Playbooks for Coding Agents

Wei Cheng NEC Labs America
Haifeng Chen NEC Labs America

Abstract: As large language models increasingly serve as autonomous coding agents, code documentation must be optimized for agent comprehension rather than human readability. We frame agent-oriented documentation generation as a black-box optimization problem over the documentation space, where quality is measured solely by downstream code correctness. A central challenge for conventional LLM refinement methods is output coupling—program entities are interdependent, and refining the documentation of one entity can invalidate its callers, resulting in a persistent whack-a-mole phenomenon during inference-time scaling. We propose DocSearch, a dependency-guided bi-level search framework that systematically exploits test-time feedback. The outer level conducts a priority search over the program-entity dependency DAG, enforcing a callee-before-caller refinement order to prevent downstream interference. The inner level performs a beam search over documentation refinements, using diversified error message sampling from self-generated unit tests to better exploit diagnostic signals and escape local optima. We provide theoretical guarantees of monotonic progress, showing that our worthy condition prevents regression while enabling efficient exploration. On DevEval+, DocSearch achieves a 90.7% solve rate with GPT-4o, outperforming the strongest baseline by 32.6%. Cross-language experiments further demonstrate that optimized documentation transfers effectively to different target programming languages.

Subgroup Discovery with the Cox Model

Zachary Izzo NEC Labs America

Abstract: We study the problem of subgroup discovery for survival analysis, where the goal is to find an interpretable subset of the data on which a Cox model is highly accurate. We examine why existing quality functions are insufficient for this problem and introduce two technical innovations: the expected prediction entropy (EPE), a novel metric for evaluating survival models that predict hazard functions, and the conditional rank statistics (CRS), which quantifies individual point deviation from a subgroup’s survival time distribution. We study the EPE and CRS theoretically and show that they address problems with existing metrics. We then introduce seven algorithms for Cox subgroup discovery. Our main algorithm is based on the DDGroup framework of Izzo et al. (2023) and leverages both the EPE and CRS, allowing theoretical correctness guarantees in well-specified settings. Empirical evaluation on synthetic and real data confirms our theory, showing our methods recover ground-truth subgroups in well-specified cases and achieve better model fit than naively fitting the Cox model to the entire dataset. A case study using NASA jet engine simulation data demonstrates that discovered subgroups reveal known nonlinearities in the data and suggest design choices that are mirrored in practice.

Logical Guidance Rules for the Exact Composition of Diffusion Models

Jonathan Warrell NEC Labs America
  • Jonathan Warrell (Presenting), Francesco Alesiani, Tanja Bien, Henrik Christiansen, Matheus Vitor Ferreira Ferraz, Mathias Niepert
  • Poster
  • Tue, Jul 7, 2026 • 10:30 AM – 12:15 PM KST
  • Hall A #2615
  • https://icml.cc/virtual/2026/poster/64380 

Abstract: We propose LOGDIFF (Logical Guidance for the Exact Composition of Diffusion Models), a guidance framework for diffusion models that enables principled constrained generation with complex logical expressions at inference time. We study when exact score-based guidance for complex logical formulas can be obtained from guidance signals associated with atomic attributes and constraints. First, we derive an exact Boolean calculus that provides a sufficient condition for exact logical guidance. Specifically, if a formula admits a circuit representation in which conjunctions combine conditionally independent subformulas and disjunctions combine subformulas that are either conditionally independent or mutually exclusive, exact logical guidance is achievable. In this case, the guidance signal can be computed exactly from atomic scores and posterior probabilities using an efficient recursive algorithm. Moreover, we show that, for commonly encountered classes of distributions, any desired Boolean formula is compilable into such a circuit representation. Second, by combining atomic guidance scores with posterior probability estimates, we introduce a hybrid guidance approach that bridges classifierguidance and classifier-free guidance, applicable to both compositional logical guidance and standard conditional generation. We demonstrate the effectiveness of our framework on multiple image and protein structure generation tasks.

Attending

Shaobo Han NEC Labs America
  • Shaobo Han (Attending)

Read About Our Future and Past Events

FiOLS 2025 Andrea

Andrea D’Amico Presents Open and Disaggregated Optical Networks: From Vision to Reality at FiO LS on October 29th

Join our Andrea D’Amico as he presents Open and Disaggregated Optical Networks: From Vision to Reality (FW6E.1) at part of the Next-Generation Optical Fiber Transmission Systems and Networks Session at the Frontiers in Optics + Laser Science (FiO LS) conference in Denver, CO, on October 29, 2025, 3:30 PM to 4:00 PM. Open and disaggregated optical networks can potentially reshape the telecom landscape.
PICOM25 Murugan

Murugan Sankaradas presents TalentScout: Multimodal AI-Driven Expert Finding in Organizations at PICom2025 on October 21st

Murugan Sankaradas (presenting virtually) will present “TalentScout: Multimodal AI-Driven Expert Finding in Organizations” at the IEEE International Conference on Pervasive Intelligence and Computing (PICom2025) on Tuesday, October 21 (10:30am–12pm JST) | Monday, October 20 (9:30–11pm ET) in Hokkaido, Japan.
ADFM2025

Abhishek Aich is Organizing the Anomaly Detection with Foundation Models Workshop, held in conjunction with ICCV 2025

We are proud to share that our Abhishek Aich is serving as one of the organizers of the Anomaly Detection with Foundation Models Workshop, held in conjunction with the International Conference on Computer Vision, October 20, 2025, 08:55 AM – 12:15 PM HST in Room 314 at theHawaii Convention Center, Honolulu, HI.
PICOM25 Kunal

Kunal Rao presents SlideCraft: Context-Aware Slides Generation Agent at PICom 2025 on October 21st

Kunal Rao (presenting virtually) will present “SlideCraft: Context-Aware Slides Generation Agent” at the IEEE International Conference on Pervasive Intelligence and Computing hashtag#PICom2025 on Tuesday, Oct 21 (10:30am–12pm JST) | Monday, Oct 20 (9:30–11pm ET) in Hokkaido, Japan. SlideCraft uses AI to automatically generate presentation slides from research content, making technical communication faster and context-aware for scientists and professionals.