Distilling Offline Action Detection Models into Real-Time Streaming Models

Vision Transformers (ViTs) have achieved state-of-the-art performance in offline video action detection, but their reliance on processing fixed-size clips with full spatio-temporal attention makes them computationally expensive and ill-suited for real-time streaming applications due to massive computational redundancy. This paper introduces a novel framework to adapt these powerful offline models into efficient, online student models through knowledge distillation. We propose two causal, streaming-friendly attention architectures that replace the full self-attention mechanism: (1) a lightweight Temporal Shift Attention that integrates past context with minimal overhead, and (2) a Decomposed Spatial-Temporal Attention that combines intra-frame spatial attention with an LSTM for temporal modeling. Both architectures utilize caching to eliminate redundant operations on a frame-by-frame basis. To maximize knowledge transfer, we introduce an uncertainty-guided distillation process, which formulates the training as a multi-task learning problem. Our resulting models demonstrate significant efficiency gains, achieving up to a4x improvement in latency and throughput compared to the original offline teacher while ensuring state-of-the-art performance for online methods. Our work provides a practical and effective methodology for deploying high-accuracy transformer models in latency-sensitive, real-world video analysis systems.

Celebrating the Women of NEC Laboratories America

NEC Laboratories America celebrates Women’s History Month and International Women’s Day by recognizing the women researchers, engineers, and interns whose work in artificial intelligence, optical networking, cybersecurity, and data science is helping shape the future of technology.

Logical Guidance for the Exact Composition of Diffusion Models

We propose LOGDIFF (Logical Guidance for the Exact Composition of Diffusion Models), a guidance framework for diffusion models that enables principled constrained generation with complex logical expressions at inference time. We study when exact score-based guidance for complex logical formulas can be obtained from guidance signals associated with atomic properties. First, we derive an exact Boolean calculus that provides a sufficient condition for exact logical guidance. Specifically, if a formula admits a circuit representation in which conjunctions combine conditionally independent subformulas and disjunctions combine subformulas that are either conditionally independent or mutually exclusive, exact logical guidance is achievable. In this case, the guidance signal can be computed exactly from atomic scores and posterior probabilities using an efficient recursive algorithm.Moreover, we show that, for commonly encountered classes of distributions, any desired Boolean formula is compilable into such a circuit representation. Second, by combining atomic guidance scores with posterior probability estimates, we introduce a hybrid guidance approach that bridges classifier guidance and classifier-free guidance, applicable to both compositional logical guidance and standard conditional generation. We demonstrate the effectiveness of our framework on multiple image and protein structure generation tasks.

NEC Labs America Attending OFC 2026 Los Angeles, March 15-19

NEC Laboratories America’s Optical Networking & Sensing team will participate in OFC 2026 in Los Angeles, March 15–19, contributing to panels, workshops, and courses focused on optical sensing, multicore fibers, and next-generation high-capacity optical communication systems.

Honoring Black Innovators Who Shaped Technology

Honoring Black innovators who shaped modern technology, from space exploration and collaborative computing to ethical AI and GPS. Discover how pioneering researchers like Katherine Johnson, Clarence Ellis, Timnit Gebru, and others continue to influence AI, computing, and scientific innovation.

HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles

Controllable driving scene generation is critical for realistic and scalable autonomous driving simulation, yet existing approaches struggle to jointly achieve photorealism and precise control. We introduce HorizonForge, a unified framework that reconstructs scenes as editable Gaussian Splats and Meshes, enabling fine-grained 3D manipulation and language-driven vehicle insertion. Edits are rendered through a noise-aware video diffusion process that enforces spatial and temporal consistency, producing diverse scene variations in a single feed-forward pass without per-trajectory optimization. To standardize evaluation, we further propose HorizonSuite, a comprehensive benchmark spanning ego- and agent-level editing tasks such as trajectory modifications and object manipulation. Extensive experiments show that Gaussian-Mesh representation delivers substantially higher fidelity than alternative 3D representations, and that temporal priors from video diffusion are essential for coherent synthesis. Combining these findings, HorizonForge establishes a simple yet powerful paradigm for photorealistic, controllable driving simulation, achieving an 83.4% user-preference gain and a 25.19% FID improvement over the second-best state-of-the-art method. Project page: https://horizonforge.github.io/.

Field Trial of High-Sensitivity Forward-Transmission Sensing for Real-World Event Detection Over Live Urban Fiber Networks

Vibration sensing based on forward transmission is an emerging topic for network protection and environmental monitoring, especially in long-haul submarine cables and urban fiber networks. However, previous field trials of this approach have mainly focused on localizing strong events under controlled or relatively quiet conditions. In this work, we investigate the capability of forward-transmission vibration sensing to detect weak signals in noisy environments. We demonstrate a high-sensitivity vibration sensing system operating over an 80-km deployed live urban fiber loop without optical amplifiers. The system is enhanced by adaptive time-frequency masking and in-band laser phase noise suppression techniques to improve sensitivity and noise robustness. It has successfully identified and localized weak real-world vibration events with peak-to-peak amplitude lower than 20 rad, such as construction activity near a manhole and even footsteps on handhole lids. Field trial results confirm its robust performance under dynamic environments, including road traffic-induced ground vibrations and aerial cable disturbances. To the best of our knowledge, this is the first demonstration of weak vibration event detection using forward transmission in urban fiber networks. It remarks a significant step towards practical distributed vibration sensing in smart city applications.

Manhole Localization and Condition Diagnostics in Telecom Networks Using Distributed Acoustic and Temperature Sensing

We present methods and field trial results demonstrating an integrated distributed acoustic sensing (DAS) and distributed temperature sensing (DTS) system for manhole localization, condition diagnostics, and anomaly detection in pre-deployed telecommunication fiber networks. The proposed system leverages ambient environmental signals, such as vibrational patterns from traffic and day-night temperature fluctuations, and machine learning techniques for automated detection. By combining DAS waterfall traces with temperature measurements from DTS, we achieve improved classification accuracy. Experimental results from three real-world testbeds in Texas and New Jersey show a significant improvement in classification accuracy—from 78.9% and 89.5% using DAS and DTS alone, respectively, to 94.7% via cross-referenced analysis. We propose a structured prediction formulation for manhole localization based on a U-Net architecture with a gated attention mechanism, where the label of each fiber location in the waterfall image is predicted using both its neighboring context and within-patch discriminative features. The method also supports cross-route generalization for manhole localization and enables condition diagnostics, identifying issues such as cable exposure and water ingress. These results highlight the potential for scalable deployment of fiber sensing solutions for real-time, continuous monitoring of telecom infrastructure.

MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

Uncovering causal structures from observational data is crucial for understanding complex systems and making informed decisions. While reinforcement learning (RL) has shown promise in identifying these structures in the form of a directed acyclic graph (DAG), existing methods often lack efficiency, making them unsuitable for online applications. In this paper, we propose MARLIN, an efficient multi-agent RL-based approach for incremental DAG learning. MARLIN uses a DAG generation policy that maps a continuous real-valued space to the DAG space as an intra-batch strategy, then incorporates two RL agents — state-specific and state-invariant — to uncover causal relationships and integrates these agents into an incremental learning framework. Furthermore, the framework leverages a factored action space to enhance parallelization efficiency. Extensive experiments on synthetic and real datasets demonstrate that MARLIN out-performs state-of-the-art methods in terms of both efficiency and effectiveness.

Brownian Bridge Augmented Surrogate Simulation and Injection Planning for Geological CO2 Storage

Geological CO2 storage (GCS) involves injecting captured CO2 into deep sub-surface formations to support climate goals. The effective management of GCS relies on adaptive injection planning to dynamically control injection rates and well pressures to balance both storage safety and efficiency. Prior literature, including numerical optimization methods and surrogate-optimization methods, is limited by real-world GCS requirements of smooth state transitions and goal-directed planning within limited time. To address these limitations, we propose a Brownian Bridge — augmented framework for surrogate simulation and injection planning in GCS and develop two insights (i) Brownian bridge as smooth state regularizer for better surrogate simulator; (ii) Brownian bridge as goal-time-conditioned planning guidance for better injection planning. Our method has three stages: (i) learning deep Brownian bridge representations with contrastive and reconstructive losses from historical reservoir and utility trajectories, (ii) incorporating Brownian bridge-based next state interpolation for simulator regularization (iii) guiding injection planning with Brownian utility-conditioned trajectories to generate high-quality injection plans. Experimental results across multiple datasets collected from diverse GCS settings demonstrate that our framework consistently improves simulation fidelity and planning effectiveness while maintaining low computational overhead.