Shaobo Han NEC Labs AmericaShaobo Han is a Senior Researcher in the Optical Networking and Sensing Department at NEC Laboratories America in Princeton, NJ. He received his Ph.D. in Electrical and Computer Engineering and his M.S. in Statistical Science from Duke University, where his research focused on probabilistic modeling, transfer learning, and structured variational inference. He also earned an M.Eng. degree in Signal and Information Processing from the University of Chinese Academy of Sciences.

At NEC, Dr. Han has been prototyping and delivering advanced algorithmic solutions for real-world applications of sensing AI. By leveraging massive waveform data from NEC’s distributed fiber-optic sensors and cutting-edge machine learning technologies, his work transforms telecom infrastructure into a dense, large-scale network of acoustic sensors capable of real-time situational awareness. His research has led to multiple world-first and industry-first technology field trials and commercial products. He is the recipient of the NECAM Extra Mile Award, and the Outstanding Performance Award from NEC’s Global Innovation Business Unit (GIBU).

He also conducts research on parameter-efficient fine-tuning of large language models and the flexible adaptation of audio-language models. He holds more than 10 U.S. patents and has authored over 50 peer-reviewed papers in top-tier venues, including NeurIPS, ICLR, ICML, AISTATS, ICASSP, OFC, IEEE Transactions on Signal Processing, and the Journal of Lightwave Technology. His innovations advance the learning of structured, interpretable representations of the physical world from raw sensory inputs and enable cost-effective generalization to new environments and deployment scenarios.

Posts

CLAP-S: Support Set Based Adaptation for Downstream Fiber-optic Acoustic Recognition

Contrastive Language-Audio Pretraining (CLAP) models have demonstrated unprecedented performance in various acoustic signal recognition tasks. Fiber optic-based acoustic recognition is one of the most important downstream tasks and plays a significant role in environmental sensing. Adapting CLAP for fiber-optic acoustic recognition has become an active research area. As a non-conventional acoustic sensor, fiber-optic acoustic recognition presents a challenging, domain-specific, low-shot deployment environment with significant domain shifts due to unique frequency response and noise characteristics. To address these challenges, we propose a support-based adaptation method, CLAP-S, which linearly interpolates a CLAP Adapter with the Support Set, leveraging both implicit knowledge through fine-tuning and explicit knowledge retrieved from memory for cross-domain generalization. Experimental results show that our method delivers competitive performance on both laboratory-recorded fiber-optic ESC-50 datasets and a real-world fiber-optic gunshot-firework dataset. Our research also provides valuable insights for other downstream acoustic recognition tasks.

Multi-span optical power spectrum prediction using cascaded learning with one-shot end-to-end measurement

Scalable methods for optical transmission performance prediction using machine learning (ML) are studied in metro reconfigurable optical add-drop multiplexer (ROADM) networks. A cascaded learning framework is introduced to encompass the use of cascaded component models for end-to-end (E2E) optical path prediction augmented with different combinations of E2E performance data and models. Additional E2E optical path data and models are used to reduce the prediction error accumulation in the cascade. Off-line training (pre-trained prior to deployment) and transfer learning are used for component-level erbium-doped fiber amplifier (EDFA) gain models to ensure scalability. Considering channel power prediction, we show that the data collection processof the pre-trained EDFA model can be reduced to only 5% of the original training set using transfer learning. We evaluate the proposed method under three different topologies with field deployed fibers and achieve a mean absolute error of 0.16 dB with a single (one-shot) E2E measurement on the deployed 6-span system with 12 EDFAs.

VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

As the adoption of large language models increases and the need for per-user or per-task model customization grows, the parameter-efficient fine-tuning (PEFT) methods, such as low-rank adaptation (LoRA) and its variants, incur substantial storage and transmission costs. To further reduce stored parameters, we introduce a “divide-and-share” paradigm that breaks the barriers of low-rank decomposition across matrix dimensions, modules, and layers by sharing parameters globally via a vector bank. As an instantiation of the paradigm to LoRA, our proposed VB-LoRA composites all the low-rank matrices of LoRA from a shared vector bank with a differentiable top-k admixture module. VB-LoRA achieves extreme parameter efficiency while maintaining comparable or better performance compared to state-of-the-art PEFT methods. Extensive experiments demonstrate the effectiveness of VB-LoRA on natural language understanding, natural language generation, instruction tuning, and mathematical reasoning tasks. When fine-tuning the Llama2-13B model, VB-LoRA only uses 0.4% of LoRA’s stored parameters, yet achieves superior results. Our source code is available at https://github.com/leo-yangli/VB-LoRA. This method has been merged into the Hugging Face PEFT package.

NEC Labs America Team Attending NeurIPS24 in Vancouver

NEC Labs America is proud to attend NeurIPS 2024 in Vancouver, Canada from December 10-15. Zachary Izzo will present Subgroup Discovery with the Cox Model, Shaobo Han will present VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks and Jonathan Warrell will present Discrete-Continuous Variational Optimization with Local Gradients.

First Field Trial of Hybrid Fiber Sensing with Data Transmission Resulting in Enhanced Sensing Sensitivity and Spatial Resolution

Optical fiber cables, initially designed for telecommunications, are increasingly repurposed for environmental monitoring using distributed fiber sensing technologies [1,2]. Distributed acoustic sensing (DAS) based on phase optical time domain reflectometry (?-OTDR) of Rayleigh backscatter enables various applications including traffic monitoring [3], railway [4] and perimeter intrusion detection [5] and cable damage detection [6], etc. The sensing range of DAS is typically limited to several tens of kilometers due to low optical signal-to-noise (OSNR) of the received backscatter. Additionally, compatibility of DAS with existing fiber infrastructure is hindered by the unidirectional operation of inline amplifiers with isolators. An alternative approach based on forward transmission was recently proposed [7, 8], which involves probing an optical fiber with a continuous wave (CW) signal and measuring either changes in received phase or the state of polarization (SOP) to detect cumulative vibration-induced strain. Unlike backscatter measurement, forward transmissions methods have longer sensing range due to higher OSNR, and is compatible with existing telecom infrastructure. However, potential challenges include limited localization accuracy, and low number of simultaneous events that can be discriminated and localized [7]. In this paper, we propose a new concept of “hybrid fiber sensing” for long-haul DWDM networks where the repeater node architecture combines DAS with forward-phase sensing (FPS), enhancing sensitivity by 32%. This approach achieves a multi-span, fine-resolution fiber sensing system. The FPS method detects vibration anomalies and coarsely localizes its position to within a fiber span. A segmented DAS then refines the position estimate and provides a precise waveform measurement. Consequently, the special resolution improves from one fiber span of 80 km to 4 m. Our scheme is validated on a test bed comprising lab spools and field fibers, demonstrating the capability to detect and monitor field construction while simultaneously supporting full C-band 400-Gb/s real-time (RT) data transmission.

Seeing the Vibration from Fiber-Optic Cables: Rain Intensity Monitoring using Deep Frequency Filtering

The various sensing technologies such as cameras LiDAR radar and satellites with advanced machine learning models offers a comprehensive approach to environmental perception and understanding. This paper introduces an innovative Distributed Fiber Optic Sensing (DFOS) technology utilizing the existing telecommunication infrastructure networks for rain intensity monitoring. DFOS enables a novel way to monitor weather condition and environmental changes provides real-time continuous and precise measurements over large areas and delivers comprehensive insights beyond the visible spectrum. We use rain intensity as an example to demonstrate the sensing capabilities of DFOS system. To enhance the rain sensing performance we introduce a Deep Phase-Magnitude Network (DFMN) divide the raw sensing data into phase and magnitude component allowing targeted feature learning on each component independently. Furthermore we propose a Phase Frequency learnable filter (PFLF) for the phase component filtering and conduct standard convolution layers on the magnitude component leveraging the inherent physical properties of optical fiber sensing. We formulate the phase-magnitude channel into a parallel network and subsequently fuse the features for a comprehensive analysis in the end. Experimental results on the collected fiber sensing data show that the proposed method performs favorably against the state-of-the-art approaches.

Deep Learning-based Intrusion Detection and Impulsive Event Classification for Distributed Acoustic Sensing across Telecom Networks

We introduce two pioneering applications leveraging Distributed Fiber Optic Sensing (DFOS) and Machine Learning (ML) technologies. These innovations offer substantial benefits forfortifying telecom infrastructures and public safety. By harnessing existing telecom cables, our solutions excel in perimeter intrusion detection via buried cables and impulsive event classification through aerial cables. To achieve comprehensive intrusion detection, we introduce a label encoding strategy for multitask learning and evaluate the generalization performance of the proposed approach across various domain shifts. For accurate recognition of impulsive acoustic events, we compare several standard choices of representations for raw waveform data and neural network architectures, including convolutional neural networks (ConvNets) and vision transformers (ViT).We also study the effectiveness of the built-in inductive biases under both high- and low-fidelity sensing conditions and varying amounts of labeled training data. All computations are executed locally through edge computing, ensuring real-time detection capabilities. Furthermore, our proposed system seamlessly integrates with cameras for video analytics, significantly enhancing overall situation awareness of the surrounding environment.

NEC Labs America Team Attending CVPR 2024 in Seattle

Our team will be attending CVPR 2024 (The IEEE /CVF Conference on Computer Vision & Pattern Recognition) from June 17-21! See you there at the NEC Labs America Booth 1716! Stay tuned for more information about our participation.

Multi-Span Optical Power Spectrum Prediction using ML-based EDFA Models and Cascaded Learning

We implement a cascaded learning framework using component-level EDFA models for optical power spectrum prediction in multi-span networks, achieving a mean absolute error of 0.17 dB across 6 spans and 12 EDFAs with only one-shot measurement.

Real-time Intrusion Detection and Impulsive Acoustic Event Classification with Fiber Optic Sensing and Deep Learning Technologies over Telecom Networks

We review various use cases of distributed-fiber-optic-sensing and machine-learning technologies that offer advantages to telecom fiber networks on existing fiber infrastructures. Byleveraging an edge-AI platform, perimeter intrusion detection and impulsive acoustic event classification can be performed locally on-the-fly, ensuring real-time detection with low latency.