Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided MRC models, we propose to adopt the inter-sentence syntactic relations, in addition to the rudimentary intra-sentence relations, to further utilize the syntactic dependencies in the multi-sentence input of the MRC task. In our approach, we build the Inter-Sentence Dependency Graph (ISDG) connecting dependency trees to form global syntactic relations across sentences. We then propose the ISDG encoder that encodes the global dependency graph, addressing the inter-sentence relations via both one-hop and multi-hop dependency paths explicitly. Experiments on three multilingual MRC datasets (XQuAD, MLQA, TyDiQA-GoldP) show that our encoder that is only trained on English is able to improve the zero-shot performance on all 14 test sets covering 8 languages, with up to 3.8 F1 / 5.2 EM improvement on-average, and 5.2 F1 / 11.2 EM on certain languages. Further analysis shows the improvement can be attributed to the attention on the cross-linguistically consistent syntactic path. Our code is available at https://github.com/lxucs/multilingual-mrc-isdg.

Weight Pruning Techniques for Nonlinear Impairment Compensation using Neural Networks

Neural networks (NNs) are attractive for nonlinear impairment compensation applications in communication systems, such as optical fiber nonlinearity, nonlinearity of driving amplifiers, and nonlinearity of semiconductor optical amplifiers. Without prior knowledge of the transmission link or the hardware characteristics, optimal parameters are completely constructed from a data-driven approach by exploring training datasets, once the NN structure is given. On the other hand, due to computational power and energy consumption, especially in high-speed communication systems, the computational complexity of the optimized NN needs to be confined to the hardware, such as FPGA or ASIC without sacrificing its performance improvement. In this paper, two approaches are presented to accommodate the NN-based algorithms for high-speed communication systems. The first approach is to reduce computational complexity of the NN-based nonlinearity compensation algorithms on the basis of weight pruning (WP). WP can significantly reduce the computational complexity, especially because the nonlinear compensation task studied here results in a sparse NN. The authors have studied an enhanced approach of WP by imposing an additional restriction on the selection of non-zero weights on each hidden layer. The second approach is to implement NNs onto a silicon-photonic integrated platform, enabling power efficiency to be further improved without sacrificing the high-speed operation.

Vibration Detection and Localization using Modified Digital Coherent Telecom Transponders

We demonstrate a vibration detection and localization scheme based on bidirectional transmission of telecom signals with digital coherent detection at the receivers. Optical phase is extracted from the digital signal processing blocks of the coherent receiver, from which the vibration component is extracted by bandpass filtering, and the position along the cable closest to the vibration’s epicenter is recovered by correlation. We demonstrate our scheme first using offline experiment with 200-Gb/s DP-16QAM, and we report field trial results over installed fiber to detect real-world vibration events.

Detection of Road Anomaly Using Distributed Fiber Optic Sensing

Road surface condition can significantly impact the interaction between vehicles and pavement structure, which may even cause high fuel consumption and safety issues of drivers and vehicles. Distributed fiber optic sensing (DFOS) technology is a useful tool to perform continuous and real-time monitoring of traffic and road surface condition. However, it is challenging to process the data for the purpose of road anomaly detection. The study proposed two approaches to detect the road anomaly using DFOS. In the first method, local binary pattern (LBP) histograms were used to extract the features of the images with and without road anomaly, and support vector machine (SVM) combined with principal component analysis (PCA) was adopted as the classifier. The convolutional neural network (CNN) was applied on the binary classification data to analyze the images in the second method. The accuracy and benefits of two methodologies were compared. The vehicle speed was estimated by detecting lines using Hough transform. The feasibility of road anomaly detection using DFOS is proved.

AI-Driven Applications over Telecom Networks by Distributed Fiber Optic Sensing Technologies

By employing distributed fiber optic sensing (DFOS) technologies, field deployed fiber cables can be utilized as not only communication media for data transmissions but also sensing media for continuously monitoring of the physical phenomenon along the entire route. The fiber can be used to monitor ambient environment along the route covering a wide geographic area. With help of artificial intelligence and machine learning (AI/ML) technologies on information processing, many applications can be developed over telecom networks. We review the recent field results and demonstrate how DFOS can work with existing communication channels and provide holistic view of road traffic monitoring included vehicle counts and average vehicle speeds. A long-term wide-area road traffic monitoring system is an efficient way of gathering seasonal vehicle activities which can be applied in future smart city applications. Additionally, DFOS also offers cable cut prevention functions such as cable self-protection and cable cut threat assessment. Detection and localization of abnormal events and evaluating the threat to the cable are realized to protect telecom facilities.

Confidence and Dispersity Speak – Characterizing Prediction Matrix for Unsupervised Accuracy Estimation

This work aims to assess how well a model performs under distribution shifts without using labels. While recent methods study prediction confidence, this work reports prediction dispersity is another informative cue. Confidence reflects whether the individual prediction is certain, dispersity indicates how the overall predictions are distributed across all categories. Our key insight is that a well performing model should give predictions with high confidence and high dispersity. That is, we need to consider both properties so as to make more accurate estimates. To this end, we use the nuclear norm that has been shown to be effective in characterizing both properties. Extensive experiments validate the effectiveness of nuclear norm for various models (e.g., ViT and ConvNeXt), different datasets (e.g., ImageNet and CUB 200), and diverse types of distribution shifts (e.g., style shift and reproduction shift). We show that the nuclear norm is more accurate and robust in accuracy estimation than existing methods. Furthermore, we validate the feasibility of other measurements (e.g., mutual information maximization) for characterizing dispersity and confidence. Lastly, we investigate the limitation of the nuclear norm, study its improved variant under severe class imbalance, and discuss potential directions.

A Dispersion Managed Phase Only Modulation 18 GHz Optoelectronic Oscillator

In this manuscript, we propose and experimentally demonstrate a dispersion-controlled optoelectronic oscillator with phase only modulator at 18 GHz. The generated microwave signal has a phase noise of −108 dBc/Hz at 10 kHz offset frequency and the integrated timing jitter is calculated to be 16.2 fs (1 kHz to 100 MHz) and 20 fs (1kHz to Nyquist).

Multi user Beam Alignment in Presence of Multi path

To overcome the high path loss and the intense shadowing in millimeter wave (mmWave) communications, effective beamforming schemes are required which incorporate narrow beams with high beamforming gains. The mmWave channel consists of a few spatial clusters each associated with an angle of departure (AoD). The narrow beams must be aligned with the channel AoDs to increase the beamforming gain. This is achieved through a procedure called beam alignment (BA). Most of the BA schemes in the literature consider channels with a single dominant path while in practice the channel has a few resolvable paths with different AoDs, hence, such BA schemes may not work correctly in the presence of multi path or at the least do not exploit such multipath to achieve diversity or increase robustness. In this paper, we propose an efficient BA scheme in presence of multi path. The proposed BA scheme transmits probing packets using a set of scanning beams and receives feedback for all the scanning beams at the end of the probing phase from each user. We formulate the BA scheme as minimizing the expected value of the average transmission beamwidth under different policies. The policy is defined as a function from the set of received feedback to the set of transmission beams (TB). In order to maximize the number of possible feedback sequences, we prove that the set of scanning beams (SB) has a special form, namely, Tulip Design. Consequently, we rewrite the minimization problem with a set of linear constraints and a reduced number of variables which is solved by using an efficient greedy algorithm.

Codebook Design for Composite Beamforming in Next generation mmWave Systems

In pursuance of the unused spectrum in higher frequencies, millimeter wave (mmWave) bands have a pivotal role. However, the high path loss and poor scattering associated with mmWave communications highlight the necessity of employing effective beamforming techniques. In order to efficiently search for the beam to serve a user and to jointly serve multiple users it is often required to use a composite beam which consists of multiple disjoint lobes. A composite beam covers multiple desired angular coverage intervals (ACIs) and ideally has maximum and uniform gain (smoothness) within each desired ACI, negligible gain (leakage) outside the desired ACIs, and sharp edges. We propose an algorithm for designing such ideal composite codebook by providing an analytical closed form solution with low computational complexity. There is a fundamental trade off between the gain, leakage and smoothness of the beams. Our design allows to achieve different values in such trade off based on changing the design parameters. We highlight the shortcomings of the uniform linear arrays (ULAs) in building arbitrary composite beams. Consequently, we use a recently introduced twin ULA (TULA) antenna structure to effectively resolve these inefficiencies. Numerical results are used to validate the theoretical findings.

Ordinal Quadruplet: Retrieval of Missing Labels in Ordinal Time Series

In this paper, we propose an ordered time series classification framework that is robust against missing classes in the training data, i.e., during testing we can prescribe classes that are missing during training. This framework relies on two main components: (1) our newly proposed ordinal quadruplet loss, which forces the model to learn latent representation while preserving the ordinal relation among labels, (2) testing procedure, which utilizes the property of latent representation (order preservation). We conduct experiments based on real world multivariate time series data and show the significant improvement in the prediction of missing labels even with 40% of the classes are missing from training. Compared with the well known triplet loss optimization augmented with interpolation for missing information, in some cases, we nearly double the accuracy.