Publications Archives | Page 36 of 65

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph

March 1, 2022/in Publications/by NEC Labs America

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided MRC models, we propose to adopt the inter-sentence syntactic relations, in addition to the rudimentary intra-sentence relations, to further utilize the syntactic dependencies in the multi-sentence input of the MRC task. In our approach, we build the Inter-Sentence Dependency Graph (ISDG) connecting dependency trees to form global syntactic relations across sentences. We then propose the ISDG encoder that encodes the global dependency graph, addressing the inter-sentence relations via both one-hop and multi-hop dependency paths explicitly. Experiments on three multilingual MRC datasets (XQuAD, MLQA, TyDiQA-GoldP) show that our encoder that is only trained on English is able to improve the zero-shot performance on all 14 test sets covering 8 languages, with up to 3.8 F1 / 5.2 EM improvement on-average, and 5.2 F1 / 11.2 EM on certain languages. Further analysis shows the improvement can be attributed to the attention on the cross-linguistically consistent syntactic path. Our code is available at https://github.com/lxucs/multilingual-mrc-isdg.

AI-Driven Applications over Telecom Networks by Distributed Fiber Optic Sensing Technologies

February 21, 2022/in Publications/by NEC Labs America

By employing distributed fiber optic sensing (DFOS) technologies, field deployed fiber cables can be utilized as not only communication media for data transmissions but also sensing media for continuously monitoring of the physical phenomenon along the entire route. The fiber can be used to monitor ambient environment along the route covering a wide geographic area. With help of artificial intelligence and machine learning (AI/ML) technologies on information processing, many applications can be developed over telecom networks. We review the recent field results and demonstrate how DFOS can work with existing communication channels and provide holistic view of road traffic monitoring included vehicle counts and average vehicle speeds. A long-term wide-area road traffic monitoring system is an efficient way of gathering seasonal vehicle activities which can be applied in future smart city applications. Additionally, DFOS also offers cable cut prevention functions such as cable self-protection and cable cut threat assessment. Detection and localization of abnormal events and evaluating the threat to the cable are realized to protect telecom facilities.

Confidence and Dispersity Speak – Characterizing Prediction Matrix for Unsupervised Accuracy Estimation

February 2, 2022/in Publications/by NEC Labs America

This work aims to assess how well a model performs under distribution shifts without using labels. While recent methods study prediction confidence, this work reports prediction dispersity is another informative cue. Confidence reflects whether the individual prediction is certain, dispersity indicates how the overall predictions are distributed across all categories. Our key insight is that a well performing model should give predictions with high confidence and high dispersity. That is, we need to consider both properties so as to make more accurate estimates. To this end, we use the nuclear norm that has been shown to be effective in characterizing both properties. Extensive experiments validate the effectiveness of nuclear norm for various models (e.g., ViT and ConvNeXt), different datasets (e.g., ImageNet and CUB 200), and diverse types of distribution shifts (e.g., style shift and reproduction shift). We show that the nuclear norm is more accurate and robust in accuracy estimation than existing methods. Furthermore, we validate the feasibility of other measurements (e.g., mutual information maximization) for characterizing dispersity and confidence. Lastly, we investigate the limitation of the nuclear norm, study its improved variant under severe class imbalance, and discuss potential directions.

A Dispersion Managed Phase Only Modulation 18 GHz Optoelectronic Oscillator

February 1, 2022/in Publications/by NEC Labs America

In this manuscript, we propose and experimentally demonstrate a dispersion-controlled optoelectronic oscillator with phase only modulator at 18 GHz. The generated microwave signal has a phase noise of −108 dBc/Hz at 10 kHz offset frequency and the integrated timing jitter is calculated to be 16.2 fs (1 kHz to 100 MHz) and 20 fs (1kHz to Nyquist).

Ordinal Quadruplet: Retrieval of Missing Labels in Ordinal Time Series

January 24, 2022/in Publications/by NEC Labs America

In this paper, we propose an ordered time series classification framework that is robust against missing classes in the training data, i.e., during testing we can prescribe classes that are missing during training. This framework relies on two main components: (1) our newly proposed ordinal quadruplet loss, which forces the model to learn latent representation while preserving the ordinal relation among labels, (2) testing procedure, which utilizes the property of latent representation (order preservation). We conduct experiments based on real world multivariate time series data and show the significant improvement in the prediction of missing labels even with 40% of the classes are missing from training. Compared with the well known triplet loss optimization augmented with interpolation for missing information, in some cases, we nearly double the accuracy.

Codebook Design for Composite Beamforming in Next generation mmWave Systems

January 24, 2022/in Publications/by NEC Labs America

In pursuance of the unused spectrum in higher frequencies, millimeter wave (mmWave) bands have a pivotal role. However, the high path loss and poor scattering associated with mmWave communications highlight the necessity of employing effective beamforming techniques. In order to efficiently search for the beam to serve a user and to jointly serve multiple users it is often required to use a composite beam which consists of multiple disjoint lobes. A composite beam covers multiple desired angular coverage intervals (ACIs) and ideally has maximum and uniform gain (smoothness) within each desired ACI, negligible gain (leakage) outside the desired ACIs, and sharp edges. We propose an algorithm for designing such ideal composite codebook by providing an analytical closed form solution with low computational complexity. There is a fundamental trade off between the gain, leakage and smoothness of the beams. Our design allows to achieve different values in such trade off based on changing the design parameters. We highlight the shortcomings of the uniform linear arrays (ULAs) in building arbitrary composite beams. Consequently, we use a recently introduced twin ULA (TULA) antenna structure to effectively resolve these inefficiencies. Numerical results are used to validate the theoretical findings.

Multi user Beam Alignment in Presence of Multi path

January 24, 2022/in Publications/by NEC Labs America

To overcome the high path loss and the intense shadowing in millimeter wave (mmWave) communications, effective beamforming schemes are required which incorporate narrow beams with high beamforming gains. The mmWave channel consists of a few spatial clusters each associated with an angle of departure (AoD). The narrow beams must be aligned with the channel AoDs to increase the beamforming gain. This is achieved through a procedure called beam alignment (BA). Most of the BA schemes in the literature consider channels with a single dominant path while in practice the channel has a few resolvable paths with different AoDs, hence, such BA schemes may not work correctly in the presence of multi path or at the least do not exploit such multipath to achieve diversity or increase robustness. In this paper, we propose an efficient BA scheme in presence of multi path. The proposed BA scheme transmits probing packets using a set of scanning beams and receives feedback for all the scanning beams at the end of the probing phase from each user. We formulate the BA scheme as minimizing the expected value of the average transmission beamwidth under different policies. The policy is defined as a function from the set of received feedback to the set of transmission beams (TB). In order to maximize the number of possible feedback sequences, we prove that the set of scanning beams (SB) has a special form, namely, Tulip Design. Consequently, we rewrite the minimization problem with a set of linear constraints and a reduced number of variables which is solved by using an efficient greedy algorithm.

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

January 4, 2022/in Publications/by NEC Labs America

StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a generator to reconstruct real data lead to more disentangled latent space and make the inversion process from image to latent space easy? We describe a new methodology to train a style-based autoencoder where the encoder and generator are optimized end-to-end. We show that our proposed model consistently outperforms baselines in terms of image inversion and generation quality. Supplementary, code, and pretrained models are available on the project website.

SplitBrain: Hybrid Data and Model Parallel Deep Learning

January 3, 2022/in Publications/by NEC Labs America

The recent success of deep learning applications has coincided with those widely available powerful computational resources for training sophisticated machine learning models with huge datasets. Nonetheless, training large models such as convolutional neural networks using model parallelism (as opposed to data parallelism) is challenging because the complex nature of communication between model shards makes it difficult to partition the computation efficiently across multiple machines with an acceptable trade off. This paper presents SplitBrain, a high performance distributed deep learning framework supporting hybrid data and model parallelism. Specifically, SplitBrain provides layer specific partitioning that co locates compute intensive convolutional layers while sharding memory demanding layers. A novel scalable group communication is proposed to further improve the training throughput with reduced communication overhead. The results show that SplitBrain can achieve nearly linear speedup while saving up to 67% of memory consumption for data and model parallel VGG over CIFAR 10.

A Deep Generative Model for Molecule Optimization via One Fragment Modification

January 1, 2022/in Publications/by NEC Labs America

Molecule optimization is a critical step in drug development to improve the desired properties of drug candidates through chemical modification. We have developed a novel deep generative model, Modof, over molecular graphs for molecule optimization. Modof modifies a given molecule through the prediction of a single site of disconnection at the molecule and the removal and/or addition of fragments at that site. A pipeline of multiple, identical Modof models is implemented into Modof-pipe to modify an input molecule at multiple disconnection sites. Here we show that Modof-pipe is able to retain major molecular scaffolds, allow controls over intermediate optimization steps and better constrain molecule similarities. Modof-pipe outperforms the state-of-the-art methods on benchmark datasets. Without molecular similarity constraints, Modof-pipe achieves 81.2% improvement in the octanol–water partition coefficient, penalized by synthetic accessibility and ring size, and 51.2%, 25.6% and 9.2% improvement if the optimized molecules are at least 0.2, 0.4 and 0.6 similar to those before optimization, respectively. Modof-pipe is further enhanced into Modof-pipem to allow modification of one molecule to multiple optimized ones. Modof-pipem achieves additional performance improvement, at least 17.8% better than Modof-pipe.

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph

AI-Driven Applications over Telecom Networks by Distributed Fiber Optic Sensing Technologies

Confidence and Dispersity Speak – Characterizing Prediction Matrix for Unsupervised Accuracy Estimation

A Dispersion Managed Phase Only Modulation 18 GHz Optoelectronic Oscillator

Ordinal Quadruplet: Retrieval of Missing Labels in Ordinal Time Series

Codebook Design for Composite Beamforming in Next generation mmWave Systems

Multi user Beam Alignment in Presence of Multi path

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

SplitBrain: Hybrid Data and Model Parallel Deep Learning

A Deep Generative Model for Molecule Optimization via One Fragment Modification

Contact Us

About Us

Our Pages

Read Our Blog Posts