Integrated Systems | Publications

Publications

LLM-based Distributed Code Generation and Cost-Efficient Execution in the Cloud

April 6, 2025/The Sixteenth International Conference on Cloud Computing, GRIDs, and Virtualization (Cloud Computing 2025)

The advancement of Generative Artificial Intelligence (AI), particularly Large Language Models (LLMs), is reshaping the software industry by automating code generation. Many LLM-driven distributed processing systems rely on serial code generation constrained by predefined libraries, limiting flexibility

Real-Time Network-Aware Roadside LiDAR Data Compression

April 2, 2025/Vehicle Technology and Intelligent Transport Systems (VEHITS), 2025

LiDAR technology has emerged as a pivotal tool in Intelligent Transportation Systems (ITS), providing unique capabilities that have significantly transformed roadside traffic applications. However, this transformation comes with a distinct challenge: the immense volume of data generated by LiDAR sensors.

CAMTUNER: Adaptive Video Analytics Pipelines via Real-time Automated Camera Parameter Tuning

March 31, 2025/IEEE Transactions on Mobile Computing Journal

In Video Analytics Pipelines (VAP), Analytics Units (AUs) such as object detection and face recognition operating on remote servers rely heavily on surveillance cameras to capture high-quality video streams to achieve high accuracy. Modern network cameras offer an array of parameters that directly influence

Optimal Single-User Interactive Beam Alignment with Feedback Delay

March 25, 2025/arXiv

Communication in Millimeter wave (mmWave) band relies on narrow beams due to directionality, high path loss, and shadowing. One can use beam alignment (BA) techniques to find and adjust the direction of these narrow beams. In this paper, BA at the base station (BS) is considered, where the BS sends a

EdgeSync: Efficient Edge-Assisted Video Analytics via Network Contention-Aware Scheduling

March 17, 2025/4th IEEE Workshop on Pervasive and Resource-constrained Artificial Intelligence (PeRConAI 2025) - part of IEEE Percom 2025

With the advancement of 5G, edge-assisted video analytics has become increasingly popular, driven by the technology's ability to support low-latency, high-bandwidth applications. However, in scenarios where multiple clients compete for network resources, network contention poses a significant challenge.

G-Litter Marine Litter Dataset Augmentation with Diffusion Models and Large Language Models on GPU Acceleration

March 12, 2025/Applications, Libraries, and Tools for Computational Science and Machine Learning on Heterogeneous HPC Environments Workshop at PDP 2025

Marine litter detection is crucial for environmental monitoring, yet the imbalance in existing datasets limits model performance in identifying various types of waste accurately. This paper presents an efficient data augmentation pipeline that combines generative diffusion models (e.g., Stable Diffusion)

RAG-check: Evaluating Multimodal Retrieval Augmented Generation Performance

January 7, 2025/arXiv

Retrieval-augmented generation (RAG) improves large language models (LLMs) by using external knowledge to guide response generation, reducing hallucinations. However, RAG, particularly multi-modal RAG, can introduce new hallucination sources: (i) the retrieval process may select irrelevant pieces (e.g.,

Re-ranking the Context for Multimodal Retrieval Augmented Generation

January 6, 2025/arXiv

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge to generate a response within a context with improved accuracy and reduced hallucinations. However, multi-modal RAG systems face unique challenges: (i) the retrieval process may select irrelevant

DiCE-M: Distributed Code Generation and Execution for Marine Applications – An Edge-Cloud Approach

December 7, 2024/International Workshop on Edge Intelligence in conjunction with ACM SEC 2024

Edge computing has emerged as a transformative technology that reduces application latency, improves cost efficiency, enhances security, and enables large-scale deployment of applications across various domains. In environmental monitoring, systems such as MegaSense [49] use low-cost sensors to gather

DiCE: Distributed Code generation and Execution

November 5, 2024/The 22nd IEEE International Conference on Pervasive Intelligence and Computing (PICom 2024)

Generative artificial intelligence (GenAI), specifically, Large Language Models (LLMs), have shown tremendous potential in automating several tasks and improving human productivity. Recent works have shown them to be quite useful in writing and summarizing text (articles, blogs, poems, stories, songs,

Transformer-Aided Semantic Communications

October 27, 2024/Asilomar Conference on Signals, Systems, and Computers

The transformer structure employed in large language models (LLMs), as a specialized category of deep neural networks (DNNs) featuring attention mechanisms, stands out for its ability to identify and highlight the most relevant aspects of input data. Such a capability is particularly beneficial in

iRAG: Advancing RAG for Videos with an Incremental Approach

October 21, 2024/The 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024)

Retrieval-augmented generation (RAG) systems combine the strengths of language generation and information retrieval to power many real-world applications like chatbots. Use of RAG for understanding of videos is appealing but there are two critical limitations. One-time, upfront conversion of all content

TrafficLens: Multi-Camera Traffic Video Analysis Using LLMs

September 24, 2024/27th IEEE International Conference on Intelligent Transportation Systems (ITSC 2024)

Traffic cameras are essential in urban areas, playing a crucial role in intelligent transportation systems. Multiple cameras at intersections enhance law enforcement capabilities, traffic management, and pedestrian safety. However, efficiently managing and analyzing multi-camera feeds poses challenges

Knowledge-enhanced Prompt Learning for Open-domain Commonsense Reasoning

July 3, 2024/NEC Technical Journal, Special Issue on Revolutionizing Business Practices with Generative AI

Neural language models for commonsense reasoning often formulate the problem as a QA task and make predictions based on learned representations of language after fine-tuning. However, without providing any fine-tuning data and pre-defined answer candidates, can neural language models still answer commonsense

Optimizing LLM API usage costs with novel query-aware reduction of relevant enterprise data

July 3, 2024/NEC Technical Journal, Special Issue on Revolutionizing Business Practices with Generative AI

Costs of LLM API usage rise rapidly when proprietary enterprise data is used as context for user queries to generate more accurate responses from LLMs. To reduce costs, we propose LeanContext, which generates query-aware, compact and AI model-friendly summaries of relevant enterprise data context. This

ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG-based Video Analysis System

June 17, 2024/Multimodal Algorithmic Reasoning (MAR) in conjunction with CVPR 2024

Retrieval-augmented generation (RAG) is used in natural language processing (NLP) to provide query-relevant information in enterprise documents to large language models (LLMs). Such enterprise context enables the LLMs to generate more informed and accurate responses. When enterprise data is primarily

A Perspective on Deep Vision Performance with Standard Image and Video Codecs

June 17, 2024/AIS: Vision, Graphics and AI for Streaming Workshop at CVPR 2024

Resource-constrained hardware such as edge devices or cell phones often relies on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile device to a cloud server requires coding to deal with network

Deep Video Codec Control for Vision Models

June 17, 2024/AIS: Vision, Graphics and AI for Streaming Workshop at CVPR 2024

Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constraints. However, standard video codecs (e.g. H.264) and their rate control modules aim to

Deep Learning-Based Real-Time Quality Control of Standard Video Compression for Live Streaming

June 9, 2024/IEEE International Conference on Communication (ICC 2024)

Ensuring high-quality video content for wireless users has become increasingly vital. Nevertheless, maintaining a consistent level of video quality faces challenges due to the fluctuating encoded bitrate, primarily caused by dynamic video content, especially in live streaming scenarios. Video compression

ECO-LLM: LLM-based Edge Cloud Optimization

June 3, 2024/AI4Sys '24 at HPDC 2024

AI/ML techniques have been used to solve systems problems, but their applicability to customize solutions on-the-fly has been limited. Traditionally, any customization required manually changing the AI/ML model or modifying the code, configuration parameters, application settings, etc. This incurs too

StreamingRAG: Real-time Contextual Retrieval and Generation Framework

June 3, 2024/AI4Sys '24 at HPDC 2024

Extracting real-time insights from multi-modal data streams from various domains such as healthcare, intelligent transportation, and satellite remote sensing remains a challenge. High computational demands and limited knowledge scope restrict the applicability of Multi-Modal Large Language Models (MM-LLMs)

LeanContext: Cost-efficient Domain-specific Question Answering Using LLMs

June 1, 2024/Natural Language Processing

Question-answering (QA) is a significant application of Large Language Models (LLMs), shaping chatbot capabilities across healthcare, education, and customer service. However, widespread LLM integration presents a challenge for small businesses due to the high expenses of LLM API usage. Costs rise rapidly

Deep Learning-Based Real-Time Rate Control for Live Streaming on Wireless Networks

May 8, 2024/IEEE International Conference on Machine Learning for Communication and Networking (IEEE ICMLCN 2024)

Providing wireless users with high-quality video content has become increasingly important. However, ensuring consistent video quality poses challenges due to variable encoded bitrate caused by dynamic video content and fluctuating channel bitrate caused by wireless fading effects. Suboptimal selection

CLAP: Cost and Latency-Aware Placement of Microservices on the Computing Continuum

May 6, 2024/2nd International Workshop on Urgent Analytics for the Computing Continuum (QUICK '24 co-located with CCGrid 2024)

For microservices-based real-time stream processing applications, computing at the edge delivers fast responses for low workloads, but as workload increases, the response time starts to slow down due to limited compute capacity. Abundant compute capacity in the cloud delivers fast responses even for

iRAG: An Incremental Retrieval Augmented Generation System for Videos

April 24, 2024/https://arxiv.org

Retrieval augmented generation (RAG) systems combine the strengths of language generation and information retrieval to power many real-world applications like chatbots. Use of RAG for combined understanding of multimodal data such as text, images and videos is appealing but two critical limitations exist:

LARA: Latency-Aware Resource Allocator for Stream Processing Applications

March 20, 2024/The 32nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2024)

One of the key metrics of interest for stream processing applications is latency, which indicates the total time it takes for the application to process and generate insights from streaming input data. For mission-critical video analytics applications like surveillance and monitoring, it is of paramount

Improving Real-time Data Streams Performance on Autonomous Surface Vehicles using DataX

March 20, 2024/The 32nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2024)

In the evolving Artificial Intelligence (AI) era, the need for real-time algorithm processing in marine edge environments has become a crucial challenge. Data acquisition, analysis, and processing in complex marine situations require sophisticated and highly efficient platforms. This study optimizes

Enabling Cooperative Hybrid Beamforming in TDD-based Distributed MIMO Systems

January 6, 2024/IEEE Consumer Communications & Networking Conference (IEEE CCNC 2024)

Distributed massive MIMO networks are envisioned to realize cooperative multi-point transmission in next-generation wireless systems. For efficient cooperative hybrid beamforming, the cluster of access points (APs) needs to obtain precise estimates of the uplink channel to perform reliable downlink precoding.

Differentiable JPEG: The Devil is in The Details

January 3, 2024/IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

JPEG remains one of the most widespread lossy image coding methods. However, the non-differentiable nature of JPEG restricts the application in deep learning pipelines. Several differentiable approximations of JPEG have recently been proposed to address this issue. This paper conducts a comprehensive

Scale Up while Scaling Out Microservices in Video Analytics Pipelines

December 18, 2023/Performance Optimization and Auto-Tuning of Software on Multicore/Manycore Systems (POAT 2023), Singapore

Modern video analytics applications comprise multiple microservices chained together as pipelines and executed on container orchestration platforms like Kubernetes. Kubernetes automatically handles the scaling of these microservices for efficient application execution. There are two popular choices for

Semantic Multi-Resolution Communications

December 4, 2023/IEEE Globecom 2023 - 3rd Workshop on Semantic Communication for 6G

Deep learning based joint source-channel coding (JSCC) has demonstrated significant advancements in data reconstruction compared to separate source-channel coding (SSCC). This superiority arises from the suboptimality of SSCC when dealing with finite block-length data. Moreover, SSCC falls short in reconstructing

Blind Cyclic Prefix-based CFO Estimation in MIMO-OFDM Systems

December 4, 2023/IEEE Global Communications Conference (Globecom 2023)

Low-complexity estimation and correction of carrier frequency offset (CFO) are essential in orthogonal frequency division multiplexing (OFDM). In this paper, we propose a low overhead blind CFO estimation technique based on cyclic prefix (CP), in multi-input multi-output (MIMO)-OFDM systems. We propose

Citizen Science for the Sea with Information Technologies: An Open Platform for Gathering Marine Data and Marine Litter Detection from Leisure Boat Instruments

October 9, 2023/IEEE eScience 2023

Data crowdsourcing is an increasingly pervasive and lifestyle-changing technology due to the flywheel effect that results from the interaction between the Internet of Things and Cloud Computing. This paper presents the Citizen Science for the Sea with Information Technologies (C4Sea-IT) framework. It

Deep Video Codec Control

August 29, 2023/https://arxiv.org

Lossy video compression is commonly used when transmitting and storing video data. Unified video codecs (e.g., H.264 or H.265) remain the de facto standard, despite the availability of advanced (neural) compression approaches. Transmitting videos in the

Retrospective: A Dynamically Configurable Coprocessor For Convolutional Neural Networks

July 1, 2023/ISCA@50 Retrospective: 1996-2020

In 2008, parallel computing posed significant challenges due to the complexities of parallel programming and the bottlenecks associated with efficient parallel execution. Inspired by the remarkable scalability achieved by networking and storage systems in handling extensive packet traffic and persistent

FactionFormer: Context-Driven Collaborative Vision Transformer Models for Edge Intelligence

June 26, 2023/8th IEEE International Workshop on Smart Service Systems SmartSys 2023 (co-located with SMARTCOMP 2023)

Edge Intelligence has received attention in recent times for its potential towards improving responsiveness, reducing the cost of data transmission, enhancing security and privacy, and enabling autonomous decisions by edge devices. However, edge devices lack the power and compute resources necessary

Elixir: A System To Enhance Data Quality For Multiple Analytics On A Video Stream

June 26, 2023/International Conference on Smart Computing (IEEE SMARTCOMP 2023)

IoT sensors, especially video cameras, are ubiquitously deployed around the world to perform a variety of computer vision tasks in several verticals including retail, healthcare, safety and security, transportation, manufacturing, etc. To amortize their high deployment effort and cost, it is desirable

AnB: Application-In-A-Box To Rapidly Deploy and Self-Optimize 5G Apps

June 26, 2023/International Conference on Smart Computing (SMARTCOMP 2023)

We present the Application in a Box (AnB) product concept aimed at simplifying the deployment and operation of remote 5G applications. AnB comes pre-configured with all necessary hardware and software components, including sensors like cameras, hardware and software components for a local 5G wireless network,

StreetAware: A High-Resolution Synchronized Multimodal Urban Scene Dataset

April 3, 2023/Sensors

Access to high-quality data is an important barrier in the digital analysis of urban settings, including applications within computer vision and urban design. Diverse forms of data collected from sensors in areas of high activity in the urban environment, particularly at street intersections, are valuable

RIS-aided mmWave Beamforming for Two-way Communications of Multiple Pairs

March 31, 2023/ITU Journal on Future and Evolving Technologies (ITU J-FET), Special issue on Intelligent Surfaces and their Applications

Millimeter‑wave (mmWave) communications is a key enabler towards realizing enhanced Mobile Broadband (eMBB) as a key promise of 5G and beyond, due to the abundance of bandwidth available at mmWave bands. An mmWave coverage map consists of blind spots due to shadowing and fading especially in dense

Channel Reciprocity Calibration for Hybrid Beamforming in Distributed MIMO Systems

March 26, 2023/IEEE Wireless Communications and Networking Conference (WCNC 2023), Glasgow, Scotland, UK

Time Division Duplex (TDD)-based distributed massive MIMO systems are envisioned as a candidate solution for the physical layer of 6G multi-antenna systems supporting cooperative hybrid beamforming that heavily relies on the obtained uplink channel estimates for efficient coherent downlink precoding. However,

Content-aware auto-scaling of stream processing applications on container orchestration platforms

March 1, 2023/31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2023)

Modern applications are designed as an interacting set of microservices, and these applications are typically deployed on container orchestration platforms like Kubernetes. Several attractive features in Kubernetes make it a popular choice for deploying applications, and automatic scaling is one such

DyCo: Dynamic, Contextualized AI Models

December 30, 2022/ACM Transactions on Embedded Computing Systems

Devices with limited computing resources use smaller AI models to achieve low-latency inferencing. However, model accuracy is typically much lower than the accuracy of a bigger model that is trained and deployed in places where the computing resources are relatively abundant. We describe DyCo, a novel

DataX Allocator: Dynamic resource management for stream analytics at the Edge

November 29, 2022/The 9th International Conference on Internet of Things: Systems, Management and Security (IOTSMS 2022)

Serverless edge computing aims to deploy and manage applications so that developers are unaware of challenges associated with dynamic management, sharing, and maintenance of the edge infrastructure. However, this is a non-trivial task because the resource usage by various edge applications varies based

APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning

November 29, 2022/The 9th International Conference on Internet of Things: Systems, Management and Security (IOTSMS 2022)

Cameras are increasingly being deployed in cities, enterprises and roads world-wide to enable many applications in public safety, intelligent transportation, retail, healthcare and manufacturing. Often, after initial deployment of the cameras, the environmental conditions and the scenes around these

Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning

November 7, 2022/The 20th ACM Conference on Embedded Networked Sensor Systems (SenSys 2022)

In Video Analytics Pipelines (VAP), Analytics Units (AUs) such as object detection and face recognition running on remote servers critically rely on surveillance cameras to capture high-quality video streams in order to achieve high accuracy. Modern IP cameras come with a large number of camera parameters

The Trade-off between Scanning Beam Penetration and Transmission Beam Gain in mmWave Beam Alignment

October 30, 2022/56th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA

Beam search algorithms have been proposed to align the beams from an access point to a user equipment. The process relies on sending beams from a set of scanning beams (SB) and tailoring a transmission beam (TB) using the received feedback. In this paper, we discuss a fundamental trade-off between the

Why is the video analytics accuracy fluctuating, and what can we do about it?

October 23, 2022/ECCV 2022 Workshop on Adversarial Robustness in the Real World

It is a common practice to think of a video as a sequence of images (frames), and re-use deep neural network models that are trained only on images for similar analytics tasks on videos. In this paper, we show that this “leap of faith” that deep learning models that work well on images will also

Efficient Compression Method for Roadside LiDAR Data

October 17, 2022/31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

Roadside LiDAR (Light Detection and Ranging) sensors are recently being explored for intelligent transportation systems aiming at safer and faster traffic management and vehicular operations. A key challenge in such systems is to efficiently transfer massive point-cloud data from the roadside LiDAR devices

5GLoR: 5G LAN Orchestration for Enterprise IoT Applications

October 12, 2022/IEEE Future Networks World Forum 2022

5G-LAN is an enterprise local area network (LAN) that leverages 5G technology for wireless connectivity instead of WiFi. 5G technology is unique: it uses network slicing to distinguish customers in the same traffic class using new QoS technologies in the RF domain. This unique ability is not supported

DataXc: Flexible and efficient communication in microservices-based stream analytics pipelines

September 12, 2022/The 20th IEEE International Conference on Pervasive Intelligence and Computing (PICom 2022)

A big challenge in changing a monolithic application into a performant microservices-based application is the design of efficient mechanisms for microservices to communicate with each other. Prior proposals range from custom point-to-point communication among microservices using protocols like gRPC to

RoVaR: Robust Multi-agent Tracking through Dual-layer Diversity in Visual and RF Sensor Fusion

September 1, 2022/UbiComp 2023 (IMWUT Journal)

The plethora of sensors in our commodity devices provides a rich substrate for sensor-fused tracking. Yet, today’s solutions are unable to deliver robust and high tracking accuracies across multiple agents in practical, everyday environments – a feature central to the future of immersive and collaborative

Cosine Similarity based Few-Shot Video Classifier with Attention-based Aggregation

August 22, 2022/26th International Conference on Pattern Recognition (ICPR 2022)

Meta learning algorithms for few-shot video recognition use complex, episodic training but they often fail to learn effective feature representations. In contrast, we propose a new and simpler few-shot video recognition method that does not use meta-learning, but its performance compares well with the

Application-specific, Dynamic Reservation of 5G Compute and Network Resources by using Reinforcement Learning

August 22, 2022/ACM SIGCOMM 2022 Workshop on Network-Application Integration (NAI 2022)

5G services and applications explicitly reserve compute and network resources in today’s complex and dynamic infrastructure of multi-tiered computing and cellular networking to ensure application-specific service quality metrics, and the infrastructure providers charge the 5G services for the resources

Mosaic: Leveraging Diverse Reflector Geometries for Omnidirectional Around-Corner Automotive Radar

July 1, 2022/The 20th ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2022)

A large number of traffic collisions occur as a result of obstructed sight lines, such that even an advanced driver assistance system would be unable to prevent the crash. Recent work has proposed the use of around-the-corner radar systems to detect vehicles, pedestrians, and other road users in these

Chimera: Context-Aware Splittable Deep Multitasking Models for Edge Intelligence

June 20, 2022/SMARTCOMP 2022

Design of multitasking deep learning models has mostly focused on improving the accuracy of the constituent tasks, but the challenges of efficiently deploying such models in a device-edge collaborative setup (that is common in 5G deployments) have not been investigated. Towards this end, in this paper,

Codebook Design for Hybrid Beamforming in 5G Systems

May 16, 2022/IEEE International Conference on Communications (ICC 2022)

Massive MIMO and hybrid beamforming are among the key physical layer technologies for the next generation wireless systems. In the last stage of the hybrid beamforming, the goal is to generate a sharp beam with maximal and preferably uniform gain. We highlight the shortcomings of uniform linear arrays

ROMA: Resource Orchestration for Microservices-based 5G Applications

April 25, 2022/IEEE/IFIP Network Operations and Management Symposium (NOMS 2022)

With the growth of 5G, Internet of Things (IoT), edge computing and cloud computing technologies, the infrastructure (compute and network) available to emerging applications (AR/VR, autonomous driving, industry 4.0, etc.) has become quite complex. There are multiple tiers of computing (IoT devices, near

Opportunistic Temporal Fair Mode Selection and User Scheduling in Full-Duplex Systems

April 16, 2022/JSAC: IEEE Communications Society: Journal of Selected Areas in Communications - Special Issue on Next Generation Multiple Access

In-band full-duplex (FD) communication has emerged as one of the promising techniques to improve data rates in next generation wireless systems. Typical FD scenarios considered in the literature assume FD base stations (BSs) and half-duplex (HD) users activated either in uplink (UL) or downlink (DL),

DataXe: A System for Application Self-optimization in Serverless Edge Computing Environments

March 21, 2022/First Workshop on Serverless Computing for Pervasive Cloud-Edge-Device Systems and Services (STARLESS '22)

A key barrier to building performant, remotely managed and self-optimizing multi-sensor, distributed stream processing edge applications is high programming complexity. We recently proposed DataX [1], a novel platform that improves programmer productivity by enabling easy exchange, transformations, and

Multi-user Beam Alignment in Presence of Multi-path

March 9, 2022/56th Annual Conference on Information Sciences and Systems (CISS 2022)

To overcome the high pathloss and the intense shadowing in millimeter-wave (mmWave) communications, effective beamforming schemes are required which incorporate narrow beams with high beamforming gains. The mmWave channel consists of a few spatial clusters each associated with an angle of departure (AoD).

Codebook Design for Composite Beamforming in Next generation mmWave Systems

January 24, 2022/arXiv

In pursuance of the unused spectrum in higher frequencies, millimeter wave (mmWave) bands have a pivotal role. However, the high path loss and poor scattering associated with mmWave communications highlight the necessity of employing effective beamforming techniques. In order to efficiently search for

AQuA: Analytical Quality Assessment for Optimizing Video Analytics Systems

December 15, 2021/The Sixth ACM/IEEE Symposium on Edge Computing (SEC 2021)

Millions of cameras at the edge are being deployed to power a variety of different deep learning applications. However, the frames captured by these cameras are not always pristine – they can be distorted due to lighting issues, sensor noise, compression, etc. Such distortions not only deteriorate visual

Edge-based fever screening system over private 5G

December 14, 2021/The Sixth ACM/IEEE Symposium on Edge Computing (SEC 2021)

Edge computing and 5G have made it possible to perform analytics closer to the source of data and achieve super-low latency response times, which isn’t possible with centralized cloud deployment. In this paper, we present a novel fever screening system, which uses edge machine learning techniques and

Shaping mmWave Wireless Channel via Multi-Beam Design using Reconfigurable Intelligent Surfaces

December 7, 2021/IEEE Globecom - Workshop on Reconfigurable Intelligent Surfaces for Future Wireless Communications

Millimeter-wave (mmWave) communications is considered as a key enabler towards the realization of next-generation wireless networks, due to the abundance of available spectrum at mmWave frequencies. However, mmWave suffers from high free-space path-loss and poor scattering resulting in mostly line-of-sight

Magic-Pipe: Self-optimizing video analytics pipelines

December 6, 2021/Middleware 2021

Microservices-based video analytics pipelines routinely use multiple deep convolutional neural networks. We observe that the best allocation of resources to deep learning engines (or microservices) in a pipeline, and the best configuration of parameters for each engine vary over time, often at a timescale

SmartSlice: Dynamic, Self-optimization of Application’s QoS requests to 5G networks

December 6, 2021/The 5th International Symposium on 5G Emerging Technologies (5GET 2021)

Applications can tailor a network slice by specifying a variety of QoS attributes related to application-specific performance, function or operation. However, some QoS attributes like guaranteed bandwidth required by the application do vary over time. For example, network bandwidth needs of video streams

CamTuner: Reinforcement Learning based System for Camera Parameter Tuning to enhance Analytics

October 26, 2021/arXiv

Video analytics systems critically rely on video cameras, which capture high quality video frames, to achieve high analytics accuracy. Although modern video cameras often expose tens of configurable parameter settings that can be set by end users, deployment of surveillance cameras today often uses a

UAC: An Uncertainty-Aware Face Clustering Algorithm

October 11, 2021/IEEE/CVF International Conference on Computer Vision (ICCV) RLQ Workshop

We investigate ways to leverage uncertainty in face images to improve the quality of the face clusters. We observe that popular clustering algorithms do not produce better quality clusters when clustering probabilistic face representations that implicitly model uncertainty – these algorithms predict

AppSlice: A system for application-centric design of 5G and edge computing applications

October 6, 2021/12th International Conference on Network of the Future (NoF 2021)

Applications that use edge computing and 5G to improve response times consume both compute and network resources. However, 5G networks manage only network resources without considering the application’s compute requirements, and container orchestration frameworks manage only compute resources without

DataX: A system for Data eXchange and transformation of streams

September 26, 2021/The 14th International Symposium on Intelligent Distributed Computing (IDC 2021)

The exponential growth in smart sensors and rapid progress in 5G networks are creating a world awash with data streams. However, a key barrier to building performant multi-sensor, distributed stream processing applications is high programming complexity. We propose DataX, a novel platform that improves

F3S: Free Flow Fever Screening

August 23, 2021/7th IEEE International Conference on Smart Computing (SMARTCOMP 2021)

Identification of people with elevated body temperature can reduce or dramatically slow down the spread of infectious diseases like COVID-19. We present a novel fever-screening system, F3S, that uses edge machine learning techniques to accurately measure core body temperatures of multiple individuals

SkyHAUL: A Self-Organizing Gigabit Network In The Sky

July 26, 2021/ACM Mobihoc 2021

We design and build SkyHaul, the first large-scale, self-organizing network of Unmanned Aerial Vehicles (UAVs) that are connected using a mmWave wireless mesh backhaul. While the use of a mmWave backhaul paves the way for a new class of bandwidth-intensive, latency-sensitive cooperative applications

On Single-User Interactive Beam Alignment in Millimeter Wave Systems: Impact of Feedback Delay

July 12, 2021/The IEEE International Symposium on Information Theory (IEEE ISIT 2021)

Narrow beams are key to wireless communications in millimeter wave frequency bands. Beam alignment (BA) allows the base station (BS) to adjust the direction and width of the beam used for communication. During BA, the BS transmits a number of scanning beams covering different angular regions. The goal

SpaceBeam: LiDAR-Driven One-Shot mmWave Beam Management

June 24, 2021/19th ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2021)

mmWave 5G networks promise to enable a new generation of networked applications requiring a combination of high throughput and ultra-low latency. However, in practice, mmWave performance scales poorly for large numbers of users due to the significant overhead required to manage the highly-directional

ECO: Edge-Cloud Optimization of 5G applications

May 10, 2021/The 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid 2021), Melbourne, Victoria, Australia

Centralized cloud computing with 100+ milliseconds network latencies cannot meet the tens of milliseconds to sub-millisecond response times required for emerging 5G applications like autonomous driving, smart manufacturing, tactile internet, and augmented or virtual reality. We describe a new, dynamic

Multi-user Beam Alignment for Millimeter Wave Systems in Multi-path Environments

November 1, 2020/54th Annual Asilomar Conference on Signals, Systems, and Computers

Directional transmission patterns (a.k.a. narrow beams) are the key to wireless communications in millimeter wave (mmWave) frequency bands, which suffer from high path loss, severe shadowing, and intense blockage. In addition, the propagation channel in mmWave frequencies incorporates only a small number

Redefining Passive in Backscattering with Commodity Devices

September 21, 2020/The 26th Annual International Conference on Mobile Computing and Networking (MobiCom 2020)

The recent innovation of frequency-shifted (FS) backscatter allows for backscattering with commodity devices, which are inherently half-duplex. However, their reliance on oscillators for generating the frequency-shifting signal on the tag forces them to incur the transient phase of the oscillator before

RFGo: A Seamless Self-checkout System for Apparel Stores Using RFID

September 21, 2020/The 26th Annual International Conference on Mobile Computing and Networking (MobiCom 2020)

Retailers are aiming to enhance customer experience by automating the checkout process. The key impediment here is the effort to manually align the product barcode with the scanner, requiring sequential handling of items without blocking the line-of-sight of the laser beam. While recent systems such

DeepTrack: Grouping RFID Tags Based on Spatio-temporal Proximity in Retail Spaces

July 6, 2020/IEEE International Conference on Computer Communications (IEEE Infocom 2020)

RFID applications for taking inventory and processing transactions in point-of-sale (POS) systems improve operational efficiency but are not designed to provide insights about customers’ interactions with products. We bridge this gap by solving the proximity grouping problem to identify groups of RFID

On Optimal Multi-user Beam Alignment in Millimeter Wave Wireless Systems

June 21, 2020/2020 IEEE International Symposium on Information Theory (ISIT 2020)

Directional transmission patterns (a.k.a. narrow beams) are the key to wireless communications in millimeter wave (mmWave) frequency bands, which suffer from high path loss and severe shadowing. In addition, the propagation channel in mmWave frequencies incorporates only a small number of spatial clusters

Beam Training Optimization in Millimeter-wave Systems under Beamwidth, Modulation and Coding Constraints

September 9, 2019/IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC 2019)

Millimeter-wave (mmWave) bands have the potential to enable significantly high data rates in wireless systems. In order to overcome intense path loss and severe shadowing in these bands, it is essential to employ directional beams for data transmission. Furthermore, it is known that the mmWave channel

Opportunistic Temporal Fair Mode Selection and User Scheduling for Full-duplex Systems

September 9, 2019/IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC 2019)

In-band full-duplex (FD) communications – enabled by recent advances in antenna and RF circuit design – has emerged as one of the promising techniques to improve data rates in wireless systems. One of the major roadblocks in enabling high data rates in FD systems is the inter-user interference (IUI)

Robust Beam Tracking and Data Communication in Millimeter Wave Mobile Networks

June 3, 2019/The International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt 2019)

Millimeter-wave (mmWave) bands have shown the potential to enable high data rates for next generation mobile networks. In order to cope with high path loss and severe shadowing in mmWave frequencies, it is essential to employ massive antenna arrays and generate narrow transmission patterns (beams). When

TrackIO: Tracking First Responders Inside-Out

February 26, 2019/16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 2019)

First responders, a critical lifeline of any society, often find themselves in precarious situations. The ability to track them in real-time in unknown indoor environments would significantly contribute to the success of their mission as well as their safety. In this work, we present the design, implementation

SkyRAN: A Self-Organizing LTE RAN in the Sky

December 4, 2018/The 14th International Conference on emerging Networking EXperiments and Technologies (ACM CoNEXT 2018)

We envision a flexible, dynamic airborne LTE infrastructure built upon Unmanned Autonomous Vehicles (UAVs) that will provide on-demand, on-time, network access, anywhere. In this paper, we design, implement and evaluate SkyRAN, a self-organizing UAV-based LTE RAN (Radio Access Network) that is a key

SkyCore: Moving Core to the Edge for Untethered and Reliable UAV-based LTE Networks

October 29, 2018/**BEST PAPER AWARD** The 24th Annual International Conference on Mobile Computing and Networking (MobiCom 2018)

The advances in unmanned aerial vehicle (UAV) technology have empowered mobile operators to deploy LTE base stations (BSs) on UAVs, and provide on-demand, adaptive connectivity to hotspot venues as well as emergency scenarios. However, today’s evolved packet core (EPC) that orchestrates the LTE RAN faces

ELI: Empowering LTE with Interference Awareness in Unlicensed Spectrum

September 24, 2018/The 26th IEEE International Conference on Network Protocols (ICNP 2018)

The advent of LTE into the unlicensed spectrum has necessitated the understanding of its operational efficiency when sharing spectrum with different radio access technologies. Our study reveals that LTE, owing to its inherent transmission characteristics, suffers significant performance degradation in

SkyLiTE: End-to-End Design of Low-altitude UAV Networks for Providing LTE Connectivity

January 19, 2018/arXiv

Unmanned aerial vehicles (UAVs) have the potential to change the landscape of wide-area wireless connectivity by bringing it to areas where connectivity was sparse or non-existent (e.g., rural areas) or has been compromised due to disasters. While Google’s Project Loon and Facebook’s Project Aquila