Real-Time, Distributed Stream Processing

INTEGRATED SYSTEMS

PROJECTS

PEOPLE

PUBLICATIONS

PATENTS

New application needs have always sparked human innovation. A decade ago, cloud computing enabled high-value enterprise services with a global reach and scale, but with several minutes or seconds of delay. Today, we stream on-demand and time-shifted HD or 4K video from the cloud with delays of hundreds of milliseconds. In the future, the need for increased efficiency and reduced latency between measurement and action will drive the development of real-time methods for feature extraction, computation and machine learning on streaming data.

Our focus is on enabling applications to make efficient use of limited computing resources in proximity to users and sensors (rather than resources in the cloud) for AI processing like feature extraction, inferencing and periodic re-training of tiny, dynamic, contextualized AI models. Such edge-cloud processing will avoid incurring 100+-millisecond delays to the cloud and ensure personal privacy of stream data used for training. But it won’t be easy to develop. Barriers include the high programming complexity of efficiently using tiers of limited computing resources (in smart devices, edge and the cloud), high processing delays due to limited edge resources and just-in-time adaptations to dynamic environments (changes in the content of data streams, number of users or ambient conditions).

Publication Tags: stream processing

Stream Processing Publications

SimCache: Similarity Caching for Efficient VLM-based Scene Understanding

June 11, 2025/ELVM Efficient Large Vision Models CVPR Workshop (2nd Edition)

Scene understanding systems analyze visual contexts by detecting objects, their attributes, and the interactions among them to provide a holistic interpretation. Understanding a scene requires analyzing multiple salient regions within a single video frame. Recently, Vision-Language Models (VLMs) have

LARA: Latency-Aware Resource Allocator for Stream Processing Applications

March 20, 2024/The 32nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2024)

One of the key metrics of interest for stream processing applications is latency, which indicates the total time it takes for the application to process and generate insights from streaming input data. For mission-critical video analytics applications like surveillance and monitoring, it is of paramount

Content-aware auto-scaling of stream processing applications on container orchestration platforms

March 1, 2023/31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2023)

Modern applications are designed as an interacting set of microservices, and these applications are typically deployed on container orchestration platforms like Kubernetes. Several attractive features in Kubernetes make it a popular choice for deploying applications, and automatic scaling is one such

Projects | Real-Time, Distributed Stream Processing

Stream Processing Publications

SimCache: Similarity Caching for Efficient VLM-based Scene Understanding

LARA: Latency-Aware Resource Allocator for Stream Processing Applications

Content-aware auto-scaling of stream processing applications on container orchestration platforms

Contact Us

About Us

Our Pages

Read Our Blog Posts