Overview: In problems with a large number of labels, most multi-label and multi-class techniques incur a significant computational burden at test time. This is because, for each test instance, they need to systematically evaluate every label to decide whether it is relevant for the instance or not.
Overview: This research project is at the forefront of integrating advanced Large Language Models (LLMs) into the process of deriving actionable insights from vast and complex document repositories. This initiative focuses on creating a system that allows users to interact with the LLM, guiding its analysis and refining the results based on the user’s expertise and evolving needs.
Overview: We have developed a system for real-time scene understanding and reasoning across various domains such as safety, manufacturing, retail, healthcare, and personal assistance. This system continuously monitors and analyzes video, acoustics, and time series data related to human activities, aiming to provide a comprehensive understanding of ongoing situations. The project utilizes advanced AI models to process large volumes of data, generating actionable insights that help users grasp complex scenarios.
Overview: The Trustworthy Generative AI Project is focused on developing advanced multimodal generative models that can create and reason with content across text, images, reports, and 3D videos. These models are designed for applications in advertisement, entertainment, law enforcement, and healthcare.