ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG-based Video Analysis System
Retrieval-augmented generation (RAG) is used in natural language processing (NLP) to provide query-relevant information in enterprise documents to large language models (LLMs). Such enterprise context enables the LLMs to generate more informed and accurate responses. When enterprise data is primarily