Jamuna S Murthy

I am an Assistant Professor at Ramaiah Institute of Technology, Bengaluru, India. My research focuses on multimodal video understanding, anomaly detection, and generative video models, with an emphasis on real-time intelligent systems and cross-modal learning.

I completed my Master’s degree at Ramaiah Institute of Technology, where my thesis titled Distributed Framework for Real-Time Twitter Sentiment Analysis and Visualization Framework(TwitSenti) received the Best Master’s Thesis Award in 2017. In 2024, I was honored with the STEM Young Researcher Award.

My work develops practical video intelligence systems for complex real-world scenarios such as FewShot-SPT, ObjectDetect, SarcasNet-99, DHF, TTM, and TwitSenti.

Email / CV / Scholar / Twitter / GitHub

Research

My research focuses on multimodal representation learning, video understanding, real-time anomaly detection, and generative models. I study how motion, behavior, semantics, and cross-modal signals from visual, textual, and audio data can be modeled for intelligent real-world systems. My broader goal is to build reliable AI systems for understanding and generating video content in safety-critical and dynamic environments. Some papers are highlighted.

	Towards Secure Video Surveillance: A Few-Shot Spatiotemporal Perception Transformer for Unseen Behavioral Anomalies Jamuna S Murthy, Dhanashekar Kandaswamy, Wen-Cheng Lai IEEE AVSS, 2025 Area:Multimodal Learning, Jetson Orin, FewShot Learning, Anomaly Detection and Transformers PDF / Code / Paper FewShot-SPT is a novel transformer-based framework for few-shot anomaly detection in surveillance systems. It introduces Event-Guided Keyframe Extraction (EGKE), Adaptive Modality Gating (AMG) for multimodal fusion, and adaptive prototypical learning for improved generalization to unseen anomalies. The model achieves 91.6% AUC on UCF-Crime and 84.2% accuracy on XD-Violence, outperforming state-of-the-art methods. Deployed on NVIDIA Jetson TX2, the system enables real-time park surveillance under edge constraints, effectively detecting incidents such as falls, intrusions, and weapon-related activities in real-world environments.
	Multimedia Video Analytics using Deep Hybrid Fusion Algorithm Jamuna S Murthy, Siddesh G M Multimedia Tools and Applications, 2025 Area: Multimodal Learning, Video Analytics and Transformers PDF / Code / Paper Introduces a Deep Hybrid Fusion (DHF) algorithm for multimodal video analytics by integrating text, image, and audio features. The framework uses BiLSTM-based modality representation and attention-based fusion to achieve superior performance (95.84% accuracy) across benchmark datasets for humor detection and multimodal understanding.
	A Smart Video Analytical Framework for Sarcasm Detection using Adaptive Fusion and SarcasNet-99 Jamuna S Murthy, Siddesh G M The Visual Computer, 2024 Area: Multimodal Learning, Sarcasm Detection and Transformers PDF / Code / Paper Proposes a multimodal sarcasm detection framework integrating text (Enhanced-BERT), image, and audio features. A novel adaptive fusion strategy combined with SarcasNet-99 achieves 97.3% accuracy on large-scale video datasets, outperforming state-of-the-art models in sarcasm detection.
	ObjectDetect: A Real-Time Object Detection Framework for Advanced Driver Assistant Systems Using YOLOv5 Jamuna S Murthy, Siddesh G M, Wen-Cheng Lai, et al. Wireless Communications and Mobile Computing, 2022 Area:Computer Vision, YOLO, LiDAR, Real-Time Systems and Autonomous Driving PDF / Code / Paper A real-time multimodal object detection framework for Advanced Driver Assistance Systems (ADAS), integrating vision and LiDAR-based perception to enable robust environmental understanding. The system leverages YOLOv5 for high-speed and high-accuracy detection, achieving ~95% accuracy while significantly improving inference speed compared to YOLOv3 and YOLOv4. Designed with principles from remote sensing and spatial perception, the framework supports reliable object detection and tracking under real-world conditions, making it suitable for autonomous driving and large-scale inspection scenarios.
	TwitSenti: A Real-Time Twitter Sentiment Analysis and Visualization Framework Jamuna S Murthy, Siddesh G M, K G Srinivasa Journal of Information & Knowledge Management, 2019 Area: NLP, Real-Time Systems and Social Media Analytics PDF / Code / Paper Early work on real-time sentiment modeling in streaming environments, integrating NLP pipelines with scalable distributed systems (Apache Storm, Redis). Focused on low-latency inference and interactive visualization for social media analytics.
	A Real-Time Twitter Trend Analysis and Visualization Framework Jamuna S Murthy, Siddesh G M, K G Srinivasa International Journal on Semantic Web and Information Systems, 2019 Area: NLP, Real-Time Systems, Social Media Analytics PDF / Code / Paper A real-time Twitter trend analysis framework that integrates streaming data processing with semantic analysis and interactive visualization. The system identifies emerging trends, tracks their momentum, and provides insights into public discourse dynamics on social media platforms.
Funding	DST Funded Project (2021-2022): FitQua Smart Water Bottle (AI-based health tracking) Built FitQua, an intelligent hydration system integrating thermal control (Peltier-based cooling & resistive heating) with sensor fusion (temperature, load, activity) to deliver AI-driven hydration monitoring
Industry Projects	Verity Labs Inc., California (AI Researcher) Duration: April 2025 – September 2025 Developed and optimized AI/ML modules for real-world applications using TensorFlow, PyTorch, and Scikit-learn. Worked on integrating machine learning models with cloud platforms (AWS, Azure, GCP) and collaborated with cross-functional teams to deploy scalable AI-driven systems. Contributed to research initiatives and production-level AI workflows. Samsung PRISM Program (Samsung R&D) Expression Correction in Photos (GenAI) Duration: Aug 2024 – Jul 2025 Developed a generative AI-based system for facial expression correction, including blinking and emotion refinement. Leveraged multimodal learning and generative models for realistic photo enhancement. Certificate NextGen AI Datastore for Video Retrieval (Vector DB) (Ongoing) Designing a scalable vector database system for multimodal video retrieval using vision-language embeddings. Focus on semantic search, indexing optimization, and real-time retrieval for large-scale video analytics.
Awards	Best Young Researcher Award – STEM Research Society (2024) Best Master Thesis Award (2017) Graduate Teaching Assistantship, MSRIT (2016-2017)
Patent	Surgeon’s Eye: Intelligent Video Analytics for Robotic Surgery (2025)
Academic Service	Reviewer: AAAI, ACM MM, ACM Multimedia Asia, CVPR, ECCV, ICLR, ICME, WACV, Springer Nature
Teaching	CS52: Artificial Intelligence and Machine Learning (Fall Semester) CS62: Cloud Computing and Big Data (Spring Semester) Department of Computer Science and Engineering, Ramaiah Institute of Technology

Research

Funding

Industry Projects

Awards

Patent

Academic Service

Teaching