|
Jamuna S Murthy
I am an Assistant Professor at
Ramaiah Institute of Technology, Bengaluru, India.
My research focuses on multimodal video understanding, anomaly detection, and generative video models,
with an emphasis on real-time intelligent systems and cross-modal learning.
I completed my Master’s degree at
Ramaiah Institute of Technology,
where my thesis titled Distributed Framework for Real-Time Twitter Sentiment Analysis and Visualization Framework(TwitSenti)
received the Best Master’s Thesis Award in 2017. In 2024, I was honored with the
STEM Young Researcher Award.
My work develops practical video intelligence systems for complex real-world scenarios such as
FewShot-SPT,
ObjectDetect,
SarcasNet-99,
DHF,
TTM,
and
TwitSenti.
Email /
CV /
Scholar /
Twitter /
GitHub
|
|
Research
My research focuses on multimodal representation learning, video understanding, real-time anomaly detection, and generative models.
I study how motion, behavior, semantics, and cross-modal signals from visual, textual, and audio data can be modeled for intelligent real-world systems.
My broader goal is to build reliable AI systems for understanding and generating video content in safety-critical and dynamic environments.
Some papers are highlighted.
|
|
Towards Secure Video Surveillance: A Few-Shot Spatiotemporal Perception Transformer for Unseen Behavioral Anomalies
Jamuna S Murthy,
Dhanashekar Kandaswamy,
Wen-Cheng Lai
IEEE AVSS, 2025
Area:Multimodal Learning, Jetson Orin, FewShot Learning, Anomaly Detection and Transformers
PDF /
Code /
Paper
FewShot-SPT is a novel transformer-based framework for few-shot anomaly detection in surveillance systems. It introduces Event-Guided Keyframe Extraction (EGKE), Adaptive Modality Gating (AMG) for multimodal fusion, and adaptive prototypical learning for improved generalization to unseen anomalies. The model achieves 91.6% AUC on UCF-Crime and 84.2% accuracy on XD-Violence, outperforming state-of-the-art methods.
Deployed on NVIDIA Jetson TX2, the system enables real-time park surveillance under edge constraints, effectively detecting incidents such as falls, intrusions, and weapon-related activities in real-world environments.
|
|
Multimedia Video Analytics using Deep Hybrid Fusion Algorithm
Jamuna S Murthy,
Siddesh G M
Multimedia Tools and Applications, 2025
Area: Multimodal Learning, Video Analytics and Transformers
PDF /
Code /
Paper
Introduces a Deep Hybrid Fusion (DHF) algorithm for multimodal video analytics by integrating
text, image, and audio features. The framework uses BiLSTM-based modality representation and
attention-based fusion to achieve superior performance (95.84% accuracy) across benchmark
datasets for humor detection and multimodal understanding.
|
|
A Smart Video Analytical Framework for Sarcasm Detection using Adaptive Fusion and SarcasNet-99
Jamuna S Murthy,
Siddesh G M
The Visual Computer, 2024
Area: Multimodal Learning, Sarcasm Detection and Transformers
PDF /
Code /
Paper
Proposes a multimodal sarcasm detection framework integrating text (Enhanced-BERT), image,
and audio features. A novel adaptive fusion strategy combined with SarcasNet-99 achieves
97.3% accuracy on large-scale video datasets, outperforming state-of-the-art models in sarcasm
detection.
|
|
ObjectDetect: A Real-Time Object Detection Framework for Advanced Driver Assistant Systems Using YOLOv5
Jamuna S Murthy,
Siddesh G M,
Wen-Cheng Lai,
et al.
Wireless Communications and Mobile Computing, 2022
Area:Computer Vision, YOLO, LiDAR, Real-Time Systems and Autonomous Driving
PDF /
Code /
Paper
A real-time multimodal object detection framework for Advanced Driver Assistance Systems (ADAS), integrating vision and LiDAR-based perception to enable robust environmental understanding. The system leverages YOLOv5 for high-speed and high-accuracy detection, achieving ~95% accuracy while significantly improving inference speed compared to YOLOv3 and YOLOv4. Designed with principles from remote sensing and spatial perception, the framework supports reliable object detection and tracking under real-world conditions, making it suitable for autonomous driving and large-scale inspection scenarios.
|
|
TwitSenti: A Real-Time Twitter Sentiment Analysis and Visualization Framework
Jamuna S Murthy,
Siddesh G M,
K G Srinivasa
Journal of Information & Knowledge Management, 2019
Area: NLP, Real-Time Systems and Social Media Analytics
PDF /
Code /
Paper
Early work on real-time sentiment modeling in streaming environments, integrating NLP pipelines with scalable distributed systems (Apache Storm, Redis). Focused on low-latency inference and interactive visualization for social media analytics.
|
|
A Real-Time Twitter Trend Analysis and Visualization Framework
Jamuna S Murthy,
Siddesh G M,
K G Srinivasa
International Journal on Semantic Web and Information Systems, 2019
Area: NLP, Real-Time Systems, Social Media Analytics
PDF /
Code /
Paper
A real-time Twitter trend analysis framework that integrates streaming data processing with semantic analysis and interactive visualization. The system identifies emerging trends, tracks their momentum, and provides insights into public discourse dynamics on social media platforms.
|
|
Funding
|
DST Funded Project (2021-2022): FitQua Smart Water Bottle (AI-based health tracking)
Built FitQua, an intelligent hydration system integrating thermal control (Peltier-based cooling & resistive heating) with sensor fusion (temperature, load, activity) to deliver AI-driven hydration monitoring
|
|
Industry Projects
|
Verity Labs Inc., California (AI Researcher)
Duration: April 2025 – September 2025
Developed and optimized AI/ML modules for real-world applications using TensorFlow, PyTorch, and Scikit-learn.
Worked on integrating machine learning models with cloud platforms (AWS, Azure, GCP) and collaborated with
cross-functional teams to deploy scalable AI-driven systems. Contributed to research initiatives and
production-level AI workflows.
Samsung PRISM Program (Samsung R&D)
Expression Correction in Photos (GenAI)
Duration: Aug 2024 – Jul 2025
Developed a generative AI-based system for facial expression correction, including blinking and emotion refinement.
Leveraged multimodal learning and generative models for realistic photo enhancement.
Certificate
NextGen AI Datastore for Video Retrieval (Vector DB)
(Ongoing)
Designing a scalable vector database system for multimodal video retrieval using vision-language embeddings.
Focus on semantic search, indexing optimization, and real-time retrieval for large-scale video analytics.
|
|
Awards
|
Best Young Researcher Award – STEM Research Society (2024)
Best Master Thesis Award (2017)
Graduate Teaching Assistantship, MSRIT (2016-2017)
|
|
Patent
|
Surgeon’s Eye: Intelligent Video Analytics for Robotic Surgery (2025)
|
|
Academic Service
|
Reviewer: AAAI, ACM MM, ACM Multimedia Asia, CVPR, ECCV, ICLR, ICME, WACV, Springer Nature
|
|
Teaching
|
CS52: Artificial Intelligence and Machine Learning (Fall Semester)
CS62: Cloud Computing and Big Data (Spring Semester)
Department of Computer Science and Engineering, Ramaiah Institute of Technology
|
|