Designing Multi-Robot Ground Video Sensemaking with Public Safety Professionals
Abstract
Videos from fleets of ground robots can advance public safety by providing scalable situational awareness and reducing professionals’ burden. Yet little is known about how to design multi-robot videos and integrate them into public safety workflows. Collaborating with six police agencies, we examined how such videos could be made practical for everyday operations. In Study 1, we identified 38 events-of-interest (EoI) relevant to public safety monitoring and 6 design requirements aimed at improving current video sensemaking practice. We also developed a dataset of 20 robot patrol videos (10 day, 10 night), each covering at least seven EoI types. In Study 2, we built MRVS, a tool that streams patrol videos and applies a video understanding model prompt-engineered around our EoI definitions. Participants reported reduced manual workload and greater confidence with LLM-based explanations, while noting concerns about false alarms and privacy. We conclude with implications for designing future multi-robot video sensemaking tools.
Examples of different anomalies in our video dataset, shown as sequences. The second column of each sequence is manually zoomed in.
Video Presentation
Dataset Preview
Example event sequences from the MRVS video dataset. The second column highlights manually zoomed-in regions.
BibTeX
@inproceedings{zhou2026designing,
author = {Zhou, Puqi and Asgarov, Ali and Hussain, Aafiya and Park, Wonjoon and Paudyal, Amit and Shrestha, Sameep and Tang, Chia-Wei and Lighthiser, Michael and Hieb, Michael and Xiao, Xuesu and Thomas, Chris and Hong, Sungsoo Ray},
title = {Designing Multi-Robot Ground Video Sensemaking with Public Safety Professionals},
booktitle = {Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems},
year = {2026},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
note = {To appear},
}