Info
- Title: MR. Video: “MapReduce” is the Principle for Long Video Understanding
- Group: UIUC
- Keywords: long video understanding, MapReduce, dense short clip perception, joint aggregation
- Venue: arXiv
Comments
Simple solution. First generate a short caption for each clip (the map step), then merge duplicate captions across clips (the reduce step).
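To make the two steps concrete, here is a minimal sketch in Python. It is not the paper's implementation: `split_into_clips`, `map_captions`, `reduce_captions`, and `toy_captioner` are hypothetical names, the captioner is a stand-in for a real vision-language model, and exact string matching after normalization is only one simple way to detect duplicate captions.

```python
from typing import Callable, List, Sequence


def split_into_clips(frames: Sequence[str], clip_len: int) -> List[Sequence[str]]:
    """Split a frame sequence into consecutive, non-overlapping short clips."""
    if clip_len <= 0:
        raise ValueError("clip_len must be positive")
    return [frames[i:i + clip_len] for i in range(0, len(frames), clip_len)]


def map_captions(
    clips: Sequence[Sequence[str]],
    captioner: Callable[[Sequence[str]], str],
) -> List[str]:
    """Map step: caption every clip independently of the others."""
    return [captioner(clip) for clip in clips]


def reduce_captions(captions: Sequence[str]) -> List[str]:
    """Reduce step: drop duplicate captions, keeping first-occurrence order.

    Assumption: two captions are duplicates if they match after lowercasing
    and collapsing whitespace. A real system would likely need a softer,
    semantic notion of duplication.
    """
    seen = set()
    unique = []
    for caption in captions:
        key = " ".join(caption.lower().split())
        if key not in seen:
            seen.add(key)
            unique.append(caption)
    return unique


def toy_captioner(clip: Sequence[str]) -> str:
    """Hypothetical stand-in for a captioning model: describes the first frame."""
    return f"scene: {clip[0]}"


# Frames are represented by plain labels here purely for illustration.
frames = ["kitchen", "kitchen", "kitchen", "kitchen", "garden", "garden"]
clips = split_into_clips(frames, clip_len=2)
captions = map_captions(clips, toy_captioner)
summary = reduce_captions(captions)
print(summary)
```

Because each clip is captioned independently, the map step can run in parallel; only the reduce step needs to see all captions together.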