Video-Summarization

Objective: Given the frames of any 9-minute video, generate a 1.5-minute summary. The summary is made up of the shots (continuous runs of frames with no camera cut) that receive the highest weights, based on chosen measures of face detection, people detection, block motion detection, and audio analysis.
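One way to picture the selection step is a greedy pass that keeps the highest-weighted shots until the 1.5-minute budget is used up. This is only a sketch under stated assumptions: the `Shot` class, `select_shots`, and the greedy strategy are hypothetical and are not taken from this repository.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    start: int    # index of the first frame in the shot
    end: int      # index one past the last frame in the shot
    score: float  # combined weight of the shot (hypothetical scale)


def select_shots(shots, fps, budget_seconds):
    """Greedily keep the highest-scoring shots that fit in the time budget."""
    budget_frames = int(budget_seconds * fps)
    chosen, used = [], 0
    for shot in sorted(shots, key=lambda s: s.score, reverse=True):
        length = shot.end - shot.start
        # Skip any shot that would push the summary past the budget.
        if used + length <= budget_frames:
            chosen.append(shot)
            used += length
    # Restore chronological order so the summary plays in source order.
    return sorted(chosen, key=lambda s: s.start)
```

For the stated objective, this would be called with `budget_seconds=90`. A greedy pass is simple but not optimal; a knapsack-style selection could pack the budget more tightly.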

Please see the following presentation slides:
https://docs.google.com/presentation/d/1JzAxssiVaMH7u6PvZ92mhYaOg02fOeh8pYDmhHHBkqg/edit?usp=sharing

Shot detection is done with SSIM (structural similarity), centering, and color histograms.
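To illustrate the color histogram part only, the sketch below flags a cut wherever the histograms of two consecutive frames stop overlapping. It is a minimal pure-Python example, not the repository's code: frames are assumed to be flat lists of `(r, g, b)` tuples, the function names, bin count, and threshold are hypothetical, and SSIM and centering are left out.

```python
def color_histogram(frame, bins=8):
    """Normalized per-channel histogram of a frame given as (r, g, b) pixels."""
    hist = [0.0] * (3 * bins)
    for pixel in frame:
        for channel, value in enumerate(pixel):
            # Map an 8-bit channel value (0..255) to one of `bins` buckets.
            hist[channel * bins + value * bins // 256] += 1.0
    total = float(len(frame)) or 1.0  # avoid dividing by zero on empty frames
    return [count / total for count in hist]


def histogram_similarity(h1, h2):
    """Histogram intersection over three channels, scaled to [0, 1]."""
    return sum(min(a, b) for a, b in zip(h1, h2)) / 3.0


def detect_cuts(frames, threshold=0.5):
    """Return the indices of frames that start a new shot."""
    cuts = []
    prev = None
    for i, frame in enumerate(frames):
        hist = color_histogram(frame)
        # A low overlap with the previous frame suggests a camera cut.
        if prev is not None and histogram_similarity(prev, hist) < threshold:
            cuts.append(i)
        prev = hist
    return cuts
```

Histogram comparison ignores where colors appear in the frame, which is presumably why it is paired with a structural measure such as SSIM.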

Metrics measured are:

  • Face Detection
  • People Detection
  • Block Motion Detection
  • Audio
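The four metrics have to be folded into a single weight per shot. The sketch below shows one plausible way to do that: average each metric over the frames of a shot, then take a weighted sum. The weight values, the `shot_score` function, and the assumption that each metric is already normalized to the range 0 to 1 are all hypothetical; the source does not say how the measures are balanced.

```python
# Hypothetical weights; the source does not state how the measures are balanced.
WEIGHTS = {"face": 0.3, "people": 0.2, "motion": 0.3, "audio": 0.2}


def shot_score(frame_metrics, weights=WEIGHTS):
    """Average each metric over a shot's frames, then take a weighted sum.

    frame_metrics: list of dicts, one per frame, each mapping a metric name
    to a value assumed to be normalized to [0, 1].
    """
    if not frame_metrics:
        return 0.0
    score = 0.0
    for name, weight in weights.items():
        mean = sum(m[name] for m in frame_metrics) / len(frame_metrics)
        score += weight * mean
    return score
```

Because the weights sum to 1 and each input is in [0, 1], the resulting score also stays in [0, 1], which keeps shots of different lengths comparable.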

The final video is generated by syncing the audio with the resulting summary frames.
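Keeping audio and frames aligned comes down to converting each kept frame range into the matching range of audio samples. The following is a sketch under assumptions: the function names are hypothetical, the audio is treated as a flat list of mono samples, and the frame rate and sample rate are taken to be constant.

```python
def audio_range_for_shot(start_frame, end_frame, fps, sample_rate):
    """Map a frame range [start_frame, end_frame) to an audio sample range."""
    start_sample = round(start_frame * sample_rate / fps)
    end_sample = round(end_frame * sample_rate / fps)
    return start_sample, end_sample


def summary_audio(samples, shots, fps, sample_rate):
    """Concatenate the audio samples that belong to the kept shots.

    shots: list of (start_frame, end_frame) pairs in playback order.
    """
    out = []
    for start_frame, end_frame in shots:
        a, b = audio_range_for_shot(start_frame, end_frame, fps, sample_rate)
        out.extend(samples[a:b])
    return out
```

Cutting audio at exactly the same boundaries as the frames means the summary's audio track has the same duration as its video track, up to rounding of one sample per boundary.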