ACM Multimedia 2024, Oct 28, Melbourne
Given an audio-visual sample containing a single speaker, the task is to determine whether the video is a deepfake or real.
Given an audio-visual sample containing a single speaker, the task is to locate the timestamps [start, end] within which the manipulation occurs. The assumption here is that from the perspective of spreading misinformation.
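To make the [start, end] output format concrete, the sketch below shows one possible way to compare a predicted segment against a ground-truth one using temporal intersection over union (IoU). The text above does not say how predictions are scored, so the choice of IoU, the `Segment` type, the function name `temporal_iou`, and the use of seconds as the unit are all assumptions made for illustration.

```python
from typing import Tuple

# Hypothetical representation: a manipulated segment as (start, end) in seconds.
Segment = Tuple[float, float]


def temporal_iou(pred: Segment, truth: Segment) -> float:
    """Return the intersection over union of two [start, end] intervals."""
    p_start, p_end = pred
    t_start, t_end = truth
    if p_end < p_start or t_end < t_start:
        raise ValueError("each segment must satisfy start <= end")

    # Overlap is zero when the intervals are disjoint.
    intersection = max(0.0, min(p_end, t_end) - max(p_start, t_start))
    union = (p_end - p_start) + (t_end - t_start) - intersection

    # Two zero-length segments have no union; treat that case as no overlap.
    return intersection / union if union > 0 else 0.0
```

For example, a prediction of (1.0, 3.0) against a ground truth of (2.0, 4.0) overlaps for 1 second out of a 3-second union, giving an IoU of 1/3. Identical segments score 1.0 and disjoint segments score 0.0.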