A follow-up query concerning the closing rating was answered appropriately, however Gemini obtained the title of the scorer of the primary landing unsuitable: The AI advised it was Johan Dotson. Dotson was proven getting a landing within the highlights with the scores at 0-0, nevertheless it was dominated out—an instance of the nuances that AI would not essentially choose up on.
Gemini did efficiently establish when the Kansas Metropolis Chiefs obtained their first factors, and even included a timestamp linking straight to the landing within the YouTube clip. It additionally obtained the title of the scorer proper. It appears Gemini is closely reliant on the commentary for sports activities clips, which is not stunning.
Summarize Video Contents
Subsequent, we tried placing Gemini up in opposition to a behind-the-scenes featurette for The Grand Budapest Resort, directed by Wes Anderson. The clip runs to four-and-a-half minutes, and Gemini fired again some replies virtually immediately: It recognized the title of the movie being talked about, and the principle beats of the clip’s narrative.
Nevertheless, it is all reliant on the audio (or the transcript) once more—there would not appear to be any evaluation of the particular video contents. The AI could not say who the speaking heads have been within the video, despite the fact that their names have been proven on display screen, and wasn’t in a position to say who the director was (despite the fact that this was additionally talked about within the video description).
On the plus facet, Gemini did do a powerful job of summing up the audio of the video. It appropriately recognized a few of the filmmaking challenges that have been talked about all through, and offered timestamps to them — from on the lookout for a set to signify the Grand Budapest, to filling it with extras.
Summarize Interviews
Lastly, we tried Google Gemini with an interview: Channel 4 within the UK chatting with Charlie Brooker and Siena Kelly concerning the newest sequence of Black Mirror (maybe acceptable for an article on AI). Gemini proved itself very succesful at selecting out the speaking factors, and including timestamps, although in fact the entire video is generally speaking.
Once more although, there is not any context about something exterior of the audio or the transcript. Gemini AI could not say the place the interview came about, or how the contributors have been appearing, or anything concerning the visuals of the video—which is value taking into account when you use it your self.
For movies the place the solutions you need are within the audio of a YouTube video, and its related transcript, Gemini works rather well at summarizing and offering correct solutions (offered the commentators point out when a landing is dominated out, in addition to when one is scored). For any sort of visible data, you are still going to have to observe the video your self.