How Are Movie Studios Using AI To Predict Movie Audiences’ Interests?

Table of Contents (click to expand)

Movie studios are using AI to predict movie audiences’ interests by scanning and labeling movie trailers and then comparing the data with other movie trailers to identify ‘similar movies’.

Patterns are what lie at the heart of artificial intelligence (AI), which is why businesses love it. Patterns help to predict the future, and predicting the future is a great way to mint money. Although at times it’s ambiguous how these things work together, film studios like 20th Century Fox are using AI to predict what films audiences want to see by partnering with AI tech giants like Google.

Now, to be successful in the movie industry, movie studios need to attract moviegoers, but attracting huge audiences is no easy task. The movie-going audience is a diverse group of people with a wide range of tastes and interests. Conventionally, studios rely on intuition and experience when deciding to invest in a particular script. This is a risky approach, especially when the story is unique or unprecedented. Movie studios have relied in the past on high-level data processes to map out customer segments and make forecasts for the future film’s box office earnings. Also, until recently, granular predictions at the segment level have remained elusive due to the dearth of requisite technological tools and algorithms. However, with the advent of machine learning, these barriers have been overcome. Artificial intelligence (AI) is such a tool helping in this iterative and complex process of matching scripts and audiences much more accurately.


Recommended Video for you:



Project Merlin

20th Century Fox, one of the most reputable movie studios in the world, is using advanced machine vision systems powered by sophisticated AI algorithms to examine trailer footage and predict the likelihood of audiences watching their movie. They do this investigation frame by frame, labeling important objects and events, and then comparing it to data generated from other film trailers. The core idea is that movies with similar sets of labels/scenes will attract similar sets of audiences. 20th Century Fox has partnered with Google to launch project Merlin—a codename for the movie attendance and prediction system.

20 century fox
(Image Credit: Flickr)

How Does It Work?

The first step for the team working on this project was identifying the right technology to power the system. The Cloud Machine Learning Engine (Cloud ML Engine), in conjunction with the TensorFlow deep learning framework, appeared to be the best choice after brainstorming by the team members. Most of the experts on the team voted for it because Cloud ML Engine is a managed service, and can automate all resource provisioning and monitoring. This implies that the team just needs to focus on building the deep learning model for Merlin, rather than devoting time in the arduous process of configuring its infrastructure.

Scanning Movie Trailer

The first step of project Merlin involves scanning the movie trailer. Consider the example of the movie Logan. Merlin scans the trailer and labels the objects, such as “tree”, “facial hair”, “car”,  “forest” etc.

Top 10 labels by frequency for Logan

While the first stage involves only scanning for labels and recording the frequency of these labels, the final data generated is much more complex, because the final data also considers how long these objects appear onscreen and specifically when they show up the video.

Labelling The Scanned Data

The temporal information obtained by the process of scanning and labeling is particularly rich, as it correlates with the film’s genre. For example, a trailer with a long close-up shot of a protagonist is more likely for a drama/romantic movie than for an action blockbuster. Similarly, a trailer with quick and furious shots is more common for an action movie. This surely holds true for Logan, with its trailer featuring plenty of slow shots of Hugh Jackman drenched in blood.

Comparing With Other Labelled Dataset

After completing the label analysis of a movie trailer (Logan in the considered example), the next stage is to compare that new analysis with the labels generated from other movie trailers to identify ‘similar movies’. There’s a presumption that there must be some overlap between the audiences for Logan and other action movies. In other words, moviegoers who have previously seen an action-packed film with a “rugged” male protagonist are inherently more likely to see Logan.

What Were The Results?

By comparing this data with analyses of other movie trailers, Merlin tries to predict what films might interest those moviegoers who saw Logan.

The table below shows the top 20 films that moviegoers who went to see Logan also watched. The right column shows Merlin’s predictions, while the left column shows the actual data (collected, based on the survey, i.e., literally asking the people directly).

Actual Data Prediction by Merlin Data Generated by Merlin
X-Men: Apocalypse True The Magnificent Seven
John Wick: Chapter 2 True Jason Bourne
Doctor Strange True John Wick: Chapter 2
Batman Vs. Superman: Dawn Of Justice True Terminator Genisys
Suicide Squad True The Legend Of Tarzan
Dead Pool False Mad Max Fury Road
Terminator Genisys True The Revenant
Mad Max: Fury Road True Independence Day: Resurgence
Ant Man False Spectre
Captain America: Civil War False Rogue One: A Star Wars Story
Star Trek Beyond True The Hunger Games: Mockingjay-Part 1
Independence Day: Resurgence True The Accountant
The Magnificent Seven True Star Trek Beyond
Avengers: Age of Ultron False Suicide Squad
Kingsman: The Secret Service False The Martian
Arrival False X-Men: Apocalypse
Split False Batman Vs. Superman: Dawn Of Justice
Rogue One: A Star Wars Story True San Andreas
Fantastic Beasts And Where To Find Them False Doctor Strange
Furious 7 False Mission Impossible: Rogue Nation

The Good

As seen in the table, Merlin guesses quite a few movies correctly, including other superhero movies like Batman Vs. Superman: Dawn of Justice, X Men: Apocalypse, and Doctor Strange. It even aptly recognizes John Wick: Chapter 2 as a bedfellow of Logan. That’s an impressive result, as John Wick is certainly not a typical superhero movie. However, it does feature a similarly weary protagonist with a rugged look. Merlin identifies all five of the top choices, albeit not in the same order of importance. Overall, it was successful in identifying more than half of the films.

The Bad

That being said, there were places where Merlin erred. Merlin predicted that The Legend of Tarzan would be a hit with ‘Logan-liking’ audiences, for example. Interestingly, neither Google nor 20th Century Fox offered any explanation for this. Now, my conjecture is that it could it have something to do with the “tree,” “forest,” and “light” found in Logan—elements that the Tarzan trailer also shows in a high concentration.

Movie studios are keen on adopting AI for these sorts of analytics, and several companies are striving hard to design systems that can predict the likelihood of a movie’s success through data mining and processing. With more data and further advances in the field of machine learning, we can expect these systems to get more accurate/efficient, since the accuracy remains a challenge… for now!

References (click to expand)
  1. How 20th Century Fox uses ML to predict a movie audience. Google LLC
  2. Hsieh, C.-K., Campo, M., Taliyan, A., Nickens, M., Pandya, M., & Espinoza, J. (2018). Convolutional Collaborative Filter Network for Video Based Recommendation Systems (Version 2). arXiv.