Now that the company has some idea about how to use the data, it’s time to design a classifier. Our classifier will be very simple: given a movie and a user, it will classify the movie as either "Good" or "Bad" for this user.4. Assume all the users of the company have a very simple rule in their movie taste: they like it if Tom Cruise has the lead role. Any other data is mostly irrelevant. However, no one in the company knows about this fact. Which of the following clustering models might be able to detect this rule? Check all that apply.
A. Supervised (label: rating), with data: Director, language, genre
B. Supervised (label: rating), with data: Movie length, lead role, director
C. Unsupervised, with data: Lead role, movie length, rating
D. Unsupervised, with data: Lead role, genre, director
E. Unsupervised, with data: Number of ratings, lead role
查看答案
3. What machine learning approach should the company use for cases like Bob?
A. Supervised
B. Unsupervised
2. Bob, on the other hand, is not that much into ratings. He does watch a lot of movies, but never takes the time to rate them. For users like Bob, which of the following data can the company use to determine potential interest in a specific movie? Check all that apply.
A. Metadata of movies: actors, director, genre, etc.
B. Length of the movie
C. Popularity of the movie amongst other users
D. User login patterns
The company Internet Movies, Inc. has found wide success in their streaming movie business. After many long and successful years of delivering content, they have decided to use machine learning to make their business even more successful. Luckily, they already possess a huge dataset that has grown over years and years of user activity – but they need your help to make sense of it! Answer the following questions1. Let’s start with a simple case. Assume user Alice is a particularly good member and she makes sure to rate every movie she ever watches on the website. What machine learning approach would be better for the company to use for determining whether she would be interested in a new specific movie?
A. Supervised
B. Unsupervised
7. Are there other cluster options for this data that would lead to a smaller badness value? Do not consider the case of each data point being a separate cluster.
A. Yes
B. No