What's Possible Now That AI Can See?
We fed Microsoft's GPT-4V paper to an AI to generate some preliminary ideas.
The recent release of OpenAI’s GPT-4V model is arguably the most groundbreaking development in what has already been a watershed year for AI.
AI has handled visual recognition before, identifying faces and even facilitating autonomous driving. However, those earlier models were strictly optimized for narrow tasks.
The advent of GPT-4V inches us closer to artificial general intelligence and ushers in a wealth of new applications.
To better understand these potential applications, we’ll feed Microsoft’s 150-page GPT-4V paper into an AI system and have it come up with brand ideas that leverage GPT-4V’s image understanding capabilities.
Here’s how it works:
We give the AI a brief in the form of a short prompt.
The AI analyzes the GPT-4V paper to identify relevant capabilities.
The AI uses these capabilities to develop an original creative idea for our brief.
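For the technically curious, here is one way the three steps above could be sketched in Python. Treat it as a minimal illustration under stated assumptions rather than a description of the actual system: the function names, the simple keyword-overlap scoring, and the `llm` callable (a stand-in for whatever language model is used) are all hypothetical.

```python
import re
from typing import Callable, List, Set


def split_into_chunks(text: str, max_words: int = 120) -> List[str]:
    """Break the paper text into fixed-size word chunks."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def find_relevant_chunks(brief: str, chunks: List[str], top_k: int = 3) -> List[str]:
    """Step 2 (assumed approach): rank chunks by how many words they share with the brief."""
    brief_words = tokenize(brief)
    ranked = sorted(chunks, key=lambda c: len(brief_words & tokenize(c)), reverse=True)
    return ranked[:top_k]


def build_prompt(brief: str, capabilities: List[str]) -> str:
    """Combine the brief and the selected paper excerpts into one prompt."""
    excerpts = "\n".join(f"- {c}" for c in capabilities)
    return (
        f"Brief: {brief}\n"
        f"Relevant capabilities from the paper:\n{excerpts}\n"
        "Using these capabilities, propose one original creative idea."
    )


def generate_idea(brief: str, paper_text: str, llm: Callable[[str], str], top_k: int = 3) -> str:
    """Steps 1-3: take a brief, pick relevant excerpts, ask the model for an idea."""
    chunks = split_into_chunks(paper_text)
    capabilities = find_relevant_chunks(brief, chunks, top_k)
    # `llm` is a hypothetical callable: prompt string in, generated text out.
    return llm(build_prompt(brief, capabilities))
```

Passing the model in as a plain callable keeps the sketch independent of any particular AI provider; a production system would more likely use embeddings than keyword overlap to find the relevant passages.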
Here’s a video of the AI system in action in case you’re curious.
GPT-4V Use Cases
OK, enough preamble. Let’s explore some of the use cases our AI has helped us discover.
The following ideas were generated by a 🤖 with no human editing…
Coca-Cola
“Come up with an idea for Coca-Cola leveraging GPT-4’s new vision capabilities”
Portal To Play
Coca-Cola will launch an augmented reality experience called "Portal to Play" that leverages GPT-4V's image recognition to transform Coca-Cola logos found in the real world into portals to a fun virtual world. When users scan a Coke logo with their phone's camera, it will transport them to an AR game zone filled with challenges like retro arcade games, bubble poppers and dance battles. As users complete games, they'll unlock virtual prizes and badges. There will also be surprise AR moments like Coke polar bears popping out of refrigerators or virtual fireworks exploding when scanning limited-edition bottle designs. "Portal to Play" gamifies environments, drives continuous engagement through unlockable content, and showcases Coke's playful personality in an innovative way powered by groundbreaking AI technology.
Ikea
“Come up with an idea for the Ikea app that leverages GPT-4V's image understanding capabilities.”
Complete the Look
Ikea will launch the "Complete the Look" app feature that leverages GPT-4V image recognition to revolutionize furniture shopping. Users simply upload a photo of any room and the app will instantly identify and tag all Ikea products in the image. It then suggests coordinating accessories available for purchase to help complete the look. As users browse the recommendations, they can visualize the pieces in their space using augmented reality. This provides a seamless, personalized shopping experience that saves time and takes the guesswork out of pulling a room together. The "Complete the Look" feature enables customers to shop the full Ikea system digitally, driving engagement, conversion and brand loyalty. Its use of AI technology also positions Ikea as an innovative omnichannel leader.
Nike
“Come up with an idea for a brand activation from Nike that leverages GPT-4V's vision capabilities.”
Just Train It
Nike will launch "Just Train It", an interactive billboard powered by GPT-4V vision AI that provides athletes with real-time personalized training feedback. As runners pass by, computer vision will instantly analyze their form, cadence, foot strike and other biomechanics. Using natural language capabilities, the billboard will deliver audio guidance tailored to each athlete, like "Increase cadence by 5%" or "Land more on your mid-foot". LED lights will also visually highlight areas for improvement. Runners can opt-in to have their data logged, enabling the billboard to learn their strengths, weaknesses and preferences over time for an even more customized experience. By leveraging AI to provide bespoke coaching, Nike cements its reputation as a true ally in helping athletes reach their potential.
Mailchimp
“Come up with an idea for an educational campaign from Mailchimp that leverages GPT-4V's image understanding capabilities.”
Meme 101
Mailchimp presents "Meme 101 with GPT-4V," an educational social media campaign where the AI narrates a series of generated memes that humorously explain its own image understanding capabilities. In the short video posts, GPT-4V analyzes meme images featuring animals, celebrities, and pop culture references, using its computer vision to identify key elements. As it describes what it can detect, comical captions appear highlighting its literal interpretations. The AI pokes fun at itself when it misidentifies ambiguities in the memes. The series concludes with GPT-4V creating its own new meme to demonstrate its learning. An interactive AR experience also allows users to submit images for GPT-4V to caption. This highly shareable campaign entertains and educates about AI in an accessible, lighthearted way.
Airbnb
“Come up with an idea for a global campaign for Airbnb that leverages GPT-4V's image understanding capabilities.”
Perfect Match
Airbnb will launch "Perfect Match", using GPT-4V's image recognition to pair travelers with hosts based on visual compatibility. Users simply upload photos from their camera roll and social media, and our AI will analyze aesthetics, objects, scenery, interests and more to curate suggested local hosts with similar lifestyles and tastes. Going beyond basic demographics, Perfect Match uses computer vision to create deeper, more personalized connections. Listings will showcase hosts through an AI-generated profile summarizing their visual patterns. Travelers can explore recommendations and connect with their ideal host. By tapping into GPT-4V's visual intelligence, Airbnb can enhance our matching capabilities and help users find not just homes, but new kindred spirits.
Coca-Cola
“Come up with an idea for Coca-Cola leveraging GPT-4’s new vision capabilities”
Meme Yourself
Coca-Cola will launch an innovative new campaign leveraging GPT-4V's AI capabilities to create personalized, meme-inspired ads tailored to each consumer. By analyzing a user's social media activity, GPT-4V will generate unique memes featuring the consumer that promote Coke in a humorous, engaging way. For example, for a user whose posts suggest they love dogs, it could create a meme with a Coca-Cola bottle dressed up in dog clothes. The memes will be delivered through social/digital ads, creating a fun, tailored experience. To amplify buzz, Coke will prompt people to share their memes, driving viral chatter. This cutting-edge use of AI establishes Coke as a forward-thinking brand, while boosting ad relevance and likeability.
Addition is an AI research and development company for modern brands.
Read about us in the Wall Street Journal
Visit our website to learn about the work we do with brands and agencies