Advertisement
  1. SEJ
  2.  ⋅ 
  3. Generative AI

GPT-4 With Vision: Examples, Limitations, And Potential Risks

Explore examples of GPT-4 with Vision, along with its limitations and potential risks, as it rolls out to ChatGPT Plus and Enterprise users.

  • OpenAI introduced GPT-4 with Vision (GPT-4V), which builds upon GPT-4 by incorporating image input capability.
  • Examples of GPT-4 with Vision in action have appeared on social media, demonstrating its capabilities on a variety of tasks.
  • While GPT-4 with Vision promises to be groundbreaking in areas like content marketing and SEO, businesses must weigh the ethical and security concerns outlined by OpenAI.
GPT-4 With Vision: Examples, Limitations, And Potential Risks

OpenAI has made waves in the tech world again with its latest innovation: GPT-4 with Vision, or GPT-4V.

GPT-4V builds on GPT-4 and incorporates visual capabilities, allowing the model to analyze images provided by ChatGPT Plus and Enterprise subscribers.

The new feature has great potential but also carries some risks for businesses.

GPT-4 With Vision Examples

As more users gain access to the new feature, they are sharing examples of how GPT-4 with Vision works.

GPT-4 with Vision can analyze handwriting.

 

It can create code for a website using a napkin drawing.

It can analyze memes.

In addition to these examples, I ran a few simple tests.

GPT-4 with Vision can write product descriptions for your sales pages and Amazon listings.

Screenshot from ChatGPT, September 2023

It can help you get started with basic coding for a particular website design based on a screenshot.

Screenshot from ChatGPT, September 2023
Screenshot from W3Schools, September 2023

It can write creative Instagram captions with hashtag suggestions.

Screenshot from ChatGPT, September 2023

It can write an article based on data from a website or ebook, such as the State of SEO 2024.

Screenshot from ChatGPT, September 2023

As with all AI-generated content, it’s essential to review output from GPT-4 with Vision for accuracy. It still hallucinates and poses other risks.

OpenAI Reveals Potential Risks Of GPT-4V

OpenAI released a paper outlining potential risks associated with the use of GPT-4V, which include:

  • Privacy risks from identifying people in images or determining their location, potentially impacting companies’ data practices and compliance. The paper notes that GPT-4V has some ability to identify public figures and geolocate images.
  • Potential biases during image analysis and interpretation could negatively impact different demographic groups.
  • Safety risks from providing inaccurate or unreliable medical advice, specific directions for dangerous tasks, or hateful/violent content.
  • Cybersecurity vulnerabilities such as solving CAPTCHAs or multimodal jailbreaks.

Risks posed by the model have resulted in limitations, such as its refusal to offer analysis of images with people.

Screenshot from ChatGPT, September 2023
Screenshot from ChatGPT, September 2023

Overall, brands interested in leveraging GPT-4V for marketing must assess and mitigate these and other generative AI usage risks to use the technology responsibly and avoid negative impacts on consumers and brand reputation.

OpenAI’s First Partner To Prepare Image Input For “Wider Availability”

OpenAI announced that the GPT-4 with Vision model will power Be My Eyes Virtual Volunteer, a digital visual assistant designed for the visually impaired.

Although the tech is still in beta, the possibilities are tantalizing. For example, this technology could assist businesses in elevating accessibility in customer service.

Be My Eyes plans to beta-test the feature with corporate clients, emphasizing its commercial potential beyond its primary audience.

The Future Of GPT-4 With Vision

The potential applications of GPT-4 With Vision for businesses, marketers, and SEO professionals could be groundbreaking.

However, all users should remain cautious due to the potential privacy, fairness, and cybersecurity issues posed by GPT-4 with Vision and other AI models.

In addition to image input capability, OpenAI reenabled the Browse with Bing feature for web browsing through ChatGPT.


Featured image: Tada Images/Shutterstock

Category News Generative AI
ADVERTISEMENT
Kristi Hines kristhines.com

Covering the latest news in AI, search, and social media.