Microsoft quietly released a system called Visual ChatGPT based on ChatGPT, which is a system that uses ChatGPT to generate drawings based on text input. The traditional "you say, I draw" systems like Stable Diffusion are well-known, but what makes V ChatGPT different?
Taking this opportunity, I decided to use the ChatGPT toolchain to interpret the newest member of the ChatGPT family.
Since we mentioned the ChatGPT toolchain, let's establish a rule: starting from this point, except for step-by-step instructions on tool usage, all text will be sourced from the ChatGPT toolchain. First of all, the original ChatGPT cannot access the internet and its data only goes up to 2021, so it doesn't have any content about Visual ChatGPT. Therefore, we need additional tools to provide it with relevant information.
Fortunately, the GitHub page of Visual ChatGPT comes with their paper, which provides detailed explanations. So, we can use the ChatPDF tool developed by @xathis to read the paper.
First, let's ask it what V ChatGPT is. Here's the generated content for "Give me a summary about Visual ChatGPT":
It looks great. But what about students who don't understand English? We can ask for a summary in Chinese.
Of course, you can also use the openai-translator developed by yetone.
However, both the English and Chinese summaries exceed the character limit for a tweet. Can we make them more concise? Yes, here is its answer:
Visual ChatGPT is a system that combines different visual models, allowing users to interact with AI systems through language and images. It supports complex visual tasks, including visual editing instructions that require collaboration between multiple AI models. The system has been validated through extensive experiments for its understanding and generation capabilities.
Let's ask for something more complex: What are the differences between Visual ChatGPT and Stable Diffusion?
Please provide a detailed explanation of the workflow in Visual ChatGPT:
Further explain the Prompt Manager in Visual ChatGPT:
Here's an example of using Visual ChatGPT:
How should I install and use Visual ChatGPT? The PDF doesn't provide any information on that.
That's all for this content. As you can see, with the help of automation, ChatGPT can replace a lot of traffic work recommended by various technologies and tools.
My Twitter: