Landing AI
IXC25 Image VQA

A a tool that can answer any questions about arbitrary images including regular images or images of documents or presentations. It returns text as an answer to the question.

Input

What are these people doing in this image?

Who is playing guitar in this image?

Output

python
import requests url = "https://api.landing.ai/v1/tools/internlm-xcomposer2" files = { "image": open("{{path_to_image}}", "rb") } data = { "prompt": "{{prompt}}" } response = requests.post(url, files=files, data=data) print(response.json())