IXC25 Image VQA
A tool that can answer questions about arbitrary images, including regular images and images of documents or presentations. It returns the answer as text.
Output
python
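import requests

url = "https://api.landing.ai/v1/tools/internlm-xcomposer2"
data = {"prompt": "{{prompt}}"}

# Open the image in binary mode; the context manager closes the file after the upload.
with open("{{path_to_image}}", "rb") as image_file:
    files = {"image": image_file}
    response = requests.post(url, files=files, data=data)

# Print the parsed JSON body of the response.
print(response.json())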
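The request above can also be wrapped in a small reusable function. The following is a minimal sketch, not part of the official client: the names `ask_image_question` and `VQA_URL`, the `session` parameter, and the 60-second timeout are assumptions made for illustration. Only the endpoint URL and the `image` and `prompt` fields come from the example above. Authentication and the exact shape of the JSON response are not described here, so the sketch sends no credentials and returns the parsed body unchanged.

```python
from pathlib import Path

# Endpoint taken from the example above.
VQA_URL = "https://api.landing.ai/v1/tools/internlm-xcomposer2"


def ask_image_question(image_path, prompt, session=None, timeout=60):
    """Post an image and a question to the endpoint and return the parsed JSON.

    `session` is a hypothetical hook: any object with a requests-style `post`
    method. When omitted, the `requests` module itself is used.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if session is None:
        import requests  # imported lazily so a stub can be passed in instead
        session = requests
    # Open the image in a context manager so the handle is always closed.
    with Path(image_path).open("rb") as image_file:
        response = session.post(
            VQA_URL,
            files={"image": image_file},
            data={"prompt": prompt},
            timeout=timeout,  # assumed value; avoids waiting forever on a stalled call
        )
    # Raise on HTTP error status codes instead of parsing an error body.
    response.raise_for_status()
    return response.json()
```

A call such as `ask_image_question("{{path_to_image}}", "{{prompt}}")` sends the same request as the original snippet, with the addition of a timeout and an explicit check of the HTTP status.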