Landing AI
OWLv2 Image

A tool that can detect and count multiple objects given a text prompt such as category names or referring expressions on images. The categories in text prompt are separated by commas. It returns a list of bounding boxes with normalized coordinates, label names and associated probability scores.

Input

Find windows and doors from a photo

Output

python
import requests url = "https://api.landing.ai/v1/tools/text-to-object-detection" files = { "image": open("{{path_to_image}}", "rb") } data = { "prompts": [ "{{prompt1}}", "{{prompt2}}" ], "model": "owlv2" } response = requests.post(url, files=files, data=data) print(response.json())