多模态输入
一些模型,如图像识别模型,支持多模态输入,允许您将文本与媒体文件结合使用。以下示例演示了如何提供图像:
POST /v1/chat/completions
curl https://api-platform.ope.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $YOUR_API_KEY" \
-d '{
"model": "$MODEL_ID",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": [
{
"type": "image_url",
"image_url": {"url": "$URL"} #图片URL或Data URL(base64)
},
{
"type": "image_url",
"image_url": {"url": "$URL"} #支持多张输入
},
{
"type": "text",
"text": "解释一下这个图片是什么含义"
}
]
}
]
}'
# First, install the OpenAI library:
# pip install openai
from openai import OpenAI
client = OpenAI(
api_key="$YOUR_API_KEY",
base_url="https://api-platform.ope.ai/v1/"
)
completion = client.chat.completions.create(
model="$MODEL_ID",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": [
{
"type": "image_url",
"image_url": {"url": $URL} #图片URL或Data URL(base64)
},
{
"type": "image_url",
"image_url": {"url": $URL} #支持多张输入
},
{
"type": "text",
"text": "解释一下这些图片是什么含义"
}
]}
]
)
print(completion.choices[0].message)