Data:
{
"text": "Hi!\n**Context:**\n\n* I'm working with a food delivery company that is trying to use ChatGPT to view an image and determine whether the \"stocking job\" (how the items were placed on the shelf) was good or bad (e.g. items are facing the wrong way, there is empty space, etc.)\n* I've confirmed in ChatGPT that this is possible by creating a custom GPT and giving it image examples and so on\n* However, now that I want to expose this externally, I'm trying to use the Assistants API, but the context window is too small to fit enough photo examples in the system prompt, and I run into rate limits every time\n\n**Where I need help:**\nMy thought for getting around this is to use fine-tuning, but I'm not sure how to fine-tune on images. I looked at converting the images to text, but that didn't seem scalable. Is there a way I could upload each image somewhere, give it a reference, and then use that reference in the fine-tuning? This may seem like a novice question, and that is because I am a novice :)\n\nAny help, guidance, or links would be much appreciated.",
"label": "r/gpt4",
"dataType": "post",
"communityName": "r/GPT4",
"datetime": "2024-04-18",
"username_encoded": "Z0FBQUFBQm5LakwxTHIyenIzd2pSSTk0WGd4Vnd2amhPNUYxRU14ak5feHNoS0t1S3ZxR0hudzVoWUI0M0doLTBlQjZmeXBKQVdRMmRNVW1zZDQxZ0pRQjFCVkJQYVRVZVE9PQ==",
"url_encoded": "Z0FBQUFBQm5Lak9GRkpxMzREY01YLUJIUVF6b3hsaDFCWkZlM1BDVEdjVkZZWlA3MXExdzdnQkZZMUlzbWdvTWF6anVUd0JsbWR0bWdBSG5HTXR1bi1CcE5Zd0wyMWVlNmlXNjBRNGVlbXBmTXdTV1hQU1NhS3E1bWgzZzMwVE5FcFdndE1GelFoSjdyajlILW9aVUlDOUVPbE5tR0FxYnZxWG51cGJNU1dFNWFNdWFxbHZqU0Zud3RRY2N1S2t6SXVFa3BCZDJ6TW9LeXV3OFlHcDdfU0dEZjcxUDhFd2x1dz09"
}