Row Details #4592

Data:

{
  "text": "Hi!  \n**Context**:\n\n* I'm working with a food delivery company that is trying to use ChatGPT to view an image and determine whether the \"stocking job\" (how the items were placed on the shelf) was a good job or a bad job (e.g. items are facing the wrong way, there is empty space, etc.)\n* I've confirmed in ChatGPT that this is possible by creating a custom GPT, giving it image examples, and so on\n* However, now that I want to expose this externally, I'm trying to use the Assistants API, but the context window is too small to fit enough photo examples in the system prompt, and I run into rate limits every time\n\n**Where I need help:**  \nMy thought for getting around this is to use fine-tuning, but I'm not sure how to fine-tune on images. I looked at converting the images to text, but that didn't seem scalable. Is there a way I could upload each image somewhere, give it a reference, and then use that reference in the fine-tuning? This may seem like a novice question, and that is because I am a novice :)\n\n  \nAny help, guidance, or links would be much appreciated.",
  "label": "r/gpt4",
  "dataType": "post",
  "communityName": "r/GPT4",
  "datetime": "2024-04-18",
  "username_encoded": "Z0FBQUFBQm5LakwxTHIyenIzd2pSSTk0WGd4Vnd2amhPNUYxRU14ak5feHNoS0t1S3ZxR0hudzVoWUI0M0doLTBlQjZmeXBKQVdRMmRNVW1zZDQxZ0pRQjFCVkJQYVRVZVE9PQ==",
  "url_encoded": "Z0FBQUFBQm5Lak9GRkpxMzREY01YLUJIUVF6b3hsaDFCWkZlM1BDVEdjVkZZWlA3MXExdzdnQkZZMUlzbWdvTWF6anVUd0JsbWR0bWdBSG5HTXR1bi1CcE5Zd0wyMWVlNmlXNjBRNGVlbXBmTXdTV1hQU1NhS3E1bWgzZzMwVE5FcFdndE1GelFoSjdyajlILW9aVUlDOUVPbE5tR0FxYnZxWG51cGJNU1dFNWFNdWFxbHZqU0Zud3RRY2N1S2t6SXVFa3BCZDJ6TW9LeXV3OFlHcDdfU0dEZjcxUDhFd2x1dz09"
}