• Usernameblankface@lemmy.world
    link
    fedilink
    arrow-up
    2
    ·
    1 年前

    You did this? I saw some examples in the “show me your best hearse if it were a ___” post, and I’m flabbergasted at the computer-based witchcraft it’s pulling off over there.

    What image generator is it using? Is it using chatgpt to expand the prompts?

    • Rikudou_Sage@lemmings.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 年前

      It’s using DALL-E, the version 3 also sometimes expands the prompt (IMO it’s using GPT in the background to do so, but I haven’t checked). The prompt is only printed below the image if it has been changed. I’ve noticed that if the prompt is long already, it doesn’t change it, but I haven’t really confirmed anything. Well, you can try it yourself, just tag the bot and it will respond with the image.

      It also falls back to DALL-E version 2 to lower the costs and there’s some daily limit (per-user and global).

      • Usernameblankface@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        1 年前

        Very interesting. So all the requests from the bot get sent through one account? Seems like it would get expensive fast as people find out that this option exists.

        • Rikudou_Sage@lemmings.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 年前

          Yeah, but there’s rate limiting in place that makes sure I won’t pay more than I’m comfortable with. I’m thinking of making it possible for users to supply their own api key.