Dall-e image generation

pirijan · November 4, 2022, 2:03pm

Allow you to specify an image for use in a card

Depending on cost, this may be a feature that’s only available to paid users.

bentsai · November 4, 2022, 2:06pm

One use case: I was creating a presentation with kinopio this week. I wanted some interesting images to go along with what I was saying. Used the existing GIF search and that worked. Having an AI generated image would have been a cool option.

kordumb · November 4, 2022, 3:50pm

FYI on what I could find about pricing for devs.

And limited to 10 images per minute or 25 images per 5 minutes.

kordumb · November 4, 2022, 3:54pm

For me, it’d just be a cool way to decorate spaces with weird images.

I use ai as one of my potential tools when creating my dumb design graphics so besides just decorating spaces, I could see Kinopio as a little design workshop for me, where I bring in inspo, fire off a few AI images, come up with various blocks of text (ai text generation is an option on OpenAI dev API too ) all to brainstorm and collect in one place.

bentsai · November 4, 2022, 4:04pm

I haven’t done any prompt generation myself, but I’ve seen others do it. Kinopio affords making different combinations of descriptions. Imagine you have a bunch of ideas, descriptions, etc. And then you can quickly mix and match them to submit to DALL-E.

kordumb · November 4, 2022, 4:16pm

Love this idea

pirijan · November 4, 2022, 4:50pm

how do you envision the interaction of creating an ai image from multiple cards working?

kordumb · November 4, 2022, 8:47pm

Same AI button that you’d use for the Image pop-up could go in the multi-select pop-up next to the comment button…

Then it would create a new card using the words as a prompt for the image to dalle.

pirijan · November 7, 2022, 6:53pm

there are a couple reasons why I think generating images from cards directly like this might not work so well in practice:

for non-experts (most ppl) making ai prompts isn’t going to be an intuitive thing, instead they’ll need guidance
you might need to tweak your query , this’ll will be a lot awkward if you have to go back and edit your cards.
Philosophically, cards should be cards (eg thoughts/ideas), and images should be images (supplementary graphic elements). Blending these two together as a requirement for using ai generated images may make for worse spaces overall
adding another button to the multiselect dialog is hard to justify bc basically that dialog is full/big already, so only very fundamental controls should be there
there already is an existing ui workflow for adding images the ‘’ button, it’s easier to teach 1 way of creating an image than it is to teach 2 ways

My biggest concern is regarding pricing. Some back of the envelop math:

imageCost = $0.018 
userPrice = $6
userPrice / imageCost  = 333

A person would only have to generate 330 images to make them free, and could go on to make a lot more. So there may need to be a limit, but I don’t want to over-engineer limitations in the first release.

questions

If I said ‘paid users get 100 ai images a month’ would that create a mentality of scarcity where you felt like you should be using up number of images per month to get your ‘moneys worth’?
what if the limit wasn’t stated but then you hit it, with a message saying you’d have to wait n days or something to generate some more?
in both of these cases, I’d need to track when you made images. So I could maybe also track the result and query you used – would looking back on a historical record of your ai images be something you’d be interested in? (maybe it’d live in the sidebar)

pirijan · November 7, 2022, 8:30pm

the raw ui , ready to hook up to dall-e

pirijan · November 7, 2022, 8:51pm

the good thing is it looks like I can set a monthly spend limit so i wont go bankrupt or something if someone decides to go ham

kordumb · November 7, 2022, 10:41pm

I would definitely put an image cap - don’t think people would rush to use their allotment every month, but tough to know for sure.

Being able to see history would be cool.

I don’t feel strongly about this, but to clarify a few things: I don’t think the cards and image should be tied together or be blended in any many.

It would only be a way to create, not edit/modify/etc.

Once you use the multiselect → it creates an image card that would act and behave just like if you went through the standard route and the query would be edited in the same manner - so if you used five cards to be your prompt, changing them or deleting them would have no impact on the image.

But again, I don’t think this is needed.

pirijan · November 8, 2022, 2:55pm

Do you have thoughts on dalle vs stable diffusion? In my tests sd produces not as good results but try them both out and let me know what you think.

What do you think of the results from

?

The advantage of sd is that I wouldn’t need usage caps because of the much lower pricing, and possibly also wouldn’t need to limit it to paid users

pirijan · November 8, 2022, 3:33pm

re limits, as a kind of freemium thing try giving free users n(10 or 20?) free searches per acct, and then upgraded users get 100 or 50 searches / month

bentsai · November 8, 2022, 6:17pm

This feels like a freemium feature to me. I’ve not gone the rabbit hole of image generation, but Kinopio feels like a great place for people to fall down such a hole. It is a visual platform, easy to create an manipulate text. And a limited number for free accounts feels right. Enough to dip your toes and see how cool it is.

I was thinking the same thing. What were your philisophical concerns, @pirijan ?

I could see a few options with the output:

One card with the image and the prompt text
Two cards: 1) image and 2) prompt text. Maybe connect them?
One card with special rendering so that the prompt text is accessible, but not visible on the front. Maybe it’s not worth making this a special card. Folks can always take options 1 or two and swap them. I imagine a lot of people would not want the prompt text as displayed by default because the point is to show the nice generated image.

kordumb · November 8, 2022, 6:30pm

Yeah, I feel like 10 for free, 100 for paid would be a good approach. I could see it barely getting used, but I could also see people trying to take advantage, so good to have those controls.

In terms of output (from Ben’s message) I was thinking something like the third option. It’s an image card, but if you clicked the card and clicked the AI button, the prompt would still be there so you could potentially adjust and regenerate. I don’t think the two card approach (one image, one prompt) is great…

And in testing, I haven’t loved my initial tests with Stable Diffusion when doing side by sides, but since it’s more of a fun/brainstorming type tool, I don’t think it would be that big of a deal if the cost is really that much better. But my vote would still probably be for Dalle2.

pirijan · November 8, 2022, 6:54pm

the prompt would still be there so you could potentially adjust and regenerate

I can do this , but is there a time when you’d want the old prompt to not be there anymore? eg you do a search , pick an image, all is well. But then you make a new card and write something like a prompt in the card name, based on how other image searches work it would be expected/desirous that the prompt textarea would be pre-filled with the current card name

In the latter case, there will be a way to see your AI image history and copy the old prompts to use again/adjust

pirijan · November 8, 2022, 6:57pm

pirijan · November 8, 2022, 7:02pm

RE text cards and image cards being separate things:
Here, I’m referring to the Purpose of each.

Organically looking at spaces today, I haven’t seem many (or any) where I thought “If these cards were turned into a prompt then they’d make a relevant image”. I think the mindset of making cards to feed into a prompt is a very different mindset than making cards to represent thoughts, ideas, and tasks.

pirijan · November 8, 2022, 7:04pm

I agree, I could see it going either way, and because I’m saving the AI prompts and images for each user I’ll be able to statistically model trends/usage if I need to later. In the short term I’m using this data to provide the AI history ui