- 7 Posts
- 12 Comments
AdComfortable1514@lemmy.world to Stable Diffusion@lemmy.dbzer0.com • EasyCaption - a right-click menu item for captioning images · English · 1 · 7 months ago
Simple and cool.
Florence 2 image captioning sounds interesting to use.
Does anyone know of other image-to-text models (apart from CLIP)?
AdComfortable1514@lemmy.world OP to Stable Diffusion@lemmy.dbzer0.com • Making a better CLIP interrogator with the FLUX T5 encoder? · English · 2 · 7 months ago
Wow, yeah, I found a demo here: https://huggingface.co/spaces/Qwen/Qwen2.5
A whole host of LLMs seem to have been released. Thanks for the tip!
I’ll see if I can turn them into something useful 👍
AdComfortable1514@lemmy.world OP to Stable Diffusion@lemmy.dbzer0.com • Making a better CLIP interrogator with the FLUX T5 encoder? · English · 2 · 7 months ago
That’s good to know. I’ll try them out. Thanks.
AdComfortable1514@lemmy.world OP to Stable Diffusion@lemmy.dbzer0.com • Making a better CLIP interrogator with the FLUX T5 encoder? · English · 1 · 7 months ago
Hmm. I mean, the FLUX model looks good, so maybe there is some magic with the T5?
I have no clue, so any insights are welcome.
T5 Huggingface: https://huggingface.co/docs/transformers/model_doc/t5
T5 paper : https://arxiv.org/pdf/1910.10683
Any suggestions on what LLM I ought to use instead of T5?
AdComfortable1514@lemmy.world OP to Stable Diffusion@lemmy.dbzer0.com • Prompt+Token % Similarity Calculator. No GPU required. · English · 1 · 8 months ago
New stuff
Paper: https://arxiv.org/abs/2303.03032
Takes only a few seconds to calculate.
Most similar suffix tokens: "vfx "
Most similar prefix tokens: “imperi-”
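The post doesn't say how the similarity percentage is computed. One common choice for comparing a prompt encoding against token encodings is cosine similarity, so here is a minimal sketch under that assumption. The vectors and the token names (`token_a` etc.) are made-up toy values for illustration, not real CLIP or T5 encodings.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical toy encodings (assumption: real ones would come from a text encoder).
prompt_vec = [1.0, 0.0, 1.0]
token_vecs = {
    "token_a": [1.0, 0.0, 1.0],
    "token_b": [0.0, 1.0, 0.0],
    "token_c": [1.0, 1.0, 0.0],
}

# Rank tokens from most to least similar to the prompt.
ranked = sorted(
    token_vecs,
    key=lambda name: cosine_similarity(prompt_vec, token_vecs[name]),
    reverse=True,
)

for name in ranked:
    score = cosine_similarity(prompt_vec, token_vecs[name])
    print(f"{name}: {score:.0%}")
```

Plain arithmetic like this needs no GPU, which fits the "only a few seconds" claim, though the actual calculator may work differently.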
AdComfortable1514@lemmy.world to Data is Beautiful@lemmy.world • Who Stops a "Bad Guy With a Gun"? · English · 1 · 8 months ago
I count casualty_rate = number_shot / (number_shot + number_subdued)
Which in this case is 22/64 = 34% casualty rate for civilians
and 98/131 = 75% casualty rate for police
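For anyone who wants to check the arithmetic, here is a quick sketch of the same formula. The subdued counts (42 and 33) are not quoted directly above; I derived them as total minus shot (64 - 22 and 131 - 98).

```python
def casualty_rate(number_shot: int, number_subdued: int) -> float:
    """Fraction of stopped attackers who were shot rather than subdued."""
    return number_shot / (number_shot + number_subdued)

# Subdued counts are derived from the totals: 64 - 22 = 42 and 131 - 98 = 33.
civilians = casualty_rate(number_shot=22, number_subdued=42)
police = casualty_rate(number_shot=98, number_subdued=33)

print(f"civilians: {civilians:.0%}")  # civilians: 34%
print(f"police: {police:.0%}")        # police: 75%
```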
AdComfortable1514@lemmy.world to Data is Beautiful@lemmy.world • Who Stops a "Bad Guy With a Gun"? · English · 61 · 8 months ago
So it’s 64 vs. 131 between work done by bystanders and work done by police?
And casualty rate is actually lower for bystanders doing the work (with their guns) than the police?
AdComfortable1514@lemmy.world to Perchance - Create a Random Text Generator@lemmy.world • Perchance censorship · English · 01 · 9 months ago
I’m not even going to address your post.
I’ve heard this lingo before.
Goddamn redditors trying to infiltrate the Perchance forums.
There’s no reasoning with this rhetoric.
AdComfortable1514@lemmy.world to Perchance - Create a Random Text Generator@lemmy.world • Deleting images / galleries · English · 1 · 11 months ago
Agree.
IMO the owner of a generator should be entitled to remove content from their image gallery.
IDGAF about people’s NSFW preferences, but I’ve occasionally had users on my generator (fusion gen) post content for the sake of harassment.
For example scat, geriatrics, bugs and gore, or just writing messages in the prompt. This happens rarely, thankfully.
But having the tools at hand to remove/hide such content would be good.
AdComfortable1514@lemmy.world OP to Perchance - Create a Random Text Generator@lemmy.world • [Request] AI Image gallery features - Add comments section below images + other stuff · English · 1 · 1 year ago
I learn new stuff about Lemmy every day. Thanks mate :)!
My solution has been to right-click and select “inspect element” to open the browser’s HTML window.
Then zoom out the generator as far as it goes, and scroll down so the entire image gallery (or at least a part of it) is rendered within the browser.
Then copy the HTML with Ctrl+C, paste it into Notepad++, and use regular expressions to extract the image prompts (and image source links) from the HTML code.
Not exactly a good fix , but it gets the job done at least.
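The same regex step can also be done in a short script instead of Notepad++. This is only a rough sketch: the markup below is invented for illustration, and I'm assuming each prompt sits in a `title` attribute that comes after `src` on an `<img>` tag. The real gallery HTML may be laid out differently, so the pattern would need adjusting.

```python
import re
from html import unescape

# Hypothetical sample of copied gallery HTML (assumption: not the real structure).
sample_html = '''
<div class="gallery">
  <img src="https://example.com/images/1.jpeg" title="a castle on a hill, oil painting">
  <img src="https://example.com/images/2.jpeg" title="portrait of a robot &amp; a cat">
</div>
'''

# Capture the image link and the prompt text from each <img> tag.
# Assumes src appears before title inside the tag.
IMG_PATTERN = re.compile(
    r'<img\s+[^>]*?src="([^"]+)"[^>]*?title="([^"]*)"',
    re.IGNORECASE,
)

def extract_prompts(html_text):
    """Return a list of (image_url, prompt) pairs found in the HTML."""
    return [(src, unescape(title)) for src, title in IMG_PATTERN.findall(html_text)]

results = extract_prompts(sample_html)
for url, prompt in results:
    print(url, "->", prompt)
```

Regex on HTML is fragile, so for anything bigger than a one-off copy-paste job a proper HTML parser would probably hold up better.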